Annotate-a-genome: Difference between revisions

From QiuLab
Jump to navigation Jump to search
imported>Weigang
imported>Weigang
Line 70: Line 70:


=Protocol=
=Protocol=
# Predict genes with Glimmer if necessary
# Obtain genome accession from the genome links and fetch sequences:
<syntaxhighlight lang="php">
<syntaxhighlight lang="bash" line enclose="div">
<?php
./bioseq -z 'b31_accession' -o 'genbank' > b31.gb # Reference genome for ortholog identification. Choose main, cp26, or lp54
    $v = "string";    // sample initialization
./bioseq -z 'gb_accession' -o 'genbank' > new.gb
?>
./gb2fas -n b31.gb > b31.nuc # Extract CDS
html text
./gb2fas -n new.gb > new.nuc
<?
./bioseq -t b31.nuc > b31.pep # Translate (and remove those with internal stop codons)
    echo $v;        // end of php code
./bioseq -t new.nuc > new.pep
?>
</syntaxhighlight>
</syntaxhighlight>
# Predict orthologs with reciprocal BLAST
# Predict orthologs with reciprocal BLAST
## Prepare BLAST databases
# Load into database
# Load into database
# Visualize results using synteny browser
# Visualize results using synteny browser

Revision as of 21:52, 13 February 2014

Project Goals

  • Annotate and add newly sequenced Borrelia genomes to BorreliaBase
  • Build an informatics pipeline for gene prediction, ortholog calls, databasing, and synteny analysis

Download genome sequences from GenBank

Strain Species Group Genome Sequences Notes
B31 B. burgdorferi (reference genome) Lyme Disease Example
CA382 B. burgdorferi (California) Lyme Disease Example
CA8 B. burgdorferi (California) Lyme Disease Example
BgVir B. garinii (Russia) Lyme Disease Example
NMJW1 B. garinii (China) Lyme Disease Example
HLJ01 B. afzelii (China) Lyme Disease Example
Ly B. duttonii (Tanzania) Relapsing Fever Example
A1 B. recurrentis (Ethiopia) Relapsing Fever Example
DAH B. hermsii (Washington State) Relapsing Fever Example
91E135 B. turicatae (Texas) Relapsing Fever Example
Achema B. crocidurae (Mauritania) Relapsing Fever Example
HR1 B. parkeri (??) Relapsing Fever Example
LB-2001 B. miyamotoi (Northeast US) Relapsing Fever Example

Protocol

  1. Obtain genome accession from the genome links and fetch sequences:
./bioseq -z 'b31_accession' -o 'genbank' > b31.gb # Reference genome for ortholog identification. Choose main, cp26, or lp54
./bioseq -z 'gb_accession' -o 'genbank' > new.gb
./gb2fas -n b31.gb > b31.nuc # Extract CDS
./gb2fas -n new.gb > new.nuc
./bioseq -t b31.nuc > b31.pep # Translate (and remove those with internal stop codons)
./bioseq -t new.nuc > new.pep
  1. Predict orthologs with reciprocal BLAST
    1. Prepare BLAST databases
  2. Load into database
  3. Visualize results using synteny browser