Annotate-a-genome: Difference between revisions
Jump to navigation
Jump to search
imported>Weigang m (→Protocol) |
imported>Weigang m (→Protocol) |
||
Line 70: | Line 70: | ||
=Protocol= | =Protocol= | ||
# | # Obtain genome accession from the genome links and fetch sequences: | ||
<syntaxhighlight lang=" | <syntaxhighlight lang="bash" line enclose="div"> | ||
./bioseq -z 'b31_accession' -o 'genbank' > b31.gb # Reference genome for ortholog identification. Choose main, cp26, or lp54 | |||
./bioseq -z 'gb_accession' -o 'genbank' > new.gb | |||
./gb2fas -n b31.gb > b31.nuc # Extract CDS | |||
./gb2fas -n new.gb > new.nuc | |||
./bioseq -t b31.nuc > b31.pep # Translate (and remove those with internal stop codons) | |||
./bioseq -t new.nuc > new.pep | |||
</syntaxhighlight> | </syntaxhighlight> | ||
# Predict orthologs with reciprocal BLAST | # Predict orthologs with reciprocal BLAST | ||
## Prepare BLAST databases | |||
# Load into database | # Load into database | ||
# Visualize results using synteny browser | # Visualize results using synteny browser |
Revision as of 21:52, 13 February 2014
Project Goals
- Annotate and add newly sequenced Borrelia genomes to BorreliaBase
- Build an informatics pipeline for gene prediction, ortholog calls, databasing, and synteny analysis
Download genome sequences from GenBank
Strain | Species | Group | Genome Sequences | Notes |
---|---|---|---|---|
B31 | B. burgdorferi (reference genome) | Lyme Disease | Example | |
CA382 | B. burgdorferi (California) | Lyme Disease | Example | |
CA8 | B. burgdorferi (California) | Lyme Disease | Example | |
BgVir | B. garinii (Russia) | Lyme Disease | Example | |
NMJW1 | B. garinii (China) | Lyme Disease | Example | |
HLJ01 | B. afzelii (China) | Lyme Disease | Example | |
Ly | B. duttonii (Tanzania) | Relapsing Fever |
|
Example |
A1 | B. recurrentis (Ethiopia) | Relapsing Fever | Example | |
DAH | B. hermsii (Washington State) | Relapsing Fever | Example | |
91E135 | B. turicatae (Texas) | Relapsing Fever | Example | |
Achema | B. crocidurae (Mauritania) | Relapsing Fever |
|
Example |
HR1 | B. parkeri (??) | Relapsing Fever | Example | |
LB-2001 | B. miyamotoi (Northeast US) | Relapsing Fever | Example |
Protocol
- Obtain genome accession from the genome links and fetch sequences:
./bioseq -z 'b31_accession' -o 'genbank' > b31.gb # Reference genome for ortholog identification. Choose main, cp26, or lp54
./bioseq -z 'gb_accession' -o 'genbank' > new.gb
./gb2fas -n b31.gb > b31.nuc # Extract CDS
./gb2fas -n new.gb > new.nuc
./bioseq -t b31.nuc > b31.pep # Translate (and remove those with internal stop codons)
./bioseq -t new.nuc > new.pep
- Predict orthologs with reciprocal BLAST
- Prepare BLAST databases
- Load into database
- Visualize results using synteny browser