Summer 2021: Difference between revisions
Jump to navigation
Jump to search
imported>Weigang |
imported>Weigang |
||
Line 15: | Line 15: | ||
* Questions & Goals: | * Questions & Goals: | ||
** Generalized algorithms for antigen with arbitrary tree shape | ** Generalized algorithms for antigen with arbitrary tree shape | ||
*** Data set 1. Neutral evolution (with exponentially distributed branch lengths). Binary strings (L=100 bits) evolved from a coalescent tree of 20 leaves. Simulated with <code>rcoal(20); rTraitDisc()</code>. [[Monte_Carlo_Club#Summer_Project_1._Systems_evolution_of_biofilm.2Fswarming_pathway_.28with_Dr_Joao_Xavier_of_MSKCC.29|code from previous work]] | |||
*** Data set 2. Two major clades. HA sequences from fluB | |||
*** Data set 3. Four major clades. Dengue | |||
*** Data set 4. Star-shaped tree, driven by recombination. OspC | |||
*** Data set 5. Multiple major clades. vls cassette in Lyme species | |||
** Combination algorithms | ** Combination algorithms | ||
** Naive Bayes models to integrate immunogenicity data | ** Naive Bayes models to integrate immunogenicity data |
Revision as of 18:55, 3 June 2021
Project 1. Borrelia genomics
- Participants
- Questions & Goals:
- Upgrade database, genome pipeline, and website (Lia)
- Phylogeography & evolutionary maintenance of divided genome (Saymon)
- vls evolution (with simulation) & development of immunoflorescence microsopy methods(Lily)
- Reading list
- Schward et al (2021). Multipartite Genome of Lyme Disease Borrelia: Structure, Variation and Prophages
- Stevenson & Seshu (2018). Regulation of Gene and Protein Expression in the Lyme Disease Spirochete
Project 2. Design algorithms for vaccines
- Participants:
- Dr Saad Mneimneih (CS Department)
- Brian
- Questions & Goals:
- Generalized algorithms for antigen with arbitrary tree shape
- Data set 1. Neutral evolution (with exponentially distributed branch lengths). Binary strings (L=100 bits) evolved from a coalescent tree of 20 leaves. Simulated with
rcoal(20); rTraitDisc()
. code from previous work - Data set 2. Two major clades. HA sequences from fluB
- Data set 3. Four major clades. Dengue
- Data set 4. Star-shaped tree, driven by recombination. OspC
- Data set 5. Multiple major clades. vls cassette in Lyme species
- Data set 1. Neutral evolution (with exponentially distributed branch lengths). Binary strings (L=100 bits) evolved from a coalescent tree of 20 leaves. Simulated with
- Combination algorithms
- Naive Bayes models to integrate immunogenicity data
- Natural language models to improve structural stability (see Project 4 below)
- Generalized algorithms for antigen with arbitrary tree shape
- Reading list
Project 3. HIV compartmentalized evolution
- Participants
- Lily
- Questions and goals
- Do HIV evolve cell type tropisms within the host? Specifically, the Neural(N)-tropism vs T-cell(T)-tropism?
- Build a classifier of N-tropism HIV subtypes
- A presentation for an HIV conference in October
- Reading list
- HIV compartmentalized evolution: Evering et al (2014)
- Data sets
- ~500 sequences of env genes from 15 patients
- 2nd time point single-cell genome sequences for some of the patients
- Experimentally verified N-tropism subtypes
- Approach
- Evolutionary mechanisms: mutation, recombination, and test of adaptive selection
- Evolutionary rates & signature (BEAST)
Project 4. Natural Language models of proteins
- Participants
- Questions & Goals
- Learn, implement, and compare the existing tools
- Fine-tuning for OspC, to be integrated with the centroid algorithm
- 2nd-generation centroid design: k-means algorithm (with applications to vls, Dengue, flu B)