Computational Genomics (KIZ, Fall 2024): Difference between revisions

From QiuLab
Jump to navigation Jump to search
No edit summary
 
(15 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[File:Banner-comp-genomics.png|center]]
[[File:Banner-comp-genomics.png|800px|center]]
<center>'''Computational Genomics'''</center>
<center>Sept, Oct & Nov of 2024</center>
<center>Sept, Oct & Nov of 2024</center>
<center>'''Guest Instructor:''' Weigang Qiu, Ph.D.<br>Professor, Department of Biological Sciences, City University of New York, Hunter College & Graduate Center<br>Adjunct Faculty, Department of Physiology and Biophysics,
<center>'''Guest Instructor:''' Weigang Qiu, Ph.D.<br>Professor, Department of Biological Sciences, City University of New York, Hunter College & Graduate Center<br>Adjunct Faculty, Department of Physiology and Biophysics,
Line 11: Line 10:
----
----
==Course Overview==
==Course Overview==
Welcome to BioMedical Genomics, a computer workshop for advanced undergraduates and graduate students. A genome is the total genetic content of an organism. Driven by breakthroughs such as the decoding of the first human genome and next-generation DNA -sequencing technologies, biomedical sciences are undergoing a rapid and irreversible transformation into a highly data-intensive field.  
Welcome to Computational Genomics, a 9-week computer workshop for graduate students. A genome is the total genetic content of an organism. Driven by breakthroughs such as the decoding of the first human genome and next-generation DNA -sequencing technologies, biomedical sciences are undergoing a rapid and irreversible transformation into a highly data-intensive field.  


Genome information is revolutionizing virtually all aspects of life sciences including basic research, medicine, and agriculture. Meanwhile, use of genomic data requires life scientists to be familiar with concepts and skills in biology, computer science, as well as data analysis.  
Genome information is revolutionizing virtually all aspects of life sciences including basic research, medicine, and agriculture. Meanwhile, use of genomic data requires life scientists to be familiar with concepts and skills in biology, computer science, as well as data analysis.  
Line 21: Line 20:
==Learning goals==
==Learning goals==
By the end of this course successful students will be able to:  
By the end of this course successful students will be able to:  
* Describe next-generation sequencing  (NGS) technologies & contrast it with traditional Sanger sequencing
* Use Linux commands & compose simple shell scripts to automate a bioinformatics pipeline
* Explain applications of NGS technology including pathogen genomics, cancer genomics, human genomic variation, transcriptomics, meta-genomics, epi-genomics, and microbiome.
* Program in Python for parsing texts and simulating evolution
* Visualize and explore genomics data using RStudio
* Visualize data and perform statistical analysis using R/RStudio
* Replicate key results using a raw data set produced by a primary research paper
* Compose a bioinformatics research report


==Web Links==
==Web Links==
Line 42: Line 41:


==Course Schedule==
==Course Schedule==
===Week 1, Tuesday, Sep 24, 2024===
===Week 1, Thursday, Oct 10, 2024===
* Computer setup: Linux terminals
* Computer setup: Linux terminals; Install mini-conda: https://docs.anaconda.com/miniconda/
* <span style="color: blue">Linux Tutorial: Linux commands</span>
* <span style="color: blue">Pre-test 1: Tree-thinking Skills</span>
* Assignment 1.
* <span style="color: blue">Pre-test 2: Genomics & Data Science</span>
===Week 2, Tuesday, Oct 01, 2024===
 
* Computer setup: conda; jupyter-notebook
===Week 2, Thursday, Oct 17, 2024===
* <span style="color: green">Python Tutorial 1: Basic Python</span>
* <span style="color: green">Linux Tutorial: BpWrapper Toolkit (https://github.com/bioperl/p5-bpwrapper)</span>
* Assignment 1
===Week 3, Thursday, Oct 24, 2024===
* <span style="color: red">Quiz #1</span>
* <span style="color: green">Python Tutorial I: Advanced Python</span>
* Assignment 2
* Assignment 2
===Week 3, Tuesday, Oct 08, 2024===
===Week 4, Thursday, Oct 31, 2024===
* <span style="color: red">Quiz #1</span>
* <span style="color: green">Python Tutorial II: Regular expression & scientific computing with Python</span>
* <span style="color: green">Python Tutorial 2: Advanced Python</span>
* Assignment 3
* Assignment 3
===Week 4, Tuesday, Oct 15, 2024===
===Week 5, Thursday, Nov 7, 2024===
* <span style="color: green">Python Tutorial 3: Regular expression & scientific computing with Python</span>
* <span style="color: red">Quiz #2</span>
* Computer setup: R & RStudio
* <span style="color: orange">R Tutorial 1: Basic R</span>
* Assignment 4
* Assignment 4
===Week 5, Tuesday, Oct 22, 2024===
 
* <span style="color: red">Quiz #2</span>
===Week 6, Thursday, Nov 14, 2024===
* <span style="color: orange">Computer setup: R & RStudio</span>
* <span style="color: orange">R Tutorial 2: Data visualization & basic statistics</span>
* R Tutorial 1: Basic R
* Assignment 5
* Assignment 5
===Week 6, Tuesday, Oct 29, 2024===
===Week 7, Thursday, Nov 21, 2024===
* <span style="color: orange">R Tutorial 2: Data visualization & basic statistics</span>
* Assignment 6
===Week 7, Tuesday, Nov 05, 2024===
* <span style="color: red">Quiz #3</span>
* <span style="color: red">Quiz #3</span>
* <span style="color: cyan">Genomics Tutorial 1: Cluster analysis & scRNA analysis</span>
* <span style="color: #33ff8a">Genomics Tutorial 1: Cluster analysis & scRNA analysis</span>
* Final report (draft 1: Background, Hypothesis, Significance, Material & Methods)
* Final report (draft 1: Background, Hypothesis, Significance, Material & Methods)
===Week 8, Tuesday, Nov 12, 2024===
===Week 8, Thursday, Nov 28, 2024===
* <span style="color: cyan">Genomics Tutorial 2. NGS data analysis</span>
* <span style="color: #33ff8a ">Genomics Tutorial 2. NGS data analysis</span>
* Final report (draft 2: Results & Discussion)
* Final report (draft 2: Results & Discussion)
===Week 9, Tuesday, Nov 19, 2024===
 
* <span style="color: cyan">Genomics Tutorial 3. Monte Carlo simulations of genome evolution</span>
===Week 9, Thursday, Dec 5, 2024===
* <span style="color: #33ff8a ">Genomics Tutorial 3. Monte Carlo simulations of genome evolution</span>
* Final report (draft 3: Conclusions, future directions, reference)
* Final report (draft 3: Conclusions, future directions, reference)
* <span style="color: red">Final report due Nov 26</span>
* <span style="color: red">Final report due Nov 26</span>
===Week 10, Thursday, Dec 10, 2024===
* Final presentation

Latest revision as of 14:30, 23 August 2024

Banner-comp-genomics.png
Sept, Oct & Nov of 2024
Guest Instructor: Weigang Qiu, Ph.D.
Professor, Department of Biological Sciences, City University of New York, Hunter College & Graduate Center
Adjunct Faculty, Department of Physiology and Biophysics, Institute for Computational Biomedicine, Weil Cornell Medical College
Office: B402 Belfer Research Building, 413 East 69th Street, New York, NY 10021, USA
Email: wqiu@hunter.cuny.edu
Lab Website: https://wiki.genometracker.org


Host: Dr Yun Gao, Ph.D.
Kunming Institute of Zoology (KIZ)

Course Overview

Welcome to Computational Genomics, a 9-week computer workshop for graduate students. A genome is the total genetic content of an organism. Driven by breakthroughs such as the decoding of the first human genome and next-generation DNA -sequencing technologies, biomedical sciences are undergoing a rapid and irreversible transformation into a highly data-intensive field.

Genome information is revolutionizing virtually all aspects of life sciences including basic research, medicine, and agriculture. Meanwhile, use of genomic data requires life scientists to be familiar with concepts and skills in biology, computer science, as well as data analysis.

This workshop is designed to introduce computational analysis of genomic data through hands-on computational exercises, using published studies.

The pre-requisites of the course are college-level courses in molecular biology, cell biology, and genetics. Introductory courses in computer programming and statistics are preferred but not strictly required.

Learning goals

By the end of this course successful students will be able to:

  • Use Linux commands & compose simple shell scripts to automate a bioinformatics pipeline
  • Program in Python for parsing texts and simulating evolution
  • Visualize data and perform statistical analysis using R/RStudio
  • Compose a bioinformatics research report

Web Links

Assignments, Quizzes, and Final Report

Student performance will be evaluated by attendance, three (3) quizzes, six (6) assignments, and a final report:

  • Attendance & participation: 30 pts
  • Assignments: 6 x 10 = 60 pts
  • Open-Book Quizzes: 3 x 20 pts = 60 pts
  • Final presentation: 50 pts

Total: 200 pts

Course Schedule

Week 1, Thursday, Oct 10, 2024

Week 2, Thursday, Oct 17, 2024

Week 3, Thursday, Oct 24, 2024

  • Quiz #1
  • Python Tutorial I: Advanced Python
  • Assignment 2

Week 4, Thursday, Oct 31, 2024

  • Python Tutorial II: Regular expression & scientific computing with Python
  • Assignment 3

Week 5, Thursday, Nov 7, 2024

  • Quiz #2
  • Computer setup: R & RStudio
  • R Tutorial 1: Basic R
  • Assignment 4

Week 6, Thursday, Nov 14, 2024

  • R Tutorial 2: Data visualization & basic statistics
  • Assignment 5

Week 7, Thursday, Nov 21, 2024

  • Quiz #3
  • Genomics Tutorial 1: Cluster analysis & scRNA analysis
  • Final report (draft 1: Background, Hypothesis, Significance, Material & Methods)

Week 8, Thursday, Nov 28, 2024

  • Genomics Tutorial 2. NGS data analysis
  • Final report (draft 2: Results & Discussion)

Week 9, Thursday, Dec 5, 2024

  • Genomics Tutorial 3. Monte Carlo simulations of genome evolution
  • Final report (draft 3: Conclusions, future directions, reference)
  • Final report due Nov 26

Week 10, Thursday, Dec 10, 2024

  • Final presentation