BioMed-R-2022: Difference between revisions

From QiuLab
Jump to navigation Jump to search
imported>Weigang
imported>Weigang
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
<center>'''BIOL47120 Biomedical Genomics II'''</center>
<center>'''BIOL47120 Biomedical Genomics II'''</center>
<center>Spring 2022, Saturdays 9-11:30 noon</center>
<center>Spring 2022, Saturdays 9-11:30 noon</center>
<center>[https://huntercollege.zoom.us/j/7624906348 Synchronous Zoom Session, with Meeting ID: 762 490 6348]</center>
<center>[https://us02web.zoom.us/j/3573613633 Synchronous Zoom Session, with Meeting ID: 357 361 3633]</center>
<center>'''Instructor:''' Weigang Qiu, Ph.D., Professor, Department of Biological Sciences, Hunter College, CUNY; '''Email:''' wqiu@hunter.cuny.edu</center>
<center>'''Instructor:''' Weigang Qiu, Ph.D., Professor, Department of Biological Sciences, Hunter College, CUNY; '''Email:''' wqiu@hunter.cuny.edu</center>
<center>'''Office:''' B402 Belfer Research Building, 413 East 69th Street, New York, NY 10021, USA; '''Office hour''': Wed 11-1 (online with Zoom, see link above); In-person by appointment</center>
<center>'''Office:''' B402 Belfer Research Building, 413 East 69th Street, New York, NY 10021, USA; '''Office hour''': Wed 11-1 (online with Zoom, see link above); In-person by appointment</center>
Line 81: Line 81:


===Jan 29, 2022===
===Jan 29, 2022===
* Introduction
* Introduction to NGS (slides available on Blackboard)
* R Tutorial 1: Use interface, basic operations, load data. (slides available on Blackboard)
* Literature search (by groups)
{| class="wikitable sortable mw-collapsible"
* Assignment 1 (Due 2/4/2022. Friday 8pm EST). Upload your slide on Blackboard
! In-class Exercise & Assignment 1 (15 pts)
|-
|
* (5 pts, Due in-class) Transform the following "untidy/wide" table into a "tidy/tall" table (print a hard copy)
<pre>
PropertyName,Density_250m,Density_500m,Density_1000m
HighbridgePark,0.006561319,0.009462031,0.010578611
BronxRiverParkway,0.001318749,0.001978858,0.002652118
CrotonaPark,0.009412087,0.01164712,0.01202321
ClaremontPark,0.016391948,0.019972485,0.020350481
VanCortlandtPark,0.000550151,0.000979312,0.001372675
</pre>
* (10 pts, Due 2/7) Make a single slide of a primary research paper using next-generation sequencing (NGS) technologies, show the following
** proper citation (authors, title, year, journal, URL)
** NGS method (Illumina, PacBio, or NanoPore)
** NGS application (genomics, cancer, transcriptome, microbiome, proteome, metagenomics, human variation, etc)
** a key figure, with a caption explaining x-axis, y-axis, samples, experiments
** raw data table (show first few columns and first few rows)
** for example, a student has worked on tissue regeneration, the search in PubMed with key words "regeneration zebra fish transcriptome" found the following primary paper as the best because of the high quality of journal and the availability of raw data: https://www.ncbi.nlm.nih.gov/pubmed/28096348
** Papers to AVOID:
*** Reviews (not original research paper): e.g.,[https://pubmed.ncbi.nlm.nih.gov/30089861/ a review paper (usually appear first on PubMed; SKIP)]
*** Original research, but no computer readable raw data: e.g., [https://www-ncbi-nlm-nih-gov.proxy.wexler.hunter.cuny.edu/pmc/articles/PMC7737787/ Raw data not readable (PDF is not computer-readable)]
|}


===Feb 5, 2022===
===Feb 5, 2022===
* Introduction to NGS: (slides available on Blackboard)
* Introduction to NGS: (slides available on Blackboard)
* 1-slide presentations on Next-Generation Sequencing Technologies (Group I)
* 1-slide presentations on NGS
* R Tutorial, Part 2. Data manipulation with dplyr. Slides: [[File:R-tutorials-2.pdf|thumbnail]]
* R Tutorial, Part 1
{| class="wikitable sortable mw-collapsible"
* Assignment 2. Submit R code on Blackboard
! In-class Exercise 2 (10 pts; Due in class)
|-
|
* Show R commands for the following operations
** load the "tidyverse" library
** load the "iris" data
** Select columns "Sepal.Length" & "Species"
** Filter rows 2 through 10
** Add a column "logSepalLength" by taking the logarithm of the said column
** Calculate mean and standard deviation of Petal.Length in each species
** Save all commands in a script "in-class-ex-2.R"
|}
* Assignment 1 Due next day


===Feb 12, 2022===
===Feb 12, 2022===
Line 131: Line 95:


===Feb 19, 2022===
===Feb 19, 2022===
* R Tutorial: Part 4. BioStat (chi-square & t-test) Lecture slides: [[File:R-tutorial-4.pdf|thumbnail]]
* R Tutorial: Part 2.  
* Presentation
* Assignment 3 (10 pts). R Markdown upload (by Wed 8pm)
* Assignment 3 (10 pts). R Markdown upload (by Wed 8pm)


===Feb 26, 2022===
===Feb 26, 2022===
* Review
* TBD
* Quiz (30 pts)
** NGS slides & R tutorial Slides
** Excluding the four statistical tests
** Openbook but due in class (on Blackboard)


===March 5, 2022===
===March 5, 2022===
* R Tutorial: Part 4. BioStat (regression & ANOVA) [[File:R-tutorial-5.pdf|thumbnail]]
* TBD
* Paper review
* Assignment 4 (10 pts). Submit R markdown on Blackboard


===March 12, 2022===
===March 12, 2022===
* Paper review
* TBD
* Review for mid-term exam: 6 PDF presentations (intro to NGS & 5 R-tutorials)


===March 19, 2022===
===March 19, 2022===
* Mid-term
* Mid-term
** Focus on four statistical tests (with visualization
** Submit sectioned R markdown PDF: [http://diverge.hunter.cuny.edu/~weigang/Rmarkdown-template.Rmd R markdown template (by Hector)]


===March 26, 2022===
===March 26, 2022===
Line 172: Line 126:


===April 30, 2022===
===April 30, 2022===
* Group presentation #3. R Markdown & Data analysis
* TBD


===May 7, 2022===
===May 7, 2022===
* Consultation by appointment (no live session)
* TBD


===May 14, 2022===
===May 14, 2022===
* Consultation by appointment (no live session)
* TBD
* '''Submit your Teacher's Evaluation''', using either:
* '''Submit your Teacher's Evaluation''', using either:
** Personal computer at [http://www.hunter.cuny.edu/te www.hunter.cuny.edu/te]; or,
** Personal computer at [http://www.hunter.cuny.edu/te www.hunter.cuny.edu/te]; or,

Latest revision as of 21:31, 29 January 2022

BIOL47120 Biomedical Genomics II
Spring 2022, Saturdays 9-11:30 noon
Synchronous Zoom Session, with Meeting ID: 357 361 3633
Instructor: Weigang Qiu, Ph.D., Professor, Department of Biological Sciences, Hunter College, CUNY; Email: wqiu@hunter.cuny.edu
Office: B402 Belfer Research Building, 413 East 69th Street, New York, NY 10021, USA; Office hour: Wed 11-1 (online with Zoom, see link above); In-person by appointment
MA plot Volcano plot Heat map
fold change (y-axis) vs. total expression levels (x-axis)
p-value (y-axis) vs. fold change (x-axis)
genes significantly down or up-regulated (at p<1e-4)

Course Overview

Welcome to Introductory BioMedical Genomics, a seminar course for advanced undergraduates and graduate students. A genome is the total genetic content of an organism. Driven by breakthroughs such as the decoding of the first human genome and rapid DNA and RNA-sequencing technologies, biomedical sciences are undergoing a rapid & irreversible transformation into a highly data-intensive field, that requires familiarity with concepts in both biology, computational, and data sciences.

Genome information is revolutionizing virtually all aspects of life sciences including basic research, medicine, and agriculture. Meanwhile, use of genomic data requires life scientists to be familiar with concepts and skills in biology, computer science, as well as statistics.

This workshop is designed to introduce computational analysis of genomic data through hands-on computational exercises. Students are expected to be able to replicate key results of data analysis from published studies.

The pre-requisites of the course are college-level courses in molecular biology, cell biology, and genetics. Introductory courses in computer programming and statistics are preferred but not strictly required.

Learning goals

By the end of this course successful students will be able to:

  • Describe next-generation sequencing (NGS) technologies & contrast it with traditional Sanger sequencing
  • Explain applications of NGS technology including pathogen genomics, cancer genomics, human genomic variation, transcriptomics, meta-genomics, epi-genomics, and microbiome.
  • Visualize and explore genomics data using R & RStudio
  • Replicate key results using a raw data set produced by a primary research paper

A sample of original NGS paper with data sets

Web Links

Quizzes and Exams

Student performance will be evaluated by attendance, weekly assignments, quizzes, and a final report in R Markdown:

  • Attendance & In-class participation: 100 pts
  • Assignments: 5 x 10 = 50 pts
  • Quizzes: 2 x 25 pts = 50 pts
  • Mid-term: 50 pts
  • Final presentation & report: 50 pts

Total: 300 pts

Tips for Success

To maximize the your experience we strongly recommend the following strategies:

  • Attendance is required. Video camera on is preferred.
  • Follow the directions for efficiently, finding high-impact papers, reading science research papers and preparing presentations.
  • Read the papers, watch required videos and do the exercises regularly, long before you attend class.
  • Attend all classes, as required. Late arrival results in loss of points.
  • Keep up with online exercises. Don’t wait until the due date to start tasks.
  • Take notes or annotate slides while attending the lectures.
  • Listen actively and participate in class and in online discussions.
  • Review and summarize material within 24 hrs after class.
  • Observe the deadlines for submitting your work. Late submissions incur penalties.
  • Put away cell phones, do not TM, email or play computer games in class.

Hunter/CUNY Policies

  • Policy on Academic Integrity

Hunter College regards acts of academic dishonesty (e.g., plagiarism, cheating on homework, online exercises or examinations, obtaining unfair advantage, and falsification of records and official documents) as serious offenses against the values of intellectual honesty. The College is committed to enforcing the CUNY Policy on Academic Integrity, and we will pursue cases of academic dishonesty according to the Hunter College Academic Integrity Procedures. Students will be asked to read this statement before exams.

  • ADA Policy

In compliance with the American Disability Act of 1990 (ADA) and with Section 504 of the Rehabilitation Act of 1973, Hunter College is committed to ensuring educational parity and accommodations for all students with documented disabilities and/or medical conditions. It is recommended that all students with documented disabilities (Emotional, Medical, Physical, and/or Learning) consult the Office of AccessABILITY, located in Room E1214B, to secure necessary academic accommodations. For further information and assistance, please call: (212) 772- 4857 or (212) 650-3230.

  • Syllabus Policy

Except for changes that substantially affect implementation of the evaluation (grading) statement, this syllabus is a guide for the course and is subject to change with advance notice, announced in class or posted on Blackboard.

Course Schedule

Jan 29, 2022

  • Introduction to NGS (slides available on Blackboard)
  • Literature search (by groups)
  • Assignment 1 (Due 2/4/2022. Friday 8pm EST). Upload your slide on Blackboard

Feb 5, 2022

  • Introduction to NGS: (slides available on Blackboard)
  • 1-slide presentations on NGS
  • R Tutorial, Part 1
  • Assignment 2. Submit R code on Blackboard

Feb 12, 2022

  • No class (holiday break)

Feb 19, 2022

  • R Tutorial: Part 2.
  • Assignment 3 (10 pts). R Markdown upload (by Wed 8pm)

Feb 26, 2022

  • TBD

March 5, 2022

  • TBD

March 12, 2022

  • TBD

March 19, 2022

  • Mid-term

March 26, 2022

  • TBD

April 2, 2022

  • TBD

April 9, 2022

  • TBD

April 16, 2020

  • No class (Spring break)

April 23, 2022

  • TBD

April 30, 2022

  • TBD

May 7, 2022

  • TBD

May 14, 2022

May 21, 2022

  • 10 pm: Final report Due (Blackboard submission)