QuBi/modules/biol203-geno-pheno-association: Difference between revisions

From QiuLab
Jump to navigation Jump to search
imported>Lab
imported>Lab
No edit summary
Line 38: Line 38:
</center>
</center>


===Test association at Locus A===
===Test association with locus A===
Following the above two examples, perform both the genotype and allele association tests using the class data.
Following the above two examples, perform both the genotype and allele association tests using the class data.
<center>
<center>
Line 69: Line 69:
</center>
</center>


===Test association at Locus B===
===Test association with Locus B===
Table 4a. Genotype counts at Locus B for each phenotype
Table 4a. Genotype counts at Locus B for each phenotype
<center>
<center>
Line 100: Line 100:
==Web Exercise. Search for gene information using NCBI online databases==
==Web Exercise. Search for gene information using NCBI online databases==
# Point your browser to the [http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&BLAST_SPEC=OGP__9606__9558&LINK_LOC=blasthome NCBI Human Genome Resource] page
# Point your browser to the [http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&BLAST_SPEC=OGP__9606__9558&LINK_LOC=blasthome NCBI Human Genome Resource] page
# Copy and paste sequence at Locus A into the first text box (add a FASTA heading, e.g., ">Locus_A")
# Copy and paste sequence provided on Blackboard- this is the sequence of the gene associated with the taster phenotype
# Expand the "Algorithm parameters" tab and change "Expect threshold" to 0.00001 (10e-5). Define "expect value" in your owns words after watching the linked Youtube video.
# Expand the "Algorithm parameters" tab and change "Expect threshold" to 0.00001 (10e-5). Define "expect value" in your owns words after watching the linked Youtube video.
# Press "BLAST". Copy & Paste the top hit in your final lab report.
# Press "BLAST". Copy & Paste the top hit in your final lab report.
# Repeat the above for the sequence at Locus B. Copy and paste the top hit in your final lab report.
# Briefly describe the function of the gene based on information gathered on the locus page


==Lab Report IV==
==Lab Report IV==
Line 109: Line 109:
## A printout of contingency test for Locus A, including expected counts, observed counts, chi-square statistic, degree of freedom, and p values
## A printout of contingency test for Locus A, including expected counts, observed counts, chi-square statistic, degree of freedom, and p values
## Same as above for Locus B
## Same as above for Locus B
## A printout of alignment for the top BLAST hits for Locus A sequence
## A printout of alignment for the top BLAST hits for the sequence provided
## Same as above for Locus B
Additional questions to include in your report:
Additional questions to include in your report:
## State what is the ''null hypothesis'' in a chi-square test & what is the ''alternative hypothesis''
## State what is the ''null hypothesis'' in a chi-square test & what is the ''alternative hypothesis''
Line 116: Line 115:
## What can you conclude when p-value is '''below''' the threshold of significance (e.g., p = 0.05)?
## What can you conclude when p-value is '''below''' the threshold of significance (e.g., p = 0.05)?
## What would you conclude when p-value is '''above''' the critical value?
## What would you conclude when p-value is '''above''' the critical value?
## Which of the two genes shows significant ''genotype'' association with the PTC Taster/Non-Taster phenotype?
## Is there a statistically significant association between one of the alleles tested and the Taster phenotype?  
## Is there a statistically significant association between the ''alleles'' and the Taster phenotype?  
## Which genotype is over-represented in the Non-Tasters?
## Which genotype is over-represented in the Non-Tasters?
## Which allele is over-represented in the Non-Tasters?
## Which allele is over-represented in the Non-Tasters?
## Are there exceptions? What are possible causes for exceptions?
## Are there exceptions? What are possible causes for exceptions?
## Define e-value in a BLAST search
## Define e-value in a BLAST search

Revision as of 15:48, 9 May 2018

BIOL 203 Bioinformatics Exercises for Lab 13

Test phenotype-genotype association

Introduction: GWAS & Contingency Test

Genome-Wide Association Study (GWAS) is a method for mapping phenotypes to genotypes. In a typical GWAS study, frequencies of alleles (e.g., C or T at position 785) are determined in a sample of affected individuals (the "cases") as well as in a sample of unaffected individuals (the "controls"). For example, the following table shows results of a hypothetical case-control study at a locus segregating with two alleles (C and T):

Table 1. Sample Genotype Frequencies

T/T T/C C/C Total
Case 0 24 127 ?
Control 9 68 114 ?
Total ? ? ? ?

Association between the genotype and the phenotype could be assessed with a contingency table analysis. In this case, Χ2 = 26.4, p=0.0005, suggesting a significant association between genotypes and diseases. (By comparing the expected and observed counts, one could conclude that the C/C genotypes are over-represented in disease cases.)

  1. Perform an online contingency table analysis using the hypothetical data in Table 1.
  1. Deriving from Table 1, fill the following table with allele counts. Then perform a 2-by-2 contingency table analysis using the link above. Is there a statistically significant association between alleles and disease phenotype? Which allele (C or T) is over-represented in (i.e., statistically associated with) disease cases?

Table 2. Sample Allele Frequencies

T C Total
Case ? ? ?
Control ? ? ?
Total ? ? ?

Test association with locus A

Following the above two examples, perform both the genotype and allele association tests using the class data.

Table 3a. Genotype counts at Locus A

A1/A1 A1/A2 A2/A2 Row Sum
Taster ? ? ? ?
Non-Taster ? ? ? ?
Column Sum ? ? ? ?

Calculate allele counts & then test for association

Table 3b. Allele counts at Locus A

A1 A2 Row Sum
Taster ? ? ?
Non-Taster ? ? ?
Column Sum ? ? ?

Test association with Locus B

Table 4a. Genotype counts at Locus B for each phenotype

B1/B1 B1/B2 B1/B3 B2/B2 B2/B3 B3/B3 Row Sum
Taster ? ? ? ? ? ? ?
Non-Taster ? ? ? ? ? ? ?
Column Sum ? ? ? ? ? ? ?

Calculate allele counts & then test for association Table 4b. Allele counts at Locus A

B1 B2 B3 Row Sum
Taster ? ? ? ?
Non-Taster ? ? ? ?
Column Sum ? ? ? ?

Web Exercise. Search for gene information using NCBI online databases

  1. Point your browser to the NCBI Human Genome Resource page
  2. Copy and paste sequence provided on Blackboard- this is the sequence of the gene associated with the taster phenotype
  3. Expand the "Algorithm parameters" tab and change "Expect threshold" to 0.00001 (10e-5). Define "expect value" in your owns words after watching the linked Youtube video.
  4. Press "BLAST". Copy & Paste the top hit in your final lab report.
  5. Briefly describe the function of the gene based on information gathered on the locus page

Lab Report IV

  1. Your report should include the following results:
    1. A printout of contingency test for Locus A, including expected counts, observed counts, chi-square statistic, degree of freedom, and p values
    2. Same as above for Locus B
    3. A printout of alignment for the top BLAST hits for the sequence provided

Additional questions to include in your report:

    1. State what is the null hypothesis in a chi-square test & what is the alternative hypothesis
    2. Explain what probability is represented by the p-value.
    3. What can you conclude when p-value is below the threshold of significance (e.g., p = 0.05)?
    4. What would you conclude when p-value is above the critical value?
    5. Is there a statistically significant association between one of the alleles tested and the Taster phenotype?
    6. Which genotype is over-represented in the Non-Tasters?
    7. Which allele is over-represented in the Non-Tasters?
    8. Are there exceptions? What are possible causes for exceptions?
    9. Define e-value in a BLAST search