QuBi/modules/biol302

From QiuLab
Revision as of 23:41, 21 February 2013 by imported>Ppagan
Jump to navigation Jump to search
BIOL 302 Lab (Bioinformatics Exercises)

Research in molecular genetics requires effective use of bioinformatic tools to analyze and understand the genetic materials being worked with. The following exercises will expose you to real-world scenarios and introduce you to the methods and tools you can use to solve these problems.

MODULE: Identification of mdm2 Splice Variants Using BLAST

4seq.png

A diagram of the MDM2 gene used in this exercise, along with its splice variants. By the end of this module you will create a similar diagram.

Objectives

  • Learn to use Genbank database and BLAST tool to analyze nucleotide sequences
  • Use BLAST to identify

Key Concepts

Blast
Genbank
Annotation
Accession Number
Alternative Splicing

Exercise

Genbank Accession # cDNA Clone Description Cell Line Length (bp)
AF527840 Genomic DNA 34,088
EU076746 P2-MDM2-C1 cDNA missing exons 5-9 & 11 MANCA 427
EU076747 P2-MDM2-10 cDNA missing exon 10 ML-1 842
EU076748 P2-MDM2-C cDNA missing exons 5-9 A876 505
EU076749 P2-MDM2-FL Full-length cDNA SJSA-1 845

Explore the gene annotation for AF527840.

Sequences on genbank have both basic reference information (such as what the sequence is, what organism it came from, and bibliographical information) and sequence annotations. Some sequences are more richly annotated than others - it is up to researchers to annotate the sequences they generate, which requires extra work. For this exercise you will be working will a well-annotated sequence: accession number AF527840. Explore its annotation and use it to complete the following set of tasks.

  1. DRAW a diagram of this gene using the information and coordinates listed in the annotation. (Note: this is the bulk of the assignment and this diagram is needed for the last set of questions. Don't get lazy on this.)
    1. Label the top of the diagram with basic information, such as the gene's name, organism, etc.. Someone should be able to pick up your diagram and know exactly what they're looking at.
    2. Including introns, exons, 3'/5' UTRs, +1, and exact coordinates. (The mRNA annotation states which segments are used to create mRNA, and the CDS annotation states which parts code amino acids (CDS = coding sequence)).
    3. Draw the diagram mostly to scale. It does NOT have to be perfect, but make a reasonable effort. Put a scale bar and length markers on your drawing.
  2. How does the sequence vary at positions X, X, and X for this gene? Do these change the AA for the resulting peptide?
  3. What kinds of repeat regions can be found in this gene?

Explore the graphical presentation of the gene to answer the following question.

Genbank provides graphical representations of the sequences on its database: click the "Graphics" link below the sequence title, OR click "Display Settings" above the title, and choose "Graphics". Take a few minutes to explore this graphical browser and answer the following question:

  1. A question that can only be answered from looking at this graph.
  2. Another question that can only be answered from looking at this graph.

Use BLAST to determine which exons are used in the mRNA transcripts.

This is the most "bioinformatic" part of the assignment. Blast ALL FOUR of the mRNA sequences (EU076746, EU076747, EU076748, EU076749) against the main sequence (AF527840) and use the results to answer the following questions. If you labeled your diagram well (with coordinates!), this task should go by quickly.

  1. Which exons are used to create EU076746, EU076747, EU076748, EU076749?
  2. Do the BLAST search results corresponding to exons exactly match the start/end positions of the exons as labeled in your diagram? If not, what is the most likely reason for this?
  3. Do any of the BLAST results match regions outside of exons? If so, what regions?


MODULE: Cloning of murine mdm2 gene sequence to study cis acting DNA elements

TODO: Summarize project

Key Concepts

  • Clone
  • Vector
  • Rotational Symmetry

Exercise

You're working on a research project. So far you've cloned this fragment:

TCTAGATGCATTTACGAAGGAGACAGAAAACGTCTTTCGGCAATAGCTCTCAAATGCAAAACGACGTCGG CGAGCTGTCCCTTACCTGGAGGCCCGCAGGAGAAGCGCGGTGATCCGAGAGGGTCCCCCAGGGGTGTCCG GTCGGTCTCCCGCTCGCCCAGCAGACGGCTGCGGAAACGGGGCAGCGTTTAAATAACCCCAGCTGGAGAC ATGTCAGGACTTAGCTCCTCCGACAGCCGACGCCGGACGTGTCCCAACTTGACCAGCCCCACAGGAAGAG CTGAGTCAACTCGGCCCAGCCCAGTCCCACCCGTCCCGGAAGCCGCATCCCGGCGAGTCCGGGACCAGGC ACCTGTCACCTCCTGGACCCCAGCAACGAGCCCAGCGCGACCCCGGAGCGGGCCCGAATTCTCTAGA

Into this vector:

PGL2 basic.png pGL2 Luciferase Reporter Vectors

Using these restriction enzymes:

Restriction Enzyme Recognition Sequence Overhang
XbaI TCTAGA CTAG
NheI GCTAGC CTAG

In this module, you will analyze your fragment sequence and resultant clone with the help of this diagram:

Zauberman fig2.png

PROPERLY CITE THIS Taken from A functional p53-responsive intronic promoter is contained within the human mdm2 gene. Pubmed PDF


Annotate fragment and draw diagram of resultant clone

This exercise is not bioinformatics-heavy, but sequence analysis is a major task of the field. You will create another annotated drawing of a DNA molecule (the plasmid with your fragment cloned in) and annotate your fragment sequence by hand. You should have a printout of the fragment sequence to directly annotate.

  1. On your sequence,

BRAINSTORMING

The students could do a blast on their fragment to see how it was constructed (I just did it now, and it's an interesting project). If they blast their entire sequence, they'll find this as part of their hits. Blast Hit List [key:sequence_track,name:Sequence,display_name:Sequence,id:STD1,category:Sequence,annots:Sequence,ShowLabel:true[key:gene_model_track,CDSProductFeats:false][key:alignment_track,name:other%20alignments,annots:NG%20Alignments|Refseq%20Alignments|Gnomon%20Alignments|Unnamed,shown:false]&v=158:253&appname=ncbiblast&link_loc=fromHSP Visualization of blast hit]

TCTAGATGCATTTACGAAGGAGACAGAAAACGTCTTTCGGCAATAGCTCTCAAATGCAAAACGACGTCGG CGAGCTGTCCCTT

ACCTGGAGGCCCGCAGGAGAAGCGCGGTGATCCGAGAGGGTCCCCCAGGGGTGTCCG GTCGGTCTCCCGCTCGCCCAGCAGACGGCTGCGGAAACGGGGCAGCGTTTAAATAACCCCAGCTGGAGAC ATGTCAGGACTTAGCTCCTCCGACAGCCGACGCCG

GACGTGTCCCAACTTGACCAGCCCCACAGGAAGAG CTGAGTCAACTCGGCCCAGCCCAGTCCCACCCGTCCCGGAAGCCGCATCCCGGCGAGTCCGGGACCAGGC ACCTGTCACCTCCTGGACCCCAGCAACGAGCCCAGCGCGACCCCGGAGCGGGCCCGAATTCTCTAGA

/BRAINSTORMING