<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.genometracker.org/index.php?action=history&amp;feed=atom&amp;title=QuBi%2Fmodule%2Fbio203-mdm2-lab</id>
	<title>QuBi/module/bio203-mdm2-lab - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.genometracker.org/index.php?action=history&amp;feed=atom&amp;title=QuBi%2Fmodule%2Fbio203-mdm2-lab"/>
	<link rel="alternate" type="text/html" href="https://wiki.genometracker.org/index.php?title=QuBi/module/bio203-mdm2-lab&amp;action=history"/>
	<updated>2026-05-27T21:23:05Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.2</generator>
	<entry>
		<id>https://wiki.genometracker.org/index.php?title=QuBi/module/bio203-mdm2-lab&amp;diff=3966&amp;oldid=prev</id>
		<title>imported&gt;Weigang: Created page with &quot;&lt;span style=&quot;color: Seagreen;font-weight:bold;font-size:large;&quot;&gt;Lab 12. Bioinformatics Exercises: BLAST &amp; Gene Structure&lt;span&gt; ==Expected Learning Outcomes== * Be able to perf...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.genometracker.org/index.php?title=QuBi/module/bio203-mdm2-lab&amp;diff=3966&amp;oldid=prev"/>
		<updated>2016-03-24T21:49:43Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;&amp;lt;span style=&amp;quot;color: Seagreen;font-weight:bold;font-size:large;&amp;quot;&amp;gt;Lab 12. Bioinformatics Exercises: BLAST &amp;amp; Gene Structure&amp;lt;span&amp;gt; ==Expected Learning Outcomes== * Be able to perf...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&amp;lt;span style=&amp;quot;color: Seagreen;font-weight:bold;font-size:large;&amp;quot;&amp;gt;Lab 12. Bioinformatics Exercises: BLAST &amp;amp; Gene Structure&amp;lt;span&amp;gt;&lt;br /&gt;
==Expected Learning Outcomes==&lt;br /&gt;
* Be able to perform NCBI BLAST search for homologous sequences in GenBank.&lt;br /&gt;
* Be able to identify individual gene elements based on an NCBI GenBank file.&lt;br /&gt;
* Be able to identify alternative splice forms a single gene using NCBI web tools&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Lab Report Grading Policy==&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Introduction (5 pts)&amp;#039;&amp;#039;&amp;#039;. Define these terms: bioinformatics, homology, BLAST, e-value, alternative splicing (Your statements are not to be copied from the Lab Manual.)&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Materials and Methods (5 pts)&amp;#039;&amp;#039;&amp;#039;. List and describe steps of a BLAST search &amp;amp; steps of identifying alternative splicing variants with BLAST.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Results (20 pts)&amp;#039;&amp;#039;&amp;#039;. Copy your results in a document during the exercises, and hand in an organized copy. Include answers to all queries and questions.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Discussion and Conclusion (15 pts)&amp;#039;&amp;#039;&amp;#039;. Answer the five discussion questions. Summary/Conclusion: a sentence or two will suffice.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;References (5 pt)&amp;#039;&amp;#039;&amp;#039;. Credit is given for pertinent references obtained from sources other than the Lab Manual.&lt;br /&gt;
&lt;br /&gt;
(Total: 50 pts)&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Introduction==&lt;br /&gt;
Research in molecular genetics requires effective use of online bioinformatic tools to analyze and understand the genetic materials being worked with. The following exercises will expose you to real-world scenarios and introduce you to the methods and tools you can use to solve these problems.&lt;br /&gt;
&lt;br /&gt;
In biology, homology is defined as a common or shared evolutionary origin. Therefore, homologous sequences are sequences diverged from a common ancestor. Note that the word &amp;quot;homology&amp;quot; is different from &amp;quot;similarity&amp;quot;: homologous structures or sequences may not be similar (e.g., forearms in mammals and birds) and, conversely, similar structures or sequences may not be homologous (e.g., wings in birds and bats).&lt;br /&gt;
&lt;br /&gt;
BLAST is a computer algorithm allowing for efficient search of similar sequences in a large database. While BLAST performs a similar function to Google search, you should not use Google to look for similar sequences in a human or other genome. When sequences are similar with a sufficient statistical significance (measured by e-value, see below), we consider these sequences homologous to each other.&lt;br /&gt;
----&lt;br /&gt;
==Exercise 1. Homology searching using BLAST==&lt;br /&gt;
# Go to the NCBI-BLAST website at [http://blast.ncbi.nlm.nih.gov/Blast.cgi NCBI/BLAST Home Page]&lt;br /&gt;
# What is BLAST? Read and copy the expanded answer by clicking on &amp;quot;more&amp;quot;&lt;br /&gt;
# Since BLAST finds matches between biological sciences, it needs a &amp;quot;query&amp;quot; sequence as input as well as a &amp;quot;database&amp;quot; to search against. To find matches of a sequence in human genome, what would be your &amp;quot;query&amp;quot; sequence and what would be your &amp;quot;database&amp;quot;?&lt;br /&gt;
# Start BLASTing against the mouse genome by clicking &amp;quot;Mouse&amp;quot; under &amp;quot;BLAST Assembled RefSeq Genomes&amp;quot;&lt;br /&gt;
# Copy and paste the following sequence into the &amp;quot;Enter Query Sequence&amp;quot; box:&lt;br /&gt;
&amp;lt;div style=&amp;quot;font-family:Monospace;line-height:1;width:550px;border-style:solid;border-width:1px;border-color:#AAAAFF;background-color:#EEEEFF;padding-left:5px;padding-right:5px;padding-top:0px;padding-bottom:0px;&amp;quot;&amp;gt;&lt;br /&gt;
CTAGATGCATTTACGAAGGAGACAGAAAACGTCTTTCGGCAATAGCTCTCAAATGCAAAACGACGTCGG&lt;br /&gt;
CGAGCTGTCCCTTACCTGGAGGCCCGCAGGAGAAGCGCGGTGATCCGAGAGGGTCCCCCAGGGGTGTCCG&lt;br /&gt;
GTCGGTCTCCCGCTCGCCCAGCAGACGGCTGCGGAAACGGGGCAGCGTTTAAATAACCCCAGCTGGAGAC&lt;br /&gt;
ATGTCAGGACTTAGCTCCTCCGACAGCCGACGCCGGACGTGTCCCAACTTGACCAGCCCCACAGGAAGAG&lt;br /&gt;
CTGAGTCAACTCGGCCCAGCCCAGTCCCACCCGTCCCGGAAGCCGCATCCCGGCGAGTCCGGGACCAGGC&lt;br /&gt;
ACCTGTCACCTCCTGGACCCCAGCAACGAGCCCAGCGCGACCCCGGAGCGGGCCCGAATTCT&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;ol start=&amp;quot;6&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Scroll down to the bottom of the page and click &amp;quot;BLAST&amp;quot;&lt;br /&gt;
&amp;lt;li&amp;gt;Wait for 10-30 seconds for the results to return (&amp;#039;&amp;#039;&amp;#039;be patient&amp;#039;&amp;#039;&amp;#039;). Once the result page is loaded, locate and copy/write down&lt;br /&gt;
the following information for the first hit:&lt;br /&gt;
&amp;lt;ol&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Species and strain&lt;br /&gt;
&amp;lt;li&amp;gt;Chromosome&lt;br /&gt;
&amp;lt;li&amp;gt;Length of your query sequence&lt;br /&gt;
&amp;lt;li&amp;gt;Sequence identity, number of matched bases, and number of gaps between the matched sequences&lt;br /&gt;
&amp;lt;/ol&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Click the link for &amp;quot;5&amp;#039; side&amp;quot; (next to &amp;#039;&amp;#039;&amp;#039;Features&amp;#039;&amp;#039;&amp;#039;) will bring you a standard GenBank file of this gene. Locate and copy the&lt;br /&gt;
following structural information about this gene:&lt;br /&gt;
&amp;lt;ol&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Gene accession (ID number)&lt;br /&gt;
&amp;lt;li&amp;gt;Total length of the gene&lt;br /&gt;
&amp;lt;li&amp;gt;Number of introns&lt;br /&gt;
&amp;lt;li&amp;gt;Which is the non-template (mRNA analog) strand: the above sequence itself or its reverse complement? [Hint: note the word &amp;#039;&amp;#039;&amp;#039;complement&amp;#039;&amp;#039;&amp;#039; in mRNA and cDNA lines)&lt;br /&gt;
&amp;lt;/ol&amp;gt;&lt;br /&gt;
&amp;lt;/ol&amp;gt;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Exercise 2. Explore the structure of human &amp;#039;&amp;#039;mdm2&amp;#039;&amp;#039; gene==&lt;br /&gt;
# Search [http://www.ncbi.nlm.nih.gov/sites/entrez?db=Nucleotide GenBank] using the accession AF527840. Read the GenBank file and find out from the feature table how many introns and exons this sequence has according to the &amp;quot;mRNA&amp;quot; and &amp;quot;CDS&amp;quot; features.&lt;br /&gt;
# Click on &amp;quot;mRNA&amp;quot; and notice that exon sequences are now highlighted&lt;br /&gt;
# Fill in &amp;#039;&amp;#039;&amp;#039;Table 1&amp;#039;&amp;#039;&amp;#039; for each EXON you could identify:&lt;br /&gt;
# Fill in &amp;#039;&amp;#039;&amp;#039;Table 2&amp;#039;&amp;#039;&amp;#039; for each INTRON you could identify:&lt;br /&gt;
# Click on &amp;quot;CDS&amp;quot; and notice that coding sequences are now highlighted&lt;br /&gt;
# Fill in &amp;#039;&amp;#039;&amp;#039;Table 3&amp;#039;&amp;#039;&amp;#039; for each coding sequence you could identify:&lt;br /&gt;
# DRAW a diagram of this gene based on the above exon and intron coordinates.&lt;br /&gt;
##Label the top of the diagram with basic information, such as the gene&amp;#039;s name and species information.&lt;br /&gt;
##Label coordinates for introns, exons, 3&amp;#039;/5&amp;#039; UTRs, start-codon, and stop-codon coordinates.&lt;br /&gt;
##Draw the diagram mostly to scale. It does NOT have to be perfect, but make a reasonable effort. Put a scale bar and length markers on your drawing.&lt;br /&gt;
# Answer the following questions:&lt;br /&gt;
## What is the total length of exons, introns, and coding sequences of this gene?&lt;br /&gt;
## Are all exon sequences code for proteins? Which exons are non-coding in mdm2?&lt;br /&gt;
## Align the first 5 bases of all introns. Which bases are conserved near intron start (&amp;quot;donor site&amp;quot;)?&lt;br /&gt;
## Align the last 5 bases of all introns. Which bases are conserved near intron end (&amp;quot;acceptor site&amp;quot;)?&lt;br /&gt;
## Using [http://weblogo.berkeley.edu/ WebLogo] and make a sequence logo for the acceptor site and another sequence logo for the donor site. To do so, copy &amp;amp; paste individual sequences at the acceptor site into [http://weblogo.threeplusone.com/create.cgi this text box] and click &amp;quot;Create Logo&amp;quot;. Save the resulting image file and paste it into your notebook file. Repeat for the donor-site sequences.&lt;br /&gt;
&amp;lt;center&amp;gt;&lt;br /&gt;
Table 1. &amp;#039;&amp;#039;mdm2&amp;#039;&amp;#039; Exons&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Exon # !! Start Position !! End Position !! Length&lt;br /&gt;
|-&lt;br /&gt;
| #1 || 1971 || 2271 || 301&lt;br /&gt;
|-&lt;br /&gt;
| #2 || ? || ? || ?&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Table 2. &amp;#039;&amp;#039;mdm2&amp;#039;&amp;#039; Introns&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Intro Number !! Start Position !! End Position !! Length !! First 5 bases  !! Last 5 bases !! Phase*&lt;br /&gt;
|-&lt;br /&gt;
| #1 || 2272 || 2987 || 616 || GTACT || TGTAG || ?&lt;br /&gt;
|-&lt;br /&gt;
| #2 || ? || ? || ? || ? || ? || ?&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* Introns have phases. Phase 0 introns sit between 2 codons, phase 1 intron sit between the 1st codon position and the 2nd codon position, and phase 3 introns sit between the 2nd and 3rd codon position. How would you find out the phase of an intron? [Hint, use Table 3 CDS positions below].&lt;br /&gt;
&lt;br /&gt;
Table 3. &amp;#039;&amp;#039;mdm2&amp;#039;&amp;#039; Coding Sequences (CDS)&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! CDS # !! Start Position !! End Position !! Length&lt;br /&gt;
|-&lt;br /&gt;
| #1 || 2992 || 3072 || 81&lt;br /&gt;
|-&lt;br /&gt;
| #2 || ? || ? || ?&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/center&amp;gt;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Exercise 3. Alternative splicing: Use BLAST to determine which exons are present in an mRNA==&lt;br /&gt;
&amp;lt;center&amp;gt;&lt;br /&gt;
&amp;lt;div style=&amp;quot;font-family:Monospace;line-height:1;width:800px;border-style:solid;border-width:1px;border-color:#AAAAFF;background-color:#EEEEFF;padding-left:5px;padding-right:5px;padding-top:0px;padding-bottom:0px;&amp;quot;&amp;gt;&lt;br /&gt;
[[File:4seq.png|800px]]&lt;br /&gt;
A diagram of the MDM2 gene used in this exercise, along with its splice variants. By the end of this module you will create a similar diagram.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/center&amp;gt;&lt;br /&gt;
You will use the following table for your exercise:&lt;br /&gt;
&amp;lt;center&amp;gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
! Genbank Accession # !! cDNA Clone !! Description !! Cell Line !! Length (bp)&lt;br /&gt;
|-					&lt;br /&gt;
| AF527840 ||		 || Genomic DNA			 || 	 	|| 34,088&lt;br /&gt;
|-&lt;br /&gt;
| EU076746 || P2-MDM2-C1 || cDNA  || MANCA	|| 427&lt;br /&gt;
|-&lt;br /&gt;
| EU076747 || P2-MDM2-10 || cDNA || ML-1	|| 842&lt;br /&gt;
|-&lt;br /&gt;
| EU076748 || P2-MDM2-C  || cDNA  || A876	|| 505&lt;br /&gt;
|-&lt;br /&gt;
| EU076749 || P2-MDM2-FL || cDNA		 || SJSA-1	|| 845&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/center&amp;gt;&lt;br /&gt;
Blast one of the mRNA sequences (EU076746, EU076747, EU076748, EU076749) against the main sequence (AF527840) and use the results to answer the following questions. Suggested procedures:&lt;br /&gt;
# Go to the [http://www.ncbi.nlm.nih.gov/BLAST/ NCBI BLAST website]&lt;br /&gt;
# Click the link “Align two (or more) sequences using BLAST (bl2seq)” under “Specialized BLAST” (near the page bottom)&lt;br /&gt;
# In the “Sequence 1” text box, type in &amp;quot;EU076748&amp;quot; (or other cDNA accession in the table). In the “Sequence 2” text box, type “AF527840” (the accession for the genomics).&lt;br /&gt;
# Click “Align”. You should get a “Blast Result” output page.&lt;br /&gt;
# Fill in the following table based on BLAST-identified coordinates:&lt;br /&gt;
&amp;lt;center&amp;gt;&lt;br /&gt;
Table 4. A splice variant of mdm2 (Your choice of mRNA accession:________)&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Match # !! Query start !! Query end !! Subject start !! Subject end !! Exon # (consult Table 1)&lt;br /&gt;
|-&lt;br /&gt;
| ? || ? || ? || ? || ? || ?&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/center&amp;gt;&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Discussion Questions==&lt;br /&gt;
# Explain the following BLAST terms: “Expect” (e-value) [http://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&amp;amp;PAGE_TYPE=BlastDocs&amp;amp;DOC_TYPE=FAQ#expect Read this FAQ], “Identities”, “Gap”, “Strand”.&lt;br /&gt;
# Which is a statistically more significant match by BLAST, a match with an e-value=1e-5 or a match with an e-value of 1?&lt;br /&gt;
# If you want your match to be biologically relevant (and not random, chance matches), should you use the default e-value cutoff of 10?&lt;br /&gt;
# List and describe individual elements of a typical human gene based on mdm2. &lt;br /&gt;
# What is the &amp;quot;GT-AG&amp;quot; rule? Explain how to read the sequence logos. Explain the significance of sequence conservation at exon-intron junctions.&lt;br /&gt;
# Describe biological significance of alternative splicing, using mdm2 gene as an example.&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
==Reference==&lt;br /&gt;
* Arva NC, Talbott KE, Okoro DR, Brekman A, Qiu WG, Bargonetti J. 2008. Disruption of the p53-Mdm2 complex by Nutlin-3 reveals different cancer cell phenotypes. Ethnicity and Disease. 18(S2):1-8. [http://www.ncbi.nlm.nih.gov/pubmed/18646312 PubMed Abstract]&lt;br /&gt;
----&lt;/div&gt;</summary>
		<author><name>imported&gt;Weigang</name></author>
	</entry>
</feed>