340 likes | 493 Views
Tutorial: Some Start-up Towards Using some “simple” Phylogenetic Reconstruction Packages. Winthrop June 28 – July 2, 2014 Terrell L. Hodge Western Michigan University terrell.hodge@wmich.edu. Recall: EXAMPLE OF Phylogenetic Tree Reconstruction.
E N D
Tutorial: Some Start-up Towards Using some “simple” Phylogenetic Reconstruction Packages Winthrop June 28 – July 2, 2014 Terrell L. Hodge Western Michigan University terrell.hodge@wmich.edu
Recall: EXAMPLE OF Phylogenetic Tree Reconstruction Output from “simplest” method: parsimony, which aims to minimize the number of differences in the character states per site: The output gave three trees, all with the minimum number, 97, of differing character states.
AIM 1: Can WE Reconstruct this tree ourselves, EVEN ELIMINATE the AMBIGUITY, as a Way to Build OUR Skills? • The trees were constructed by using “Phylogenetic Tree Constructor”, a BioQuest project outcome available at BEN (BiosciEdNet). • Try to do this now yourself: A screen shot appears on the next page. Go to: http://www.securebio.umb.edu/cgi-bin/TreeConstructor.pl and follow the easy instructions. Parsimony is the only tree reconstruction method used there.
AIM 2: Can we Obtain Our OWN Sequence data from Genbank? • One entry point: http://www.ncbi.nlm.nih.gov/ • From “Popular Resources” box, can choose • “Nucleotide” under “All Databases” to find DNA sequences • “BLAST” to search for sequences similar to one you already have in hand. • The next page has a screen shot.
NEXT: • Going to “Nucleotide” to searching for DNA sequences for “greenbottle fly”; once there, added “and coI” to the search terms to pick out commonly used gene for animal studies • Screen shot on next pages.
NEXT: • Going to “GenBank” to view accession for selected sequence (happens here to be the first one listed on preceding output of Nucleotide search; used link from there). • Screen shot on next pages.
NEXT: • Going from accession entry for selected sequence to view/retrieve sequence in FASTA format. • Screen shot on next pages.
NEXT: • One can copy, paste, and save this FASTA formatted sequence, e.g., in a text file. • For later processing, be sure to keep the “>” and then the actual DNA sequence data together. Only this FASTA format will be wanted for downstream application, so… • rearrange the data as needed, keeping other identifying features elsewhere in your file, e.g., gi|317444450|emb|FR719169.1| Luciliacuprina partial COI gene for cytochromeoxidase subunit 1, isolate LU9 >TAAATTTTACTTCAGCTACTATAATTATTGCTGTACCAACTGGAATTAAAATTTTCAGTTGATTAGCAACTCTTTATGGAACTCAATTAAACTACTCTCCTGCTACTTTATGAGCTTTAGGATTGTATTTTTATTTACTGTAGGAGGTTTAACTGGAGTTGTTTTAGCTAACTCTTCAATTGATATTATTCTACATGATACTTATTACGTAGTAGCTCACTTCCATTATGTTTTATCAATAGGAGCTGTATTTGCTATTATAGCAGGATTTGTTCATTGATACCCTTTATTTACAGGATTAACTTTAAATACTAAGATATTAAAAAGTCAATTTGCTATTATATTCATTGGAGTAAATTTAACATTTTTCCCCCAACATTTTTTAGGATTAGCAGGAATACCACGACGATATTCAGACTACCCAGATGCTTACACAACTTGAAATGTAATTTCTACAATTGGGTCAACAATTTCTTTATTAGGAATTTTATTCTTCTTCTTTATTATTTGAGAAAGTCTTGTATCTCAACGTCAAGTTTTATTCCCTATTCAATTAAATTCATCAATTGAATGATTACAAAATACTCCACCAGCTGAACATAGTTATTCTGAATTACCTTTATTAACTAA
NEXT: • To find other sequences which are similar to your selected one, you can paste the FASTA formatted sequence data just acquired directly into BLAST. • You can also use the “BLAST” link from the accession page of the sequence just identified. • A screen shot of the outcome of choosing option 2 above appears next.
NEXT: • To continue on to find other sequences which are similar to your selected one, you can paste into the box the FASTA formatted sequence data just acquired directly into BLAST, or • proceed using the accession number data conveniently inserted if going directly from a link from a “Nucleotide” search. • For this exercise (a nucleotide-nucleotide search) • select “nr/nt” database, • under “Program Selection”, select “blastn” (this is optimized for “somewhat similar sequences”), and • check box “show results in a new window” (this is just handy). • Other options are described in accompanying handout, “BIOS 5460 Assignment 1”. • A screen shot of the final outcome of then hitting the “BLAST” button appears next.
More INFO • More info on reading and evaluating the output of the previous call is in the handout “BIOS 5460 Assignment 1”. (Thanks go to Dr. Todd Barkman of the Department of Biological Sciences at Western Michigan University.)
Recall: EXAMPLE OF Phylogenetic Tree Reconstruction Output from “simplest” method: parsimony, which aims to minimize the number of differences in the character states per site: The output gave three trees, all with the minimum number, 97, of differing character states.
Can We FIND DATA FOR THE Other Species and Try to reconstruct the tree ourselves? • Search Nucleotide for “flesh fly and coI” and “fruit fly and coI”. • What are the outcomes? In particular, see the box “Top Organisms” on the right-hand side of the respective Nucleotide pages.
AIM 3: Assuming we could obtain sequence data, could we use a more sophisticated program to reconstruct phylogenetic trees, using different reconstruction methods?
One single-stop shopping SITE for phylogenetic tree reconstruction • http://www.phylogeny.fr/ • Select the “one-click” option under “Phylogeny analysis” • Upload FASTA formatted sequences in the box, or… • Just to see how it runs, click on “load example of sequences”, then the “Submit” button. Phylogeny.fr Robust Phylogenetic Analysis For The Non-Specialist
Click there to get preloaded sample sequences. The “Submit” button is lower down on this screen.
Explore further • See intermediate processes executed step-by-step by restarting using the “advanced mode” instead of “one-click” under the “phylogenetic analysis” tab on the page-top toolbar. • Restart using “a la carte” on the “phylogenetic analysis tab” instead, and you can then choose to vary the tree reconstruction methods (some options and outcomes shown on next page).
Aim 4: discussion and further resources – what (ELSE) do real biologists/mathbiologists use? • Mega (e.g., Mega 5) (free) • GenSpring (not free) • Magic (free, good for undergrads, out of Davidson) • GenePattern (free) • Phylip, PAUP, Clustal, more…
HopE YOU hAD Fun! There’s so much more to learn…