640 likes | 821 Views
Practical Applications of Sequence Analysis. MIC 753 - The Theory and Application of Sequence Analysis. Instructor (Me). Elliot Lefkowitz Office: BBRB 277A Phone: 4-1946 E-mail: ElliotL@uab.edu. Time and Place. April 3 to June 5 2001 no class May 17 no class May 29 Tuesdays: 3:00 – 4:30
E N D
Practical Applications of Sequence Analysis MIC 753 - The Theory and Application of Sequence Analysis
Instructor (Me) • Elliot Lefkowitz • Office: BBRB 277A • Phone: 4-1946 • E-mail: ElliotL@uab.edu
Time and Place • April 3 to June 5 2001 • no class May 17 • no class May 29 • Tuesdays: 3:00 – 4:30 • Thursdays: 1:00 – 2:30 • BBRB 170
World-Wide-Web Site • http://genome.microbio.uab.edu/MIC753 • General Information • Handouts • Lecture Notes • Discussion List • Links
Discussion List • Talk with Elliot • Ask questions • Get answers • Express your frustration • Public!
Email ElliotL@uab.edu • Talk with Elliot • Ask questions • Get answers • Express your frustration • Private! • Send me your Email Address!!
Sequence analysis at UAB http://genome.microbio.uab.edu • Many local sites mentioned in this course have links from this page
Course Content • Tools for Sequence Analysis • GCG • NCBI • Databases • Other Web-based tools • Specialized Software • Publication • Understanding their use
GCGGenetics Computer Group • The Wisconsin Package • Complete package of programs for sequence analysis • Nucleic acid and Protein databases • Supporting databases
GCG Costs Program and database costs are subsidized by: • UAB Health Services Foundation • UAB AIDS Center
GCG Interfaces • Command line • Text based commands • UNIX • Graphics display • SeqLab • Xwindows • Windows environment • SeqWeb • Web-based sequence analysis
Other Useful Programs • Available for: • Macintosh • Windows PCs • UNIX Workstations • Web-based tools
Microcomputer Sequence Analysis Packages • Macintosh and IBM
Pros and Cons • Initially expensive • Databases supplied on CD-ROM • Available on-line from Genbank • Packages vary in completeness • Generally easy to use
GCG Documentation • Web-based • http://genome.microbio.uab.edu/GCG/GCG10/gcgmanual.html • Username: gcg • Password: uab • Order from GCG • http://www.gcg.com/ordering/orderdocform.html • UNIX documentation
GCG Users Guide GCG Manual Version 10.1 • Basic guide to UNIX • Running GCG programs • Program descriptions • Working with data • Generating graphics
GCG Program Manual • Complete Program Guide • Description • Input • Output • Parameters • Algorithms • Example
SeqLab User’s Guide and Tutorial • GCG XWindows Interface • Windows-like control • menus • dialog boxes • mouse control
SeqWeb Interface • http://gcg.genome.uab.edu • Flat-file version • http://gcg.genome.uab.edu:90 • Relational database version • Web-based access to GCG programs • Online help
A Practical Guide to the Analysis of Genes and Proteins • Strongly Recommended • Editors • Andreas Baxevanis • B.F.Francis Ouellette • John Wiley & Sons, 1998
Sequence, Structure, and Databanks : A Practical Approach • Editors • Des Higgins • Willie Taylor • Oxford Univ Press, 2000
Sequence Analysis Primer • Editors • Michael Gribskov • John Devereux • Groves Press, 1991
Sequence Analysis in Molecular Biology • Gunnar von Heijne, Editor • Academic Press, 1987
Molecular Systematics • David M. Hillis and Craig Moritz (eds.) • Sinauer Associates, 1990
MacClade • Analysis of Phylogeny and Character Evolution • Wayne P. Maddison and David R. Maddison • Program and Manual • Sinauer Associates, 1992
PAUP 4.0 • Phylogenetic Analysis Using Parsimony • David L. Swofford • Program and Manual • Mac • PC (DOS) • UNIX • Sinauer Associates, 1998 • Available later this year
Molecular Evolution • Computer Analysis of Protein and Nucleic Acid Sequences • Methods in Enzymology, Vol. 183, Doolittle, R.F. (ed.) • Academic Press, 1990
Getting Started • Introduction; Communications • Using Computers • UNIX • Running GCG Programs • Text • Graphics • SeqLab • SeqWeb
Data • Sequence and Other Data • SeqStore • Genbank Services • Sequence Editing • Fragment Assembly • Genbank submission
Sequence Patterns • Mapping • Pattern Recognition • Protein Analysis
Sequence Similarities • Sequence Comparison • Database Searching • Multiple Sequence Analysis • Prediction of functional motifs • Profile Analysis
RNA • Secondary Structure Prediction
Phylogenetic Analysis • Theory • Application
MicroArray Analysis • Aubrey Hill • Director, Bioinformatics • UAB Comprehensive Cancer Center
Genome Analysis • Mapping • Sequencing • Analysis
Getting some real work done • Graphics and Publication • Working with your results
General Lecture Outline • What the program does • What the program wants • How the program works • How can I manipulate the program? • What do I get in the end?
Internet-based Tools • What is available • How do I find them? • How do I use them?
Class Projects • Project 1 - Preparation of Web-based Bioinformatic Solutions • Project 2 - Analysis of a gene family • Understanding • Practice A+
Preparation of a Web-based Bioinformatic Solution • Each student will design a web site that describes online, web-based solutions that are available to help solve a particular bioinformatic problem. • Due Date: Monday, May 21
Possible Projects • Protein motif identification • Protein structure prediction • Protein Families • RNA Secondary structure prediction • Human genome analysis • Mouse genome analysis • Microbial genome analysis • Database access and searching • Evolutionary analysis • Sequence assembly • Microarray analysis
Project 1 Guidelines • You should have your topic approved by Elliot prior to beginning this project. The list below provides examples of bioinformatic problems that may be appropriate, or choose your own. • Your web site should include links to the appropriate web pages along with descriptions of what is available and a how-to guide to provide on-line help in using the bioinformatic tools. • You should also provide a review of the described sites that focuses on their ability to solve the problem at hand, and ease-of-use. • You can utilize whatever web-creation software you wish. Recommendations of easy-to-use web design programs are Microsoft FrontPage for the PC and Adobe PageMill for the Macintosh. • Up to TWO students may work on this project together and submit a common set of files for their web site.
Analysis of a gene family • Each student will be required to identify a sequence family that they will analyze in different ways throughout the course. Each student must be working on a unique set of genes. Elliot will approve each choice on a first-come, first-serve basis. So it is to your advantage to accomplish the following assignment as soon as possible. • Due Date: Monday, June 11
Project 2 Guidleines • Pick an interesting gene family that contains at least six members either in multiple species, or in multiple alleles. • Email a description of the gene family to Elliot (ElliotL@uab.edu) and why you think it will be worthwhile to study. • Elliot will let you know if your choice is satisfactory. If not, you will have to choose another set of genes. • Throughout the course, you should apply the various analytical programs demonstrated in class to your gene family. It will be your decision to choose which programs to use in your analysis. Your analysis should be compressive and relevant to the chosen gene family. • Prepare a short journal-type paper describing your work.