120 likes | 360 Views
The European Bioinformatics Institute (EBI) Toolbox http://www.ebi.ac.uk/Tools/. Julie Pellegrini Introduction to Bioinformatics. What does the Toolbox include?. Homology and Similarity Programs Protein Functional Analysis Structural Analysis Sequence Analysis
E N D
The European Bioinformatics Institute (EBI) Toolboxhttp://www.ebi.ac.uk/Tools/ Julie Pellegrini Introduction to Bioinformatics
What does the Toolbox include? • Homology and Similarity Programs • Protein Functional Analysis • Structural Analysis • Sequence Analysis • Miscellaneous Tools (clustering, visualization tools for gene expression)
Homology and Similarity Programs • Fasta • Nucleotide, protein, ion channels • BLAST • Nucleotide, protein, ‘Alternative Splicing’, vectors • Smith & Waterman • Protein sequence comparison • Databases • Proteomes, Genomes, Parasites, SNP
Protein Functional Analysis • InterPro • Protein Motif Database • InterProScan • Search InterPro by accession number or sequence • Search features • Protein repeats, motifs, biochemical function, family
Structural Analysis • Protein structure analysis and secondary structure prediction • DALI, MaxSprout • Active Site Search • NMR elucidation • Protein Quaternary Structure • PQS, PQS-Quick
Sequence Analysis • Unknown structure and function can be elucidated through comparison with database of known structures/sequences/functions • ClustalW • Global Multiple Alignment • EMBOSSALIGN • Like BLAST2 but Global • Gene finding tools • Translation tools • Promoter Analysis tools
BioLayout CAST-protein compositional filtering Expression Profiler-gene expression tools NEWT Swiss-Prot Database Readseq Converts format Webservices Access various biological databases Protein Colourer Colors amino acids QuickGo Ontology Database search Protein Annotation Assistant XEMBL EMBL nucleotide sequence data in variety of formats Miscellaneous Tools
I’ve just discovered a gene! Or have I…? YeastGenome isolation and sequencing revealed possible protein-coding sequence: ATGTTTACTGGTCAGGAGTATCATTCCGTAGACTCTAATTCCAACAAGCAAAAAGACAACAATAAACGTGGTATTGATGACACATCAAAGATCTTGAATAATAAGATACCGCACTCTGTTAGTGATACTTCTGCCGCCGCCACCACCACTTCTACTATGAACAATTCTGCTTTAAGTAGATCCTTAGATCCTACTGACATAAACTATAGCACAAATATGGCTGGTGTGGTTGACCAAATACATGATTATACTACTTCCAATAGAAATTCTTTAACCCCACAATATTCTATTGCAGCTGGAAACGTCAATTCGCATGATCGGGTTGTTAAACCCAGCGCCAATTCAAACTATCAGCAGGCTGCATACCTTCGACAACAGCAACAGCAGGATCAGCGACAACAGTCACCCTCTATGAAAACTGAAGAGGAATCCCAACTCTACGGTGATATTCTGATGAATTCTGGTGTCGTACAGGATATGCATCAGAATCTGGCCACTCATACAAATCTGAGCCAACTGTCGTCTACCCGTAAGTCCGCTCCGAATGATTCTACTACAGCCCCGACTAATGCGTCCAACATCGCCAATACGGCTTCTGTGAACAAGCAGATGTATTTCATGAACATGAATATGAATAACAACCCACATGCCTTGAACGATCCATCCATCCTGGAAACATTGTCGCCATTTTTTCAACCTTTTGGTGTTGATGTAGCACATTTACCTATGACGAATCCACCAATTTTCCAAAGTTCTTTGCCTGGATGCGATGAGCCAATTAGAAGAAGAAGAATATCAATCTCTAACGGTCAAATAAGCCAGCTAGGCGAAGATATTGAAACTTTGGAAAACCTGCACAACACACAGCCGCCCCCGATGCCCAATTTTCACAATTATAATGGTCTGAGCCAAACTAGGAATGTATCAAACAAGCCGGTCTTCAACCAAGCAGTGCCGGTTAGTAGTATTCCACAATACAATGCAAAAAAAGTTATTAATCCCACGAAGGACTCCGCATTGGGTGATCAGAGCGTTATTTACTCGAAAAGTCAGCAGCGAAATTTTGTAAACGCGCCATCAAAGAATACTCCAGCGGAGAGTATAAGTGATTTGGAAGGCATGACGACGTTTGCGCCAACTACTGGAGGTGAAAATAGGGGCAAATCTGCACTTAGGGAATCTCACTCTAATCCTAGCTTCACTCCAAAATCTCAAGGATCTCATTTAAATTTAGCGGCGAACACACAGGGAAATCCAATCCCTGGTACTACGGCATGGAAGAGAGCAAGATTGTTAGAAAGAAATCGAATTGCAGCTTCGAAATGTAGACAAAGGAAAAAGGTTGCGCAGCTGCAGCTCCAAAAGGAATTTAACGAAATTAAAGACGAGAATAGAATTTTACTGAAAAAGTTAAATTACTATGAAAAACTAATCTCTAAATTCAAGAAATTCTCCAAAATTCATTTACGTGAACATGAAAAACTAAATAAAGACTCAGATAATAATGTTAATGGCACTAATAGTAGCAACAAAAATGAAAGCATGACTGTGGATTCATTAAAGATCATTGAAGAACTTTTAATGATCGATTCAGACGTTACAGAAGTGGATAAAGATACTGGTAAGATCATAGCCATCAAGCACGAGCCATACTCTCAACGTTTCGGAAGCGATACTGACGATGACGATATAGATCTCAAGCCCGTAGAAGGTGGTAAGGATCCAGACAACCAATCATTACCCAATTCTGAAAAGATAAAATAA
What sorts of protein does my newly discovered gene code for?? • Paste discovered gene into Transeq to find protein sequence
How does our protein sequence compare with known cross-references? • Here is a Case for Sequence Alignments! • Use Align, ClustalW…
Align Output Findings We have in fact found what our gene codes for (YCA2) and it has already been added to the protein database.