1 / 21

D2DBT9 - Genetic Analysis and Bioinformatics

D2DBT9 - Genetic Analysis and Bioinformatics. Collecting Public Information on Proteins – Self-guided practical session Dr. Jaume Bacardit jaume.bacardit@nottingham.ac.uk. Protein we are going to use today…. We are going to use in most examples the AXR4 protein from Arabidopsis Thaliana.

carlyn
Download Presentation

D2DBT9 - Genetic Analysis and Bioinformatics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. D2DBT9 - Genetic Analysis and Bioinformatics Collecting Public Information on Proteins – Self-guided practical sessionDr. Jaume Bacarditjaume.bacardit@nottingham.ac.uk

  2. Protein we are going to use today… • We are going to use in most examples the AXR4 protein from Arabidopsis Thaliana MAIITEEEEDPKTLNPPKNKPKDSDFTKSESTMKNPKPQSQNPFPFWFYFTVVVSLATII FISLSLFSSQNDPRSWFLSLPPALRQHYSNGRTIKVQVNSNESPIEVFVAESGSIHTETV VIVHGLGLSSFAFKEMIQSLGSKGIHSVAIDLPGNGFSDKSMVVIGGDREIGFVARVKEV YGLIQEKGVFWAFDQMIETGDLPYEEIIKLQNSKRRSFKAIELGSEETARVLGQVIDTLG LAPVHLVLHDSALGLASNWVSENWQSVRSVTLIDSSISPALPLWVLNVPGIREILLAFSF GFEKLVSFRCSKEMTLSDIDAHRILLKGRNGREAVVASLNKLNHSFDIAQWGNSDGINGI PMQVIWSSEASKEWSDEGQRVAKALPKAKFVTHSGSRWPQESKSGELADYISEFVSLLPK SIRRVAEEPIPEEVQKVLEEAKAGDDHDHHHGHGHAHAGYSDAYGLGEEWTTT

  3. Biological databases • Uniprot • NCBI Entrez • Pfam

  4. UniProt • UniProt is a collaboration between the European Bioinformatics Institute (EBI), the Swiss Institute of Bioinformatics (SIB) and the Protein Information Resource (PIR). • Main protein data base • http://www.uniprot.org/

  5. Querying UniProt with a protein Name: AXR4 Uniprot ID

  6. Included in the AXR4 page…. • Annotation of protein • Function, location, specificity, disruption phenotype • Gene Ontology • Sequence • Transmembrane potential • Bibliographic references • Cross-references to other databases • GenBank, PIR, KEGG, TAIR (Adapbidopsis-specific) • Identify each of these

  7. Alternatively, if we know the sequence but not the protein…

  8. Returns these results • The top hit is the most likely protein for the query sequence, especially if the alignment score is high

  9. Identifying homologs and aligning them • The blast section of uniprot is also useful to identify homologs. These will be the next hits in the list after the first one. • Now let’s select a few of them and click ‘Align’ on the green bar

  10. Aligning the homologs (ClustalW)

  11. ClustalW also generates phylogenetic trees(we will discuss about them in a future lecture)

  12. Not only in Uniprot we have protein information…. • The NCBI’s Entrez system returns this for AXR4

  13. Pfam: sequence-based detection of protein families

  14. Pfam returns three possible sequence motifs (but no significant results)

  15. Screenshot from 2009 The database is larger now Protein Data Bank (PDB) Put your PDB ID Here Each protein in PDB is identified by a 4-letter code

  16. Entry 2p31 Let’s click at display PDB file

  17. PDB file for 2p31 Sequence Atomic coordinates of the amino acids

  18. Other Biological Databases

  19. Prediction sites • Not all public information about proteins is actual solid information • There are also sites that perform predictions based on a protein’s primary sequence • Sometimes these services take long to perform their predictions, and ask for an email address to notify the user when finished

  20. Firestar:Functional sites prediction

  21. TMHMM: Transmembrane prediction

More Related