1 / 23

Biology 224 Dr. Tom Peavy Sept 27 & 29

Protein Structure & Analysis. Biology 224 Dr. Tom Peavy Sept 27 & 29. <Images from Bioinformatics and Functional Genomics by Jonathan Pevsner> . Protein families. Protein localization. protein. Protein function. Gene ontology (GO): --cellular component --biological process

roger
Download Presentation

Biology 224 Dr. Tom Peavy Sept 27 & 29

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Protein Structure & Analysis Biology 224 Dr. Tom Peavy Sept 27 & 29 <Images from Bioinformatics and Functional Genomics by Jonathan Pevsner>

  2. Protein families Protein localization protein Protein function Gene ontology (GO): --cellular component --biological process --molecular function Physical properties

  3. The Human Proteome Organisation (HUPO) Proteomics Standards Initiative (PSI) • Work groups • Protein Separation • Mass Spectrometry • Molecular Interactions • Protein Modifications • Proteomics Informatics • Themes • Controlled vocabularies • MIAPE: Minimum information about a proteomics experiment

  4. Protein domains, motifs & signatures

  5. Definitions • Signature: • a protein category such as a domain or motif • (a defining property of the protein or family) • Domain: • a region of a protein that can adopt a 3D structure • a fold • a family is a group of proteins that share a domain • examples: zinc finger domain • immunoglobulin domain • Motif (or fingerprint): • a short, conserved region of a protein • typically 10 to 20 contiguous amino acid residues

  6. Definition of a domain According to InterPro at EBI (http://www.ebi.ac.uk/interpro/): A domain is an independent structural unit, found alone or in conjunction with other domains or repeats. Domains are evolutionarily related. According to SMART (http://smart.embl-heidelberg.de): A domain is a conserved structural entity with distinctive secondary structure content and a hydrophobic core. Homologous domains with common functions usually show sequence similarities.

  7. 15 most common domains (human) Zn finger, C2H2 type 1093 proteins Immunoglobulin 1032 EGF-like 471 Zn-finger, RING 458 Homeobox 417 Pleckstrin-like 405 RNA-binding region RNP-1 400 SH3 394 Calcium-binding EF-hand 392 Fibronectin, type III 300 PDZ/DHR/GLGF 280 Small GTP-binding protein 261 BTB/POZ 236 bHLH 226 Cadherin 226

  8. Varieties of protein domains Extending along the length of a protein Occupying a subset of a protein sequence Occurring one or more times

  9. Example of a protein with domains: Methyl CpG binding protein 2 (MeCP2) MBD TRD The protein includes a methylated DNA binding domain (MBD) and a transcriptional repression domain (TRD). MeCP2 is a transcriptional repressor. Mutations in the gene encoding MeCP2 cause Rett Syndrome, a neurological disorder affecting girls primarily.

  10. Result of an MeCP2 blastp search: A methyl-binding domain shared by several proteins

  11. Are proteins that share only a domain homologous?

  12. Proteins can have both domains and patterns (motifs) Pattern (several residues) Pattern (several residues) Domain (aspartyl protease) Domain (reverse transcriptase)

  13. Can find UniProt accession number within GenBank Entry Human hemoglobin subunit beta NP_000509

  14. The SwissProt entry for any protein provides highly useful information…

  15. Definition of a motif A motif (or fingerprint) is a short, conserved region of a protein. Its size is often 10 to 20 amino acids. Simple motifs include transmembrane domains and phosphorylation sites. These do not imply homology when found in a group of proteins. PROSITE (www.expasy.org/prosite) is a dictionary of motifs (there are currently 1600 entries). In PROSITE, a pattern is a qualitative motif description (a protein either matches a pattern, or not). In contrast, a profile is a quantitative motif description. Profiles are found in Pfam, ProDom, SMART, and other databases. Page 231-233

  16. Pattern syntaxThe symbol `x' is used for a position where any amino acid is accepted. Ambiguities are indicated by listing the acceptable amino acids for a given position, between square brackets `[ ]'. For example: [ALT] stands for Ala or Leu or Thr. Ambiguities are also indicated by listing between a pair of curly brackets `{ }' the amino acids that are not accepted at a given position. For example: {AM} stands for any amino acid except Ala and Met. Each element in a pattern is separated from its neighbor by a `-'. Repetition of an element of the pattern can be indicated by following that element with a numerical value or, if it is a gap ('x'), by a numerical range between parentheses.Examples: x(3) corresponds to x-x-x x(2,4) corresponds to x-x or x-x-x or x-x-x-x A(3) corresponds to A-A-A Note: You can only use a range with 'x', i.e. A(2,4) is not a valid pattern element. When a pattern is restricted to either the N- or C-terminal of a sequence, that pattern either starts with a `<' symbol or respectively ends with a `>' symbol.

More Related