1 / 43

COT 6930 HPC and Bioinformatics Introduction to Molecular Biology

COT 6930 HPC and Bioinformatics Introduction to Molecular Biology. Xingquan Zhu Dept. of Computer Science and Engineering. Outline. Cell DNA DNA Structure DNA Sequencing RNA (DNA-> RNA) Protein Protein structure Protein synthesis. Replication. Transcription. Translation.

quasar
Download Presentation

COT 6930 HPC and Bioinformatics Introduction to Molecular Biology

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. COT 6930HPC and BioinformaticsIntroduction to Molecular Biology Xingquan Zhu Dept. of Computer Science and Engineering

  2. Outline • Cell • DNA • DNA Structure • DNA Sequencing • RNA (DNA-> RNA) • Protein • Protein structure • Protein synthesis

  3. Replication Transcription Translation Central Dogma of Biology: DNA, RNA, and the Flow of Information

  4. Lys Lys Gly Gly Leu Val Ala His Cartoon Space filling Oxygen Nitrogen Carbon Sulfur Ribbon Surface Protein • A sequence from 20 amino acids • Adopts a stable 3D structure that can be measured experimentally

  5. X-ray Crystallography

  6. X-ray Crystallography

  7. X-ray Crystallography

  8. The 20 amino acids • Each amino acid contains an "amine" group (NH3) and a "carboxy" group (COOH) (shown in black in the diagram). • The amino acids vary in their side chains (indicated in blue in the diagram).

  9. Protein Structure • Protein Structure • Primary structure (amino acid sequence) • Secondary structure (local folding) • Tertiary Structure (global folding) • Quaternary structure (multiple-chain) • Protein Structure Animation • https://mywebspace.wisc.edu/jonovic/web/proteins.html

  10. Primary Structure • Primary structure is described by the sequence of Amino Acids in the chain

  11. Polypeptide N-terminal C- terminal One end of every polypeptide, called the amino terminal or N-terminal, has a free amino group. The other end, with its free carboxyl group, is called the carboxyl terminal or C-terminal. Peptide: 50 amino acids or less Polypeptide: 50-100 amino acids Protein: over 100 amino acids

  12. Polypeptide • The amino acids are linked covalently by peptide bonds. The image shows how three amino acids linked by peptide bonds into a tripeptide.

  13. Secondary Structure • Secondary structure describes the way the chain folds • Local structure of consecutive amino acids • Common regular secondary structures •  Helix •  Sheet • b turn

  14. Secondary Structure • Alpha helix • Beta strand / pleated sheet • Coil

  15. Tertiary Structure of protein • Tertiary Structure describes the shapes which form when the secondary spirals of the protein chain further fold up on themselves.

  16. Quaternary structure (multi-chain structures) • Quaternary structure describes any final adjustments to the molecule before it can become active. For example, pairs of chains may bind together or other inorganic substances may be incorporated into the molecule.

  17. Protein Structure Space Protein folding taxonomy : all alphaall betaalpha/betaalpha+betaothers http://www.nigms.nih.gov/psi/

  18. rotatable rotatable Geometry of Protein Structure Total number of degree is 2*(n-1) where n is the length of the protein

  19. The Leventhal Paradox • Given a small protein (100aa) assume 3 possible conformations/peptide bond • 3100 = 5 × 1047 conformations • Fastest motions 10- 15 sec so sampling all conformations would take 5 × 1032 sec • 60 × 60 × 24 × 365 = 31536000 seconds in a year • Sampling all conformations will take 1.6 × 1025 years • Proteins do not have problem in folding, we have! ­ the Leventhal paradox

  20. Outline • Cell • DNA • DNA Structure • DNA Sequencing • RNA (DNA-> RNA) • Protein • Protein structure • Protein synthesis

  21. RNA 3 types of RNA

  22. Messenger RNA DNA: TAC CAT GAG ACT … ATC mRNA: AUG GUA CUC UGA … UAG

  23. Ribosomal RNA and ribosomes

  24. Transfer RNA

  25. Overview of protein synthesis Transcription: same language Translation: different language

  26. Overview of protein synthesis

  27. A. Transcription No Thymine, instead has Uracil

  28. 2. Translation, the final steps

  29. Rules (the secret of life) • G →C • C →G • Transcription: • A →U • T →A • Translation AUG: Methionine (Met)

  30. Codons and anticodons DNA: TAC CAT GAG ACT … ATC mRNA: AUG GUA CUC UGA … UAG tRNA: UAC CAU GAG ACU … AUC

  31. Protein structure databases Gene expression database transcription translation DNA RNA protein phenotype Protein sequence databases cDNA ESTs UniGene Genomic DNA Databases

  32. List of Amino Acids (1)

  33. List of Amino Acids (2)

  34. Transcription & Open Reading Frame (ORF) • Open Reading Frame (ORF) • Where to start reading codons (ATG) • 6 possible reading frames (3 forward, 3 backward) • Gene is usually longest ORF found • Forward reading frame example

  35. Complication – Non-coding Regions • Non-coding regions • Very little genomic DNA produce proteins • Exon – DNA expressed in protein (2–3% of human genome) • Intron – DNA transcribed into mRNA but later removed • Untranslated region (UTR) – DNA not expressed • UTRs may affect gene regulation & expression • Biological processes • Remove introns from mRNA, splice exons together • Transition between intron / exon = splice site • Splicing can be inconsistent • Some exons may be skipped • Result = splice-variant gene / isoform • Estimated 30% of human proteins from splice-variant genes

  36. Non-coding regions

  37. Control regions Exons Transcription Introns Splicing Transcription • The process of making RNA from DNA • Needs a promoter region to begin transcription.

  38. Alternative Splicing • One single gene produce different forms of a protein • A single gene can contain numerous exons and introns, and the exons can be spliced together in different ways

  39. Complication: Mutations • Mutations • Modifications during DNA replication • Possible changes • Point mutation / single nucleotide polymorphism (SNP) • 5’ A T A C G T A … • 5’ A T G C G T A … • Occur every 100 to 300 bases along the 3-billion-base human genome • Duplicate sequence • Inverted sequence • Insert / delete sequence ( indel )

  40. Mutations

  41. Mutations

  42. Outline • Cell • DNA • DNA Structure • DNA Sequencing • RNA (DNA-> RNA) • Protein • Protein structure • Protein synthesis

  43. Excellent Animation • Cell • http://www.youtube.com/watch?v=UB6G9GD2KFk • Central Dogma • http://www.youtube.com/watch?v=GkdRdik73kU

More Related