1.24k likes | 1.47k Views
Hydrophobic Residue Patterning in β -Strands and Implications for β -Sheet Nucleation. Brent Wathen Dept. of Biochemistry Queen’s University. Outline. Part I: Introduction Proteins Protein Folding Part II: Protein Structure Prediction Goals, Challenges Techniques State of the Art
E N D
Hydrophobic ResiduePatterning in β-Strands and Implications for β-SheetNucleation Brent Wathen Dept. of Biochemistry Queen’s University
Outline • Part I: Introduction • Proteins • Protein Folding • Part II: Protein Structure Prediction • Goals, Challenges • Techniques • State of the Art • Part III: Residue Patterning on β-Strands • β-Sheet Nucleation • Hydrophobic/Hydrophilic Patterning
Outline • Part I: Introduction • Proteins • Protein Folding • Part II: Protein Structure Prediction • Goals, Challenges • Techniques • State of the Art • Part III: Residue Patterning on β-Strands • β-Sheet Nucleation • Hydrophobic/Hydrophilic Patterning
Part I: Introduction Proteins – Some Basics • What Is a Protein?
Part I: Introduction Proteins – Some Basics • What Is a Protein? • Linear Sequence of Amino Acids...
Part I: Introduction Proteins – Some Basics • What Is a Protein? • Linear Sequence of Amino Acids... • What is an Amino Acid?
Part I: Introduction Proteins – Some Basics • What Is a Protein? • Linear Sequence of Amino Acids... • What is an Amino Acid?
Part I: Introduction Proteins – Some Basics • How many types of Amino Acids?
Part I: Introduction Proteins – Some Basics • How many types of Amino Acids? • 20 Naturally Occurring Amino Acids • Differ only in SIDE CHAINS IsoleucineArginineTyrosine
Part I: Introduction Proteins – Some Basics • Amino Acids connect via PEPTIDE BOND
Part I: Introduction Proteins – Some Basics • Backbone can swivel: DIHEDRAL ANGLES • 2 per Amino Acid • Proteins can be 100’s of Amino Acids in length! • Lots of freedom of movement
Part I: Introduction Protein Functions • What do proteins do?
Part I: Introduction Protein Functions • What do proteins do? • Enzymes • Cellular Signaling • Antibodies
Part I: Introduction Protein Functions • What do proteins do? • Enzymes • Cellular Signaling • Antibodies • WHAT DON’T THEY DO!
Part I: Introduction Protein Functions • What do proteins do? • Enzymes • Cellular Signaling • Antibodies • WHAT DON’T THEY DO! • Comes from Greek Work Proteios – PRIMARY • Fundamental to virtually all cellular processes
Part I: Introduction Protein Functions • How do proteins do so much?
Part I: Introduction Protein Functions • How do proteins do so much? • Proteins FOLD spontaneously • Assume a characteristic 3D SHAPE • Shape depends on particular Amino Acid Sequence • Shape gives SPECIFIC function
Part I: Introduction Protein Structure • STRUCTURE FUNCTION relationship • Determining structure is often critical in understanding what a protein does • 2 main techniques • X-ray crystallography • NMR • 0.5Å RMSD accuracy • Both are very challenging • Months to years of work • Many proteins don’t yield to these methods
Part I: Introduction Protein Structure • Levels of organization • Primary Sequence • Secondary Structure (Modular building blocks) • α-helices • β-sheets • Tertiary Structure • Quartenary Structure • Hydrophobic/Hydrophilic Organization • Hydrophobics ON INSIDE • Hydrophobic Cores
Part I: Introduction Protein Structure
Part I: Introduction Protein Structure
Part I: Introduction Protein Folding • What we DO know... • Protein folding is FAST!! • Typically a couple of seconds • Folding is CONSISTENT!! • Involves weak forces – Non-Covalent • Hydrogen Bonding, van der Waals, Salt Bridges • Mostly, 2-STATE systems • VERY FEW INTERMEDIATES • Makes it hard to study – BLACK BOX
Part I: Introduction Protein Folding • What we DON’T know... • Mechanism...? • Forces...? • Relative contributions? • Hydrophobic Force thought to be critical
Part I: Introduction Intro Summary • Proteins are central to all living things • Critical to all biological studies • Folding process is largely unknown • Sequence Structure Mapping • Structure Function relationship • Determining Protein Structure Experimentally is HARD WORK
Outline • Part I: Introduction • Proteins • Protein Folding • Part II: Protein Structure Prediction • Goals, Challenges • Techniques • State of the Art • Part III: Residue Patterning on β-Strands • β-Sheet Nucleation • Hydrophobic/Hydrophilic Patterning
Part II: Structure Prediction The Prediction Problem Can we predict the final 3D protein structure knowing only its amino acid sequence?
Part II: Structure Prediction The Prediction Problem Can we predict the final 3D protein structure knowing only its amino acid sequence? • Studied for 4 Decades • “Holy Grail” in Biological Sciences • Primary Motivation for Bioinformatics • Based on this 1-to-1 Mapping of Sequence to Structure • Still very much an OPEN PROBLEM
Part II: Structure Prediction PSP: Goals • Accurate 3D structures. But not there yet. • Good “guesses” • Working models for researchers • Understand the FOLDING PROCESS • Get into the Black Box • Only hope for some proteins • 25% won’t crystallize, too big for NMR • Best hope for novel protein engineering • Drug design, etc.
Part II: Structure Prediction PSP: Major Hurdles • Energetics • We don’t know all the forces involved in detail • Too computationally expensive BY FAR! • Conformational search impossibly large • 100 a.a. protein, 2 moving dihedrals, 2 possible positions for each diheral: 2200 conformations! • Levinthal’s Paradox • Longer than time of universe to search • Proteins fold in a couple of seconds?? • Multiple-minima problem
Part II: Structure Prediction Tertiary Structure Prediction • Major Techniques • Template Modeling • Homology Modeling • Threading • Template-Free Modeling • ab initio Methods • Physics-Based • Knowledge-Based
Part II: Structure Prediction Template Modeling • Homology Modeling • Works with HOMOLOGS • ~ 50% of new sequences have HOMOLOGS • BLAST or PSI-BLAST search to find good models • Refine: • Molecular Dynamics • Energy Minimization
Part II: Structure Prediction Template-Free Modeling • Modeling based primarily from sequence • May also use: Secondary Structure Prediction, analysis of residue contacts in PDB, etc. • Advantages: • Can give insights into FOLDING MECHANISMS • Adaptable: Prions, Membrane, Natively Unfolded • Doesn’t require homologs • Only way to model NEW FOLDS • Useful for de novo protein design • Disadvantages: HARD!
Part II: Structure Prediction Template-Free Modeling • Physics-Based • Use ONLY the PRIMARY SEQUENCE • Try to model ALL FORCES • EXTREMELY EXPENSIVE computationally • Knowledge-Based • Include other knowledge: SSP, PDB Analysis • Statistical Energy Potentials • Not so interested in folding process • “Hot” area of research
Part II: Structure Prediction Template-Free Modeling • All methods SIMPLIFY problem • Reduced Atomic Representations • C-α’s only; C-α + C-β; etc. • Simplify Force Fields • Only van der Waals; only 2-body interactions • Reduced Conformational Searches • Lattice Models • Dihedral Angle Restrictions
Part II: Structure Prediction Template-Free Modeling • Basic Approach: 1. Begin with an unfolded conformation 2. Make small conformational change 3. Measure energy of new conformation Accept based on heuristic: SA, MC, etc. 4. Repeat until ending criteria reached • Underlying Assumption: Correct Conformation has LOWEST ENERGY
Part II: Structure Prediction Diverse Efforts • Data Mining • Pattern Classification • Neural Networks, HMMs, Nearest Neighbour, etc. • Packing Algorithms • Search Optimization • Traveling Salesman Problem • Contact Maps, Contact Order • Constraint Logic, etc. • Combinations of the above!
Part II: Structure Prediction ROSETTA • Pioneered by Baker Group (U. of Washington) • Fragment Based Method • Guiding Assumption: • Fragment Conformations in PDB approximate their structural preferences • Pre-build fragment library • Alleviates need to do local energy calculations • Lowest energy conformations should already be in library
Part II: Structure Prediction ROSETTA • Pre-build fragment library • 3-mers and 9-mers • 200 structural possibilities for each • Build conformations from the library • Randomly assign 3-mers, 9-mers along chain • During conformational search, reassign a 3-mer or a 9-mer to a new conformation at random • Score using energy function • Adaptive: Coarse grain at first, detailed at end • Accept changes based on Monte Carlo method
Part II: Structure Prediction Diverse Efforts • Data Mining • Pattern Classification • Neural Networks, HMMs, Nearest Neighbour, etc. • Packing Algorithms • Search Optimization • Traveling Salesman Problem • Contact Maps, Contact Order • Constraint Logic, etc. • Combinations of the above!
Part II: Structure Prediction State of the Art • CASP Competition • Critical Assessment of Structure Prediction • Blind Competition Every 2 years • CASP6 in 2004 - CASP7 just completed • ~75 proteins whose structures have not been published as yet • Easy homologs examples • Distant homologs available • De novo structures: no homologs known
Part II: Structure Prediction State of the Art • Template Modeling CASP6 Target 266 (green), and best model (blue) Moult, J. (2005) Cur. Opin. Struct. Bio.15:285-289
Part II: Structure Prediction State of the Art • Template Modeling • Alignment still not easy, and often requires multiple templates • Accurate core models (within 2-3Å RMSD) • Still not good at modeling regions missing from template • Side-chain modeling not too good • Molecular dynamics not able to improve models as hoped
Part II: Structure Prediction State of the Art • Template-Free Modeling CASP6 target 201, and best model. Vincent, J.J. et. al (2005) Proteins 7:67-83.
Part II: Structure Prediction State of the Art • Template-Free Modeling CASP6 target 241, and 3 best models. Vincent, J.J. et. al (2005) Proteins 7:67-83.
Part II: Structure Prediction State of the Art • How Good are Current Techniques? • CASP6 Summary: “The disappointing results for [hard new fold] targets suggest that the prediction community as a whole has learned to copy well but has not really learned how proteins fold.” Vincent, J.J. et. al (2005) Proteins 7:67-83.
Part II: Structure Prediction PSP Summary • Many diverse, creative efforts • Progress IS being made in finding final 3D structures • Less so with regards to understanding folding mechanisms • NEEDED: • Marriage of Creative Ideas and Increased Resources
Outline • Part I: Introduction • Proteins • Protein Folding • Part II: Protein Structure Prediction • Goals, Challenges • Techniques • State of the Art • Part III: Residue Patterning on β-Strands • β-Sheet Nucleation • Hydrophobic/Hydrophilic Patterning
Part III: β-Strand Patterning β-Sheet Basics • Made up of β-Strands • Diverse: • Parallel/Antiparallel • Edge/Interior Strands • Typically Twisted • Many Forms • β-sandwiches, β-barrels, β-helices, β-propellers, etc. • 2D? 3D? • Less studied than helices
Part III: β-Strand Patterning Beta Sheet Basics Internalin A Narbonin Polygalacturonase Galactose Oxidase
Part III: β-Strand Patterning Beta Sheet Basics • What do we know? • Residues: • V, I, F, Y, W, T, C L • Found largely in Protein Cores • Amphipathic Nature