460 likes | 761 Views
MACiE – a Database of Enzyme Reaction Mechanisms Janet Thornton EMBL-EBI July 2006. Enzymes in Data Resources (2005). No. enzymes Total No. %-tage in DB
E N D
MACiE – a Database of Enzyme Reaction Mechanisms Janet Thornton EMBL-EBI July 2006
Enzymes in Data Resources (2005) No. enzymes Total No. %-tage in DB UniProt65,076 172,690 37.7 PDB (all entries) 14,143 31,522 44.9 PDB (non-redundant) 3,655 10,450 35.0 Reactome (human) (via UniProt) 230 680 33.8 Roman Laskowski
Relating the number of enzymes to proteome size Permissive set KEGG assignments Conservative set human mouse Number of enzymes worm fly Proteome size Shiri Freilich
Increase in Number of Different Reactions (E.C.) in larger proteomes E.C. 1 E.C. 4 E.C. 2 E.C. 5 E.C. 3 E.C. 6 Freilich et al (2005) JMB ??
Universal metazoa human sterol cholesterol steroid hormone bile acid Extension and Evolution of Pathways:The integration of the steroid biosynthesis pathway into the sterol biosynthesis pathway Shiri Freilich
Enzyme Structure, Function and Evolution Outstanding Research Questions: • How is catalysis performed • Principles of catalysis? • E.C. numbers • How do enzymes evolve? • Can we predict enzyme function from structure? • Can we design new enzymes? • What is the enzyme complement in different organisms and how does it evolve? Need a list of active sites Trypsin
What constitutes a catalytic residue? • Direct involvement in the reaction mechanism • Polarises or alters the pKa of a residue or water molecule which is directly involved in the reaction mechanism • Polarises or activates part of the substrate (e.g. making a bond more susceptible to cleavage) • Stabilisation of a transition-state intermediate
http://www.ebi.ac.uk/thornton-srv/databases/CSA The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Porter, Bartlett, & Thornton Nucl. Acids. Res. (2004) 32: D129-D133. -lactamase Class A; EC 3.5.2.6; PDB: 1btl • Reaction: -lactam + H2O -amino acid • Active site residues: S70, K73, S130, E16
Comparison of CSA, SwissProt & PDB (2004) Porter, Bartlett et al, 2004 NAR
Ligand Selectivity Conformational Change Templates Metabolome Catalytic Site Atlas Binding Site Diversity Spherical Harmonics
CSA Coverage and Annotations Generated • Current entries in CSA from Literature = 737 • Current proteins in UniProt annotated by homology = 14,863 • Functional annotations by homology are more accurate if catalytic residues are checked and conserved (George et al (2005) PNAS) BUT no possibility of storing or querying the proposed chemical mechanisms (which must be available to identify the catalytic residues in the CSA)
The MACiE Database - a Research Project http://www.ebi.ac.uk/thornton-srv/databases/MACiE Mechanism, Annotation and Classification in Enzymes G. L. Holliday, G. J. Bartlett, Daniel Almonacid P. Murray-Rust, J.M.Thornton J. B. O. Mitchell (Holliday et al Bioinformatics 2005 21:4315)
Why develop MACiE? • To understand more about catalysis • To gather information on mechanisms • To compare and contrast mechanisms in different proteins • To help validate enzyme mechanisms • To study the evolution of mechanisms • To develop mechanism-based classification of enzymes • To help predict mechanism from structure • To help design new enzymes
The MACiE Database http://www.ebi.ac.uk/thornton-srv/databases/MACiE
Content in MACiE • Enzyme Name:fructose-bisphosphate aldolase • E.C. Classification:(EC 4.1.2.13) • Obsolete EC codes associated with entry: EC 4.1.2.7 • Reference Structure: PDB 1b57 • Domain classification: CATH 3.20.20.70 • UniProt code: P11604 • Specie:Escherichia coli (Bacteria) • Cofactors: Zn2+ and Na+ • Catalytic residues: Asp109, Glu182, Asn286 • Links
Classifying Residue Catalytic Function Hydrogen Donor, Hydrogen bond acceptor, Proton Relay Nucleophile, Electrophile Radical relay, Hydride relay Radical Donor, Radical stabiliser Leaving group , Steric role, Charge stabiliser Covalently attached, Metal ligand
Overall Reactionfructose-bisphosphate aldolase D-glyceraldehyde 3-phosphate glycerone phosphate + D-fructose 1,6-bisphosphate
Similarly Step 2is annotated Where the information is available the rate determining step is annotated
Step 3 is annotated MACiE always endeavours to return the enzyme to its ground state. This is often inferred, which is noted in the annotation
Finally: any spontaneous changes are included These are often spontaneous and occur outside the enzyme There is no other annotation involved in steps like this
General Searches • Query MACiE by reaction comments • Query MACiE by enzyme and species (scientific and common) names • Query the chemical changes in MACiE • Overall reactants and products (by KEGG and ChEBI compound id or compound name)
Frequencies of amino acid reactants performing a given function
Strong preference for acid/base function No reactant Function No strong preference for function Combining the amino acid and functional clusterings
Roles of catalytic residues and mechanistic steps in homologous enzymes of different function • How do enzymes modify the chemical reaction they catalyse, using the same structural scaffold? • Do catalytic residues conserve their role and / or identity in enzyme-catalysed reactions? Gail Bartlett
Methods Twenty-seven pairs of homologous proteins of totally different function (at primary EC level) PSIBLAST run against NRDB + PDB (cutoff e=10-5) 178 enzyme dataset with assigned catalytic residues and proposed mechanism Structural alignment performed using SSM server Structurally equivalent catalytic residues Information manually extracted from literature Catalytic mechanisms Comparison of function, active site, catalytic residues and catalytic mechanism
Results - overview • 27 pairs of proteins • 3 enzyme / nonenzyme pairs • 24 enzyme / enzyme pairs, from 21 enzyme superfamilies
Enzyme / enzyme pairs • All but one enzyme pair have their active site located at the same place in the protein fold • Substrates and / or products shared by 11 pairs • Cofactor shared by 5 enzyme pairs
Metal ion binding sites Twelve enzyme pairs conserve metal binding sites and ligands to the metal ions are structurally aligned Where the metal ion type has altered, subtle mutations to the ligand binding site have occurred
Rubredoxin oxygen oxidoreductase / metallo--lactamase Metallo--lactamase uses a Zn2+-activated hydroxyl for nucleophilic attack on the -lactam substrate Rubredoxin oxygen oxidoreductase reduces dioxygen via a redox cycle at a di-iron site
Change in residue identity and residue function DNA hydrolysis Xylose Xylulose Endonuclease IV / Xylose isomerase
Change in residue identity and residue function His 109 Trp 136 Base catalysed ring opening, followed by intramolecular hydride transfer and ring closure Zn2+-assisted hydroxyl performs nucleophilic attack on DNA backbone Endonuclease IV Xylose isomerase
Evolution of Mechanism Mechanisms share a common step at the beginning of the overall reaction pathway, catalysed by residues which are structurally equivalent in both enzymes 7 Mechanisms share a common step somewhere in the middle of the overall reaction path, catalysed by residues which are structurally equivalent in both enzymes 7 Mechanisms share common steps at the beginning and end of the overall reaction path, catalysed by residues which are structurally equivalent in both enzymes, but have a different step in the middle 2 Mechanisms do not share any common steps catalysed by structurally equivalent residues 8 Bartlett et al JMB
Common first stepdehydroquinate synthase / glycerol dehydrogenase a. Dehydroquinate synthase several stages b. Glycerol dehydrogenase
Common first stepdehydroquinate synthase / glycerol dehydrogenase H287 E194 dehydroquinate synthase H275 H271 H269 Superposition of Zn2+ and ligands glycerol dehydrogenase D169 H255 H252
Conclusions • Enzymes are economical in their use of active site residues and features • It is more likely for residues conserving function to also conserve their identity • Residues not conserving function tend to mutate • Tend to find common mechanistic steps at the beginning and ‘middle’ of reaction paths – possibly the most energetically difficult step or intermediate is conserved
Future • Increase coverage in MACiE • Analyse Catalytic Mechanisms • Ingold Reaction Types; effects of non-polar residues • Evolution of enzymes, pathways & metabolism in different organisms & tissues • Design??
Acknowledgements • Gail Bartlett, Craig Porter, Jonathan Barker • The MACiE Team – Cambridge Univ Chemistry Dept John Mitchell, Daniel Almonacid, Peter Murray-Rust • BBSRC, MRC, Wellcome Trust, EMBL James Torrance Gemma Holliday Alex Gutteridge