10 likes | 119 Views
Asp. Stop. CUA. Institute of Molecular Biotechnology. Jena. AUC. GAU. UAG. Asp. STOP. THE PURINE-PYRIMIDINE CLASSIFICATION SCHEME REVEALS NEW PATTERNS IN THE GENETIC CODE. Swetlana Nikolajewa and Thomas Wilhelm
E N D
Asp Stop CUA Institute of Molecular Biotechnology Jena AUC GAU UAG Asp STOP THE PURINE-PYRIMIDINE CLASSIFICATION SCHEME REVEALS NEW PATTERNS IN THE GENETIC CODE Swetlana Nikolajewa and Thomas Wilhelm Institute of Molecular Biotechnology Jena, Germany, http://www.imb-jena.de/tsb We present a new classification scheme of the genetic code, based on a binary representation of purines (A, G = 1) and pyrimidines (U, C = 0). On the basis of this logical organization we detect codon regularities (strong, mixed and weak codons) and patterns of amino acid properties and show symmetry characteristics of the genetic code. We present the “codon - reverse codon” pattern and use it to remove the only ambiguity which had remained in our recently proposed new classification scheme of the genetic code. The purine-pyrimidine scheme has now found its final form. We show that the “codon - reverse codon” pattern is correlated to aspects of the tRNA anticodon distribution in all organisms. It is well known that there is no tRNA with an anticodon for any of the STOP codons; we found that there is also no tRNA containing reverse STOP anticodons. The classification scheme of the genetic code is based on a binary representation of purines (G,A) - 1, pyrimidines (C,U) - 0. The eight rows (23 = 8) and four columns are sufficient to place the 20 amino acids, as well as the termination codons. The four combinations of the first column (CC*, GC*, CG*, and GG*) always imply 6 hydrogen bonds in complementary base-paring with the corresponding anticodon of the tRNA. Accordingly, they are called strong codons. The first two bases guarantee 5 H-bonds: mixed codons. Weak codons have just 4 H-bonds . Each row contains exactly 4 different amino acids (including the stop codon).Exceptions in the standard code are the second row with two leucines and the fourth row with the AU* start codon. The common table of the standard genetic code Interestingly, in 5 different mitochondrial codes (including yeast) AU(A/G) always encodes for Met and UG(A/G) encodes Trp. Thus, 32 positions code for 32 entries and each row contains exactly four different amino acids. In this spirit the mitochondrial code is the most regular one. The codon-anticodon symmetry: the red horizontal line marks the symmetry axis. For instance, codon CCC (Pro, first column, first row) has the anticodon GGG (Gly, first column, last row). The point symmetry corresponds to Halitsky’s family – nonfamily symmetry operation (“E-M bifurcation”, Halitsky 2003). For instance, the family codon GUA (Val) is mapped into the nonfamily codon UGC (Cys). Thus, this point symmetry is behind the family – nonfamily symmetry in our scheme (shaded vs. unshaded regions). In 8 different mitochondrial codes UG(A/G) encodes Trp. The arrows represent “codon - reverse codons” pairs. “Codon - Reverse Codon” Pattern The arrows in the above table indicate “codon - reverse codons” pairs: the reverse codon of a codon XYZ is defined as ZYX, where X, Y, Z can be any base. For instance, the reverse codon of GGA (Gly) is AGG (Arg). There exist 15 different amino acids in the rows 000, 101, 010 and 111 where the codon is reverse to itself, e.g. Lys (AAA), Tyr (UAU). • The table can be divided into four blocks (codon - reverse codon groups) of the same size, for instance the upper left block with Pro (P), Ser (S), Ala (A) and Thr (T). Each block shows the same arrow pattern. All strongly evolutionary conserved groups of amino acids (Thompson et al., 1994) are subsets of exactly one codon - reverse codon group, e.g. the MILV amino acids belong to the upper right block in the table. The other conserved strong groups belonging to one block are STA, NEQK, NHQK, NDEQ, QHRK, MILF, HY. The only exception is FYW. • 2. We studied all known tRNA genes of 104 different organisms (Sprinzl et al., 1999). It is known that the STOP codons do not have any tRNA. We found that there are also no tRNA genes containing anticodons reverse to the STOP anticodons (ACT, ATC and ATT in Table 2). The only exception is H. sapiens, possessing one tRNAAsn gene with the anticodon ATT, but humans also have three different possible suppressor tRNA genes (Lowe and Eddy, 1997). Table 2. The amino acid codon- reverse codon pairs and the number of tRNA genes for the corresponding anticodons. 3. There are some anticodons, where the reverse anticodon has no tRNA, e.g. all 104 studied organisms have at least one tRNA for Tyr with anticodon GTA (some organisms, for instance H.sapiens, have different tRNAs with the same anticodon, altogether there are 189 different tRNAs with the anticodon GTA in the 104 species), but no organism has a tRNA for His with anticodon ATG. Table 2 lists different codon - reverse codon pairs. In the whole superkingdom bacteria the number of tRNA genes for the reverse part of table 2 is always zero, the few exceptions are only in eukaryotes and archaea. Another unique property of all bacteria is that there is no tRNA with anticodons for the self reverse codons UUU (Phe), UCU (Ser), UGU (Cys) and UAU (Tyr). Generally, table 2 shows that anticodons with an A at the third position are strongly preferred, whereas A** anticodons are significantly suppressed. REFERENCES Crick, F. H. C. (1968) The origin of the genetic code. J. Mol. Biol., 38, 367- 379. Halitsky, D. (2003) Extending the (hexa-)rhombic dodecahedral model of the genetic code: the code’s 6-fold degeneracies and the orthogonal projections of the 5-cube as 3-cube. Contributed paper (983-92-151), American Mathematical Society; and personal communication. Jungck, J.R. (1978) The genetic code as a periodic table. J. Mol. Evol., 11, 211-224. Lagerkvist, U. (1978) “Two out of three”: An alternative method for codon reading. Proc. Natl Acad. Sci. USA, 75, 1759-1762. Lagerkvist, U. (1981) Unorthodox codon reading and the evolution of the genetic code. Cell, 23, 305-306. Lowe, T. M., Eddy, S. R. (1997) tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res., 25, 955-964. http://lowelab.ucsc.edu/GtRNAdb/ Nikolajewa, S., Beyer, A., Friedel, M., Hollunder, J., and Wilhelm, T. (2005) Common patterns in type II restriction enzyme binding sites. Nucleic Acids Res., 33, 2726-2733. Sprinzl, M., Vassilenko, K. S., Emmerich, J., Bauer, F., (1999) Compilation of tRNA sequences and sequences of tRNA genes. Last update 2003. www.uni-bayreuth.de/departments/biochemie/trna/ . Roberts, R.J., Vincze, T., Posfai, J., Macelis, D. (2005) REBASE—restriction enzymes and DNA methyltransferases Nucleic Acids Res., 33, D230–D232 Thompson, J. D., Higgins, D. G., Gibson T. J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 4673-4680. Wilhelm, T., Nikolajewa, S. L. (2004) A new classification scheme of the genetic code. J. Mol. Evol., 59, 598 -605 . Woese, C. R. (1967) The genetic code: The molecular basis for Genetic Expression. Harper & Row, New York.