430 likes | 501 Views
Conservation At silent sites. D GAC GCC A. R CGT CGC R. Codon models. Synonymous substitution. Nonsynonymous substitution. Ks. rate of synonymous substitutions. Ks. sequence position. Na ï ve assumption: no selection against synonymous substitutions. Selection. L T S I
E N D
Conservation At silent sites
D GAC GCC A R CGT CGC R Codon models Synonymous substitution Nonsynonymous substitution
Ks rate of synonymous substitutions Ks sequence position Naïve assumption: no selection against synonymous substitutions Selection
L T S I CTT ACA AGC ATC CTT ACA AGC ATC CTT ACA AGC ATC L T S I G R GGG CGT GGT CGG GGA CGA G R T A ACT GCC ACG GCT ACA GCA T A Species 1 Species 2 Species 3 Ks sequence position Synonymous purifying selection (conservation) • Protein folding • Splicing regulatory elements • mRNA structure • Overlapping genes • Codon bias
Testing for synonymous selection H0: free from synonymous selection → constant Ks H1: under synonymous selection → variable Ks likelihood ratio test
Research objective Quantify and characterize the magnitude and role of synonymous purifying selection
Comparative sequence data > 20 million years S.castelli S.bayanus S.mikatae S.paradoxus S.cerevisiae 70%-90% coding DNA sequence identity
GATCGATTC GATCGATTA GATCGGTCC GCTCGGTCC GATAGACAT Comparative sequence data 5,135 datasets of multiple sequence alignments + phylogenies (5,182 of ~6,000 S. cerevisiae genes) ? Obtained from Wapinski et al., Nature 2007
Under synonymous selection 54.4% (2,794) 45.6% (2,341) Not under synonymous selection
position Under significant synonymous selection Under synonymous selection Ks 12.4% (640) 42% (2,154) 45.6% (2,341) Not under synonymous selection
Synonymous selection underlies codon bias Different organisms prefer specific codons over others that encode the same amino acid R: S. cerevisiae AGA 48% AGG 21% CGA 7% CGC 6% CGG 4% CGU 14%
Codon bias maintains translational efficiency Translation accuracy Translation speed
Codon adaptation index (CAI) quantifies codon bias Sharp and Li. Nucleic Acids Res, 1987
Synonymous selection underlies codon bias synonymous selection ~ codon bias position
Codon bias (synonymous selection) derives from protein structure Translation speed Translation accuracy
Codon bias at the protein 3D structure S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY)
codon bias core > codon bias surface S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY)
codon bias interface > codon bias surface S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY)
Silent mutations may change 3D structure
MDR1 is a member of the ABC transporter family. They pump drugs out of the cell utilizing ATP, which change conformation of the protein. These proteins were shown to induce multi-drug resistance in various cancers.
C3435T is a synonymous SNP that was reported to be a risk factor for several diseases such as Parkinson’s diseases, colon cancer, and renal epithelial tumor. • It can be either because: • Change in mRNA level • Change in splicing • Linkage disequilibrium with other causative SNPs • Something else
Bodipy FACS analysis. In purple – cell transfected with empty vector All other colors – cell trasfected with a vector containing MDR1 (various haplotypes) MDR1 pumps the drug (Bodipy) out of the cells.
All other colors – cell trasfected with a vector containing MDR1 – various haplotypes The inhibitor works differently on the various haplotypes
They showed that synonymous substitutions did not change protein levels but rather the structure. This was shown by differential response to specific antibodies. Important for linking SNPs to diseases.
Silent mutations in HIV-1
Conservation of Ks in pol Mayrose et al. Bioinformatics/ISMB (2007)
Conservation of Ks in pol (zoom in) DNA flap CTS ? cPPT
cPPT A This region serves as a primer for the reverse transcriptase in the synthesis of the plus-strand DNA. cPPT
CTS = Central Termination Sequence A CTS The CTS is involved in the nuclear import of the HIV-1 genome.
Silent mutations -5' mRNA region
Kudla et al. showed that the levels of GFP – which is a protein whose gene can easily be inserted into a host genome and its levels can then be easily quantified, are strongly affected by the secondary structure of the 5’ end of the mRNA.
Non- stable mRNA secondary structure at the 5’ end -> higher GFP level. Stable mRNA Non stable mRNA
Mechanism: stable secondary structures at the 5’ end of the mRNA obstruct ribosome binding to the mRNA and result with lower protein levels
Based on that we hypothesized that the 5’ end of the mRNA should show signals of strong synonymous selection. This is exactly what we found in our yeast data… In addition, we found that the codon bias is reduced at this region, as to allow non-stable mRNA structures.