310 likes | 437 Views
“The instructions for assembling every organism on the planet--slugs and sequoias, peacocks and parasites, whales and wasps--are all specified in DNA sequences that can be translated into digital information and stored in a computer for analysis. As
E N D
“The instructions for assembling every organism on the planet--slugs and sequoias, peacocks and parasites, whales and wasps--are all specified in DNA sequences that can be translated into digital information and stored in a computer for analysis. As a consequence of this revolution, biology in the 21st century is rapidly becoming an information science... ...hypotheses will arise as often in silico as in vitro.” Eric Lander, Science 287 (5459), 1777-1782
BioinformaticsFunctional Genomics ...TGTTATTGCCAATGCATCATGATCGTAGAGATGTAGGCTAAAGAAACTTTTTTAAAGTGTGATTTTACACAGCAGCTGGTGGTTACTGCTTAGCCATAAAATTATGTGTTGATATTGACATAAGTGGAATGTCGAAAACATGTTTATTCATACACGTATATTGTTGAAAGAAGTGTTAAACACATCATACGTTTAAAGTAGACAAATCTAGAAATATATAACAAAAGAGAATTTGTTTGTACATTTAAAAAGCCTTAGCTCTTTTTGAGCAAAATATGTAATATTATAATTTCAACACTTTTGAGTGTGTTTACTAAAACTAACAACCTTTATGAAGCAATTGGGATGGGTTTGTGTGATCTAAAATTGAGGTAGGATCGTGGGACCAGGAGAACATGAGATGCATCGAAGGCAAAGTCCAAAGACACTACGTCACTTTGAGTGGATTGTGTACGTGAGGTTGTACGAACCATCATTTCACGTTACATATGTTGCAACGCTGCGTAATCTGTCCCCACCATCCAACTTATTTCTAATTCTTTTTCATAATTATATTTGTATCTAAACTAAGTAGAATACCTACATATTGAAAAGAAATATTACTTGCTAGGGCCTAGGGATAAACGTTAAAGAGAAATATTACTATAGAGAAAAACCCTTTAGTTTTGAACACAATTTCTATGTGTAAACATGCGTGATTGTTATGTAGGCAGATGTGCACTAAGCACTTAAACAGAAAAAGAGACAAGTTTTGAACATGAGTTATGAGTAAAACATATAATTATAAAGTTTTGAACATGAGTTATGAGTAAAAACATATCATTATAAAGTTTTGAGAGCATCTATGAGTAAAAACAGACTAAATTAAATAGAACATATATATGGACAATATAGCAAGTAAAATCTCTCGGACTGACGAAATAGATGTAACTATACTACCAATGCTTGCGATTCAAACCGTCAAAAGTGAAAAATACTGTGAAATAACAAGTCAACTTCTATAAATGAAAATTAATATATATTGTAGACGGTTTATATATACGTTTGGATCATATCGATTCACTAATAAAAAAGTAACAAGTGTTATTCCACAAGTTGCATTTTTCATTTGTCTATAAGAACTTTATTGATCTTAAATAACGTTTATGCTTTTTCTTACACATATATGGAACCGAATTCAACAAAAACTTTCAAATAGATTTGTTTTATATGAACACAATTATGTATATTGATTGTCCATTTATGAAGAAAAGAATAATCTTATGCTTCACACCGTTGACAAAAAAAGATAAACGTATGTCATTAACCCTCTCTTTTTCTGGTACATGTCAGAGCAAAGAAAATTTACTAAAAGGTTATTTTGTTTTTCCTTGTTTGTCTATAAATCTACTGAAAGCTTTAAATGCAATTTTATTATTTTCAAGAATCGGTCTTTTTACACTTCCGCGATACTAGCTAAAGGAATGTACACGTGTCAGGATATTATTGTAGCATCTGATTTTTACCATTGATGGTAACTAAAGTTAGTAACAGAACCATCAGGTCCATCACCGTAAAATATGACTAGCACAGTAGCACCTTTACCAGAAAATAACGCCGTTAGCTGTTTTCATCAAGTCAACAACAACCATTTTAACGGCAGTTTAGATATATTTTAATTTTTTTTAACCATATATCATTGGAAGTAAAAAGTTGTCATACTAAAATATATATCTTTTAAGATCTCAAGGTAATTAAAAAAATAATCAAAGACGCTTAAAATCTGAAATTTTCAATAAAGTAATCATAAATTATTTTATGATTTAAATTTAGGTTTATCATAATAATCTTGAAAGAAATTAAATAATATAATAGTTTTAAGGAAATCTATTTATTTGTTTTTTGATAAATTACATATCTATAGTAAATTAATGATATTGAATGGTGTAGTTGGTTTCTCATAGGTAGAATTAAGACTTTCTCTTTTGTATTAATAAAATTTTATTTGACAACAAACAGAAGAAACCTTTTAATAAATTGAGAGAGATATACTATAGGACATGATATGACAAAACTGTCCTACTCAGTGAACTTCACTAACAGAGCTCAAATTTGAAATCTCAATGAATTAATATTTAATGACATAACAAGTCAAAGTGATGATATTAAATCTTTCCTCCTTGACAAGACGATAATACTTCAATTAACAAAAAAAAAAAAAAAAAGAGATGATAATTTTTCCAGAATAAAATATTTTCGACTTGGACTGGGAATAAAAAAAATGGGGTGACCTAAACCGACGTCGCATAGTTACGAGAGGCATATAAATAGAGAGCCACCAAATAGCCTAACGTAGTCCACCTTCACTTGCTCTCTTTTTTCCGAAGAATCTGGTTCGTTTCTCTTTGAATTATCTCTTTCTCTGTTCTTCTTCTTTTTCTAAAGTCTCGAATATATCTTCATTATCATTACTTTATTTGCTTCGTTCTTCCGCAGTTTTGTGCAAAGTTTTAACGATTTTTTGGTTTTGAACGAAGGCACAGAGAGAAGATGGCGAGTGGCCTCGAGGATATCGTGAACGAGAATGTTGATTTGGTATGGATTTCTTCTTTTTACTCTGTATATATTTTTTTGTTTCGTTTGTTTAACTTGTTTAATGAATATACGAGAGGAAAGTTTAGTTTGGTTTTGTTTTTTACTTAACGATTCGATCAAATTAGGGTTGTACACAAATCTCGTTACATCATCGTTGAACTATTGACTTCATGTTTTAGCATGAGATTTTTATTGTTTGATTTCGATGAGAAAAGAATCCATGAAAAAGTGTTAATATTATCATAATTATTTCAGTTTTTCTGCTTTTTTTTCAAAATTACTTTTTCTTTATTTATTTTTTTATTAGATTTTGCAACTGCACTAATTTAAAACAATTTGCAGTCACCAGAAATTTAGTTACAATTATGTAAAAGTTGTTTCCAAATAAAGCAAAAGTAATTACATTAGTAGTACTCTCTACTTTCACATGATCTATCTTAATGAGTATACTTTGAGATTTCTTAGAGAAGTTTTAGTCAATTCTGTTTTCAATTACTACGTTTTTGGATTTTTCATACTCATGGAATATCGCCTTATCAATCGGTTAATCACTTCCCTAGATAAAACTAAACTTCAATCTTCACCTGATTTTCTGGACTCAATTTAAGATACCATGTTATATAGACAGATTTGCCCCTATCAATACATCTTCCGTTACAATGGGCCTTATTGGGCCTTTAGGCTTATTATATAAGCGAATTAAAGTACAAGGCCTTGTGGTCCCACCTTAGATTCTCTGACATCGACAGCTACAGAACATTCCATAATTTTGGTCATCTAATATTTTTATGCTTTGATAAATAATTATAATCTTGATTATCTATTAATTTGCATGCTTATATTTTTTTCTTTTTTTTTGCTTTTTTTAGTACTGTAAAAGGTTAACATTTGTATCTGTAAAAAAGATGTACTACAGATAAAAACATACTTGGTTGTTTATTGGTGTAGGAGATACATAATTATTGAACACCATTTGAATTAGTCTAACATCGTGTTCATATTATATTGCAGGAGAAAATACCTATTGAGGAAGTGTTCCAGCAGTTGAAATGCAGCAGAGAAGGTTTATCTGGAGCAGAAGGAGAAAACAGACTCCAGATCTTTGGCCCCAACAAACTCGAGGAGAAGAAGGTATAGTTTGGTTTTAACCATTATCAAAGTCATCATACATGAGCTAAGTCACTAACTATGTAATGGTGGTTTCAATAAATAGGAAAGCAAGTTACTCAAGTTCTTGGGTTTTATGTGGAACCCTCTCTCTTGGGTCATGGAAGCAGCTGCAATCATGGCTATTGCCTTGGCTAATGGCGGTGGAAAGCCACCGGATTGGCAAGATTTTGTCGGTATCGTTTGTCTTTTGGTTATCAATTCAACTATCAGTTTTGTCGAAGAAAACAATGCTGGAAATGCTGCTGCTGCTCTTATGGCTGGTCTTGCTCCCAAAACTAAGGTCCCTCTTTATAAACTCCCTTCAAATACTCCTCTTTGCATTAGCATTTACGTGTATGTTTACTGTATACTTGGCATACGAATTACGCGAGTTTAACTATGTTACATAATTAGATTCTTAGCACCTAGTAGTTGTCTAACTTGAAAATGTTTGGCATAAGTAATTGGTTTACATATAATTTCATTACAAGCTTTTAATAATATTTGTTTTGTTATGACCAGGTTCTAAGAGATGGCAAATGGTCAGAGCAAGAAGCTTCTATTCTTGTTCCTGGAGATATTGTGAGCATCAAGCTTGGTGACATCATTCCTGCTGATGCACGTCTTCTTGAAGGTGATCCTTTAAAAGTCGACCAATCTGCTTTGACCGGAGAATCTCTTCCCGCAACGAAAGGTCCAGGGGAAGAAGTTTTCTCCGGTTCTACCTGCAAGCAAGGTGAGATTGAAGCGGTTGTGATAGCCACTGGAGTCCATACTTTCTTCGGTAAAGCGGCTCATCTAGTCGACAGCACAAACCAAGTTGGACATTTCCAGAAAGTTCTTACGGCGATTGGAAACTTCTGCATATGTTCCATAGCTGTTGGTATTGCGATTGAGATCGTTGTCATGTATCCGATCCAGCGCAGACACTACAGAGACGGAATCGACAATCTTCTTGTTCTTTTGATTGGTGGTATCCCCATTGCAATGCCTACTGTCTTGTCTGTCACAATGGCTATTGGATCTCACAAGTTGTCTCAACAAGGAGCTATCACAAAACGAATGACCGCCATTGAAGAAATGGCTGGTATGGATGTTCTGTGCAGCGATAAAACCGGGACTCTCACGCTAAACAAGCTTAGTGTTGACAAGAATCTTATTGAGGTTTATTGTAAAGGCGTTGAGAAAGATGAAGTTTTGCTTTTTGCTGCTAGAGCTTCAAGAGTTGAGAACCAAGATGCTATTGATGCTGCTATGGTTGGAATGCTCGCTGATCCAAAAGAGGTAATATATATAACTCTGTTTTTTCCTCTGTTTTAAAATTCTGTTTTTTCCTCTGTTTTGTTTCTTGAAGAAAATGTTCTGTTCTGTTTCTCATTGTTTGTTTCTTTCAGGCAAGAGCTGGAATCAGAGAGATTCACTTCCTACCATTCAATCCAGTTGATAAGCGAACCGCTTTGACGTTTATCGACAGCAACGGAAACTGGCACCGAGTAAGCAAAGGAGCTCCCGAGCAGGTAACTAAAACTGATTTTGGTTCTTAAGCTAAGTTAAGTGTTGTGACTTCTTAAAAAAACTGATTCTTTGATGATTTCAGATTCTTGATCTCTGTAACGCGAGAGCTGATCTGAGGAAGAGAGTCCACTCTACAATCGATAAGTACGCTGAGCGTGGACTTAGGTCATTAGCTGTTTCAAGACAGGTGAGAAGAAATCTTACATTTTGTAACAAGTTATAGGAAGAAGGAAAAAGAGAGAGAGAGTTTAATGTAAATTTTGTGTGTCTATGTTTCAGACTGTACCAGAGAAAACAAAAGAAAGCTCTGGTAGTCCATGGGAATTTGTTGGTGTGTTGCCTCTGTTTGATCCTCCAAGACATGATAGTGCCGAAACCATTAGAAGAGCTCTAGACCTCGGTGTCAATGTCAAGATGATTACTGGTTTATATCTAAACATTACTCTTCGCTATAGCGAAAACACCGAATTTCTTACTAATGTTTCCAATGGTTTTATAGGTGATCAACTTGCTATTGCGAAAGAGACAGGACGTAGACTTGGAATGGGATCAAACATGTACCCATCCTCTTCTTTACTCGGTAAACACAAAGACGAAGCCATGGCTCACATTCCTGTTGAGGATTTGATTGAGAAAGCTGATGGCTTTGCTGGAGTCTTCCCAGGTTTGTTTAAGAGCCATTTTTGTTTCCACAAACACAAACTTGCTCTTCTCTCTGTAACATTTGGTCTCTTTTTTTTCTTTACCAAATTCATTCAGAGCACAAATACGAAATTGTGAAGAAGTTGCAAGAAAGGAAGCATATCTGTGGTATGACTGGAGATGGAGTGAATGACGCTCCAGCATTGAAGAAAGCGGATATAGGTATCGCTGTGGCGGATGCAACTGACGCTGCACGAGGTGCTTCTGATATTGTCCTCACCGAGCCAGGTCTCAGTGTGATCATCAGCGCGGTTCTAACCAGTAGAGCCATCTTCCAGAGAATGAAGAACTACACAGTGAGTGTTCTCTTAATCTTAAACCATAACCTTTTTAGAATCTTTACACCATGATAACTTGTTCTCATGGTGTTTTTAAATGTTCCTTTAAACAGATATATGCAGTCTCAATTACCATTCGTATAGTGGTAAGTAATCAATCTTGTATGTATTCTTGAATATCATCAAATAATCTCTTTTTCTATATGATTTCTCACTTTGTGCCTTTTCACAGTTTGGGTTTATGCTCATTGCTCTAATATGGAAGTTCGACTTTTCACCATTCATGGTTTTGATCATTGCTATATTAAACGATGGTGAGTTTTTTTCTTCTCTTTAAAAGCTCTCTCTAAATTTTGGCTCTAATCGACTAACACATCATTAACTTACGCAACAGGAACCATCATGACTATCTCAAAGGACAGAGTCAAGCCTTCTCCAACACCAGATAGCTGGAAACTCAAAGAAATCTTCGCAACCGGCGTTGTTCTTGGAGGCTACATGGCTATAATGACTGTTGTTTTCTTCTGGGCCGCATACAAAACCGATTTCTTCCCGGTATGTAACATATAAACAACCTCTCTTTAGATTCCTTGTCACACAAGCATTAACTTTGTTTGTTTCCTATAGAGAACATTCCACGTGAGAGACTTGAGAGGCAGTGAACATGAGATGATGTCTGCTTTATACTTACAAGTCAGTATTGTGAGCCAAGCTCTTATATTCGTCACTCGATCTAGAAGCTGGTCTTTTACAGAACGACCTGGATATTTCTTGCTAATTGCTTTCTGGGTAGCACAATTGGTAAGCATTGTATCTTCTTGATTGATTATAAACTCTCTTGAATGATTGATTCATAAACTTTGACCTTGTAGATTGCAACGGCTATTGCGGTTTACGGTAATTGGGAATTTGCAAGAATCAAAGGAATAGGATGGGGATGGGCTGGAGTAATCTGGCTTTACAGTATTGTCTTCTACTTTCCATTAGACATAATGAAATTCGCAATCCGTTACATACTAGCCGGAACAGCCTGGAAGAATATCATTGACAACAGGGTACGTTTTGCTTTCTTCTTAGTTCTTCTTCTGAAATGTGATTTTAAGACTTAAACAAAATCTCTCTGTGTCTTTGAAAAAACAGACTGCGTTTACCACTAAGCAAAACTATGGAATAGAGGAGAGGGAAGCACAATGGGCTCATGCACAAAGGACTTTACACGGTCTTCAGAACACTGAAACAGCCAACGTTGTCCCGGAGAGAGGTGGTTACAGAGAACTGTCTGAGATTGCAAATCAAGCCAAGAGACGAGCTGAGATCGCAAGGTAAATGATTTGTAAAAAGTAATTTCGGTTTCTTGTGAGAGAAGTTAACGTTAGGGTTTTTGTTTACAGGCTTAGGGAACTTCATACGCTTAAAGGACATGTAGAGTCAGTGGTGAAGCTAAAGGGACTAGACATTGAGACAGCTGGTCACTACACCGTTTAATAAAGATTTAACAAGAAGATGATTAAGGAAGAAGAAGATCCTTTTGTTTAAGAGATGATAATGATGATGATTTTATAACAAGAGAATAAGGTTTTTTTTCTGTGTATGTTAAGTTCCTCTAAACTCCTGGGGTTTTTCTTCCTTCCTTATTCATGGTTTTAGACTTTAATTATAACCATGTGTTTTTGTTTCCCCTCTCTTGAATAAAAAAGAGAAGATACTAATATCATAGGTTTAATACGTTTATGTTTTATAAAAGTTCAATAGATATTGTTTTTGGATATTATGTGAACATAACAATATTTATAAATTCACCTTTCAATTTTCTTATGAGATGATATCATCTTTACTACTTAAAATATCTTCTAATTAAACCAAAAGCAGATTTAGGTCCGGTTAGATGAATAACCGGTTCGATTAATTTCGTCAGACCGATTTGATCCGTGGATATCATTAATCCGTGGAATACATGTGTGTTCTTGAGTAGGCCACTAGTTATAATCCACGTGGCAAGCATTTTATCACAAACCCAGAGCTACAAAAACAAAACCAAGAAGACGAGAAGGATCTTTAACAGAGAAAAAAGAGCTTTAGACCAAATCACCAGAAACAAAACAAAAATGTATTTCGCCGCCATAGCTTCTTCACGACAAAGCTTTTTGTCAAACAATTTCAGTTTCCAACACAGTTTTAAACCCAAATCCAATGTTAACCTCACTCGTCCTAATTCAATCTGTTGTAAATCTTCACACCACGACGACGAAACTGATTCTTCTCGGTAATTTTTATTTACCTTTAGAAAATTTTTTTCCGATTCTAAACTCGGGTTTGGTCTTATTCAATCTTTTATTTATCTATTGATTAGGAAAAATGAAAATCAGCTGGCGAAGTTGGCAATCGCGACGCTAGCGGTTGGCGTTTTGGCTTATTAAGGAAGAAGAAGATCCTTTTGTTTAAGAGATGATAATGATGATGATTTTATAACAAGAGAATAAGGTTTTTTTTCTGTGTATGTTAAGTTCCTCTAAACTCCTGGGGTTTTTCTTCCTTCCTTATTCATGGTTTTAGACTTTAATTATAACCATGTGTTTTTGTTTCCCCTCTCTTGAATAAAAAAGAGAAGATACTAATATCATAGGTTTAATACGTTTATGTTTTATAAAAGTTCAATAGATATTGTTTTTGGATATTATGTGAACATAACAATATTTATAAATTCACCTTTCAATTTTCTTATGAGATGATATCATCTTTACTACTTAAAATATCTTCTAATTAAACCAAAAGCAGATTTAGGTCCGGTTAGATGAATAACCGGTTCGATTAATTTCGTCAGACCGATTTGATCCGTGGATATCATTAATCCGTGGAATACATGTGTGTTCTTGAGTAGGCCACTAGTTATAATCCACGTGGCAAGCATTTTATCACAAACCCAGAGCTACAAAAACAAAACCAAGAAGACGAGAAGGATCTTTAACAGAGAAAAAAGAGCTTTAGACCAAATCACCAGAAACAAAACAAAAATGTATTTCGCCGCCATAGCTTCTTCACGACAAAGCTTTTTGTCAAACAATTTCAGTTTCCAACACAGTTTTAAACCCAAATCCAATGTTAACCTCACTCGTCCTAATTCAATCTGTTGTAAATCTTCACACCACGACGACGAAACTGATTCTTCTCGGTAATTTTTATTTACCTTTAGAAAATTTTTTTCCGATTCTAAACTCGGGTTTGGTCTTATTCAATCTTTTATTTATCTATTGATTAGGAAAAATGAAAATCAGCTGGCGAAGTTGGCAATCGCGACGCTAGCGGTTGGCGTTTTGGCTTTGGGAAGTGTTGGGGATGCGTTTGCGGCCAAAAGCGGTGGAAGAATCGGCGGTCAAGCGTTTCGGTCTTCAGCTCCTCGTCCTCCTCCCAGAATTAACAATCGATCAAGGTAAAATTTTTTACTTGAAGCTCTAATCTCGATTTTAATTATTTAATATGTATATGATTAAAATACTGGCGAAAAAAGTATATATTTAAATTAAGACCTTAATTGAGATTTCTTACAAGAGACTCCATGTATGGGCTTTTAGCAGCCTCATATCCACATTAAAGCCCATTAGTAAACTTTAAGCCTCTTATAATTTTCTATTACACTTTATATTTTGAACCCCATAATTACAATTCGAATCCAAGATACCCCAACGACCAACACTAATAACTTGATTCAATGCATTAGGCTTTTCTGTTTGCATTATATAATCACGACTTTAGTCTTAGTAGTTAGTACCTTTGCATTCTAAACGCAGCTTTGTTTTCGCAGGACCAACATCTACGTGAATCCTCAGGTTGCACCACCTTTGATCGGTGGCTATGGATATGGTTACGGAGGTTACGGATGGTCACCGTTTTCCTTTTTTGCCCCTGGTCCTGCGGTGGCAGTAGGTGTTGGCGGTGGCTTTGACCTCCTACTTCTCTTCATGTTTTTTGGAGCCGCTTCAGCCGTCGCTAGAAACTTTTTCCGATCAAGAAATGATGAAGATGACGAAGACGACTATTAGAGGAGAAAACGGAGAGAGGCTTATGTAACAATAGTCATACATACGATTTTGTGTAAAAAAATATGATGGAAGAAGACGGATATATGTGGAAATCTTGAAATAAGTAACTAAAGACCCATCTTAAAGTTAATAGAGGCAATTTGAAAACACCTCCAAATACTAAATTTATATACAAATTTTCTCTGTTTTTCCTTCATTCTTTATATTTTAGGTTCTAGTGTATGACCTCATTGGTCTCCGAAAGCAAGTGAAAATAAATTCTTTAAGCAACCAAATATAAAGAAAACATTGGTATATTTTGTCAATGGTTTATCAGTGTTAACGAAGAAGAATAATTAGACCTAGCTAAAAGTGAAATCAGTAAATCTCGTAAATAATAATAACGATAAATCAGATACAAAAAGAAAATAAAAAAATCAAGAAGAAGACGAAGAAGAAAAAAGAAAGAGTCAGTGAATTAACAAAAATCAAGAAGAGGTTCGGTTAATACTAAGTCCTTGAAAGAGCCAATGAGGTCTCTTCTTCTCTCGATCCAACGACGTCGTACTTGAGCTGCTCATAGGCGTGTTTGGTCCGCTACTTTCTTGGTTATTATTGCTGTTATTATATAGATTATTATTGCTCTGATTATTACAATTATCCGATGATGAT... The analysis of the DNA sequence of genes and genomic regions reveals evolutionary relationships, and can be used to (amongst a gazillion other things), to... • assign function to DNA that codes for peptides, • at the physiological level (i.e medical in Humans), • at the protein structure level.
The Problem(s) Big Problem:Cardiovascular Disease: 40.1% of deaths in 1999 (USA) Genomic Challenges besides being huge • …recognizing genes in the genome, • Exons/Introns, • Alternative Splicing/Termination, • Alternate transcription start/stop sites, • Tandem Repeats, Psuedogenes, etc. • We don’t really understand all there is to know about gene structure, • etc. • …determining the function of the gene, • no known homologies, • gene families, • etc.
Nature: 407, pp. 234 Cardiovascular Disease
Cholesterol, a steroid alcohol (sterol) used to insulate nerves, produce human sex hormones and as a component in cell membranes, produced by the body and supplied by diet. Lipids…major players in heart disease. • Triglycerides, • most common fat in the body, energy storage molecule, • found in fat cells, blood plasma, and cell membranes, • derived from eaten fats or made from other energy sources.
Apolipoprotein Gene Family A (APOA) Low Density Lipoprotein LDL Associated with HDL, Removes cholesterol from the blood, Modulates trigyceride levels. Lipoproteins…and classes.
Chromosome 11 (H.s.) • Several apolipoproteins map to a region of chromosome 11, including two APOA genes, • There is a polymorphism that is linked with high serum triglyceride levels that maps to the same region.
Hypertriglyceridemia Why bad? …high levels have been linked with coronary artery disease, • American Health Association Recommendation • Changes in lifestyle habits are the main therapy for hypertriglyceridemia… • If you're overweight, cut down on calories to reach your ideal body weight. • Reduce the saturated fat and cholesterol content of your diet. • Reduce your intake of alcohol considerably. • Be physically active for at least 30 minutes on most days each week. • Substitute monounsaturated and polyunsaturated fats • Substitute fish high in omega-3 fatty acids instead of meats that are high in saturated fat.
Mus Musculus A Solution…to finding genes. Mouse Genome: ~ 2.75 Gb, 19 autosomes + X and Y chromosomes, 99% per cent of the mouse genes have a sequence match in the human genome, 96% lie within 'syntenic' regions of mouse and human chromosomes. Nature420, 515 - 516
Conserved Synteny Description of DNA segments in which gene order is identical between species.
% similarity between M.m. and H.s. The SolutionComparative Genomics High homology, no annotation. • How did they know the region was expressed? • - not including the Northerns.
New Gene Structure? Most similar to each other (71% identity, 78% similarity), Next most similar to APOAIV (27%, 48%) Mouse: 1107 Open Reading Frame (ORF), 368 AA, Human: similar structure, 366 AA,
movie Mech Dev. 76(1-2) PNAS 97 GFP wt Glutamic Acid Decarboxylase 67 The Solution…to determining function (I, II). I. Transgenic Mice: - DNA is integrated in a random fashion by injecting it into the pronucleus of a fertilized ovum, II. Knock-Out Mice: • - DNA is introduced first into embryonic • stem (ES) cells. ES cells that have • undergone homologous recombination are • identified and injected into a 4 day old mouse • embryo - a blastocyst.
Pronucleus: either of the 2 haploid gamete nuclei just prior to their fusion in the fertilized ovum. Transgenic Mice Pronuclear Injection
1, heart; 2, brain; 3, placenta; 4, lung; 5, liver; 6, skeletal muscle; 7, kidney; or 8, pancreas. WT Mouse mRNA hybridized with mouse APOAV cDNA WT Human mRNA hybridized with human APOAV cDNA 1 and 5, transgenic liver; 2 and 6, transgenic intestine; 3 and 7, wild-type liver; and 4 and 8, wild-type intestine. Fig. 2 H.s. APOAV probe
APOAV Transgenic Mice“overexpressors” n=48 (transgenics), n=46 (controls), P < 0.0001
transgenic KO Typical KO vector…and homologous recombination. *tk:thymidine kinase + gancyclovir = lethal
KO Mice w/ Embryonic Stem Cells
Chimeric mouse Black/White Chimeric Example Cross and look for offspring with germ-line transfection.
Expression of native APOA mRNA. Ref. 25 Fig. 2
APOAV KO Mice“underexpressors” n=13 (wt), n=22 (hets), n = 10 (homozygous mut.) P < 0.001 Fig. 3
Murine Models are Good…but what about us? • Identified 4 SNPs in and near the human APOAV locus (each occurs at >8%), • Surveyed ~500 Caucasians for numerous lipid parameters before and after high- and low-fat diets, Fig. 1A
1,1 = homozygous for major allele 1,2 = heterozygous for major allele • Individuals heterozygous for the minor alleles at SNPs 1-3 had significantly higher triglyceride levels and VLDL mass. • SNP4 does not show a significant difference in triglyceride levels or VLDL mass.
Linkage disequilibrium:When the observed frequencies of haplotypes in a population does not agree with haplotype frequencies predicted by multiplying together the frequency of individual genetic markers in each haplotype, English Translation:The co-occurrence of genetic markers more often than would be expected. SNP1, SNP2, & SNP3 have significant disequilibrium with each other while SNP4 is not in linkage disequilibrium with any other SNP, English Translation: this genetic region containing SNP1-3 segregates as a “unit”.
Independent Association Study Data from an independent set of individuals demonstrating that subjects with minor alleles at SNP3 are significantly more likely to have high triglyceride levels than low.
Summary • SNP data shows that APOAV plays a significant role in plasma triglyceride and VLDL homeostasis. • Polymorphisms of APOAVcould be used as prognostic indicators for hypertriglyceridemia. • Modulation of APOAV is a potential strategy for reducing risk of cardiovascular disease. • Comparative Genomics uncovered a new gene in the human genome. • Mouse transgenic and KO experiments suggest that the protein contributes to triglyceride and VLDL levels.
APOAV Update http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed&cmd=search&term=apoav
Weds. ~Genome scale KO experiment.