1 / 27

Rosalind Elsie Franklin

Rosalind Elsie Franklin. Biophysicist and crystallographer X-ray diffraction images of DNA Tobacco mosaic and polio viruses 1920-1958 (source: wikipedia). A Structural Split in the Human Genome. Clara S. M. Tang and Richard J. Epstein PLoS One (2007) 7:e603 February 13, 2007

kiaria
Download Presentation

Rosalind Elsie Franklin

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Rosalind Elsie Franklin • Biophysicist and crystallographer • X-ray diffraction images of DNA • Tobacco mosaic and polio viruses • 1920-1958 (source: wikipedia)

  2. A Structural Split in the Human Genome Clara S. M. Tang and Richard J. Epstein PLoS One (2007) 7:e603 February 13, 2007 I. Elizabeth Cha

  3. Introduction • PCIs Promoter-associated CpG islands • Mediate methylation-dependent gene silencing • Co-locate to transcriptionally active genes • 60% of human genes contains PCIs

  4. CpG Islands • Genomic regions containing high frequency of CG dinucleotides • CpG cytidine-phosphodiester-guanosine • Formal definition • At least 200bp • GC percentage >50% • CpG ratio >60%

  5. DNA Methylation

  6. Materials and Methods • Sequence data and annotations • Determination of CpG island overlapping transcription start site • Housekeeping genes and paralogs of pseudogenes • Bimodal distribution of GC content • Gene expression data • Evolutionary rate determination • Principal component analysis

  7. Sequence Data and Annotations • UCSC genomic assemblies, RefSeq dataset, Emsembl gene dataset • Human (hg18, 3/2006) • Mouse (mm6, 3/2006) • Fugu (fr1, 8/2002) • Fruit fly (dm2, 4/2004) • Worm (ce2, 3/2004)

  8. Data Preprocessing • RepeatMask – Alu • Discard sequences • Not commencing with ATG codons • Not terminating with canonical stop codons • Retain the longest genomic sequences containing identical exonic sequences

  9. Determination of CpG Island Overlapping Transcription Start Site • Download CpG islands annotation (cpgIslandExt) from UCSC • Identify CpG islands overlapping with promoter regions • Map with RefGene annotation (200bp upstream and 500bp downstream)

  10. Data and Tools • 502 Housekeeping genes • 1220 pseudogene paralogs • NOCOM program • SAGEmap • Homologue data • XSTAT

  11. Results – PCI+ Genes • Housekeeping gene higher GC content lower intron length/number • Pseudogene paralog lower GC content higher intron length/number • Functional distinguishable

  12. Table 1

  13. Results – PCI- Genes • Higher evolutionary rate • Narrower expression breadth than PCI+ genes • More frequent tissue-specific inactivation

  14. Figure 1 Biphasic GC/AT Distribution of PCI+ Genes A. Distribution of GC content among different regions of genes intronic coding region 5’ UTR 3’ UTR

  15. Figure 1 Biphasic GC/AT Distribution of PCI+ Genes (cont’d) B&C Proportion of genes among different GC groups.

  16. Figure 2 GC Content of Promoter vs. Non-promoter CpG Island Overlapping Genes Genes with medium total intron size (10-50kb) All genes Genes with short total intron size (<10kb) and long intron size (>50kb) Intronless genes PCI+: solid line; PCI-: dash line

  17. Figure 3 Distribution of Coding GC% of RefGenes with PCIs House-keeping genes pseudogenes

  18. Figure 4 Quantitative Comparison of Gene Subsets L: low, GC<40%; H: high, GC>65%; double dark, <0.001; single dark, <0.01; open, < 0.05

  19. Figure 4 Quantitative Comparison of Gene Subsets (cont’d) L: low, GC<40%; H: high, GC>65%; double dark, <0.001; single dark, <0.01; open, < 0.05

  20. Figure 4 Quantitative Comparison of Gene Subsets (cont’d) L: low, GC<40%; H: high, GC>65%; double dark, <0.001; single dark, <0.01; open, < 0.05

  21. Figure 6 Model of human genomic evolution

  22. Conclusions • PCIs • Transcriptional regulators • Evolutionary accelerators to facilitate intron insertion • Mthylated PCIs on transcription and chromatin accelerate adaptive evolution towards biological complexity

  23. Conclusions • Adaptive evolution of human genome • Declining transcription of a subset of PCI+ genes • Predisposing to both CpGTpA mutation and intron insertion • Biological complexity model • Environmentally selected gains/losses of PCI methylation (+/-) • Polarizing PCI+ gene structures arounda genomic core of ancestral PCI- genes

  24. Discussion • AT-rich, PCI+ gene vs. GC-rich PCI+ housekeeping gene • Lower transcriptional activity • Higher intron number • Higher evolutionary rate • Loss of negative selection pressure

  25. Discussion (cont’d) • PCI- genes vs. PCI+ genes • Higher evolutionary rate • Lower expression breadth • Intron number relates more directly to PCI positivity

  26. Figure 5 Principal component analysis (PCA) A. PCA analysis using six variables at either 53% (left) or 59% (right) variance

  27. Figure 5 Principal component analysis (PCA) (cont’d) B. 2D dot plots C. 3D dot plots GC-rich, blue; GC-poor, red

More Related