1 / 33

Human Liver Analysis

Human Liver Analysis. Alignment + Quantitation. Ensembl 66, Noncode 3.0 Noncodes filters: no Ens overlaps, no Guttman . After filtering out genes with all FPKM < 1: TOTAL: 19,793 (29.4% total) Ensembl PC: 13,612 (66.4% total)

dyan
Download Presentation

Human Liver Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Human Liver Analysis

  2. Alignment + Quantitation • Ensembl 66, Noncode 3.0 • Noncodes filters: no Ens overlaps, no Guttman. • After filtering out genes with all FPKM < 1: • TOTAL: 19,793 (29.4% total) • Ensembl PC: 13,612 (66.4% total) • Ensembl NC: 4,863 (14.7% total) • NoncodeIntronic: 434 (14.6% total) • NoncodeIntergenic: 884 ( 8.2% total) • FINAL NONCODING: 4,870 (78.8% expressed) • No Ensembl t / r / sn / sno / mi / misc-RNAs

  3. QC 1

  4. QC 2

  5. ANOVA Significants from 19,793 genes, adjusted p-values ≤ 0.05: Biotypes for significant NC genes:

  6. PC-NC Neighbor Assessment • Nearest PC neighbors for every NC gene found. • Pairs with expressed NC, ≤ 10kb: 2833 • 1747 Intronic, 1086 Intergenic

  7. Neighbor Expression

  8. Clustering • Cluster ANOVA-significant genes only: • Drug: zscores; one avg value per gene per drug • Source: Log2FC: one FC per gene per drug • Interaction: zscores: all values per gene • Z-scores clustered by Pearson correlation dist • Clusters cut from dendrogram • Log2FC clustered by Euclidean dist • For viewing only; actual clusters via k-means

  9. PC Drug NC Drug DMSO PHENO RIFAM DMSO PHENO RIFAM

  10. PC GO / KEGG Terms, raw p ≤ 0.05, 2+ genes/term Drug Effect: Six Clusters

  11. PC Source NC Source DMSO PHENO RIFAM DMSO PHENO RIFAM 724 4388 2036 3724

  12. Source Clusters • GO/KEGG Terms: Adj.p ≤ 0.05, 2+ genes/term: • PC: 1879 up, 1414 down • NC: nothing… Human GO based on UniProt • Neighbor Pairs: Significant trending observed: χ2 p-value ≈ 0

  13. Neighbors with Distance ≤ 10k • Neighbor Pairs: Significant trending observed: χ2 p-value ≈ 0

  14. PC Drug-Source NC Drug-Source PHD PHP PHR RGD RGP RGR PHD PHP PHR RGD RGP RGR

  15. Dif / Pro Log2FPKM, same order Z-Scores

  16. Mouse Liver Analysis

  17. Alignment + Quantitation • Ensembl 66, Noncode 3.0 • Noncodes filters: no Ens overlaps, no Guttman. • After filtering out genes with all FPKM < 1: • TOTAL: 15,244 (27.1% total) • Ensembl PC: 12,990 (56.0% total) • Ensembl NC: 1,516 (10.2% total) • NoncodeIntronic: 290 ( 2.8% total) • NoncodeIntergenic: 448 ( 5.6% total) • FINAL NONCODING: 1,932 (85.7% expressed) • No Ensembl t / r / sn / sno / mi / misc-RNAs

  18. QC

  19. ANOVA Significants from 15,244 genes, adjusted p-values ≤ 0.05: Biotypes for significant NC genes:

  20. PC-NC Neighbor Assessment • Nearest PC neighbors for every NC gene found. • Pairs with expressed NC, ≤ 10kb: 1356 • 766 Intronic, 590 Intergenic

  21. Clustering • Z-scores clustered by Pearson correlation dist • Clusters cut from dendrogram • To distinguish Neonatal, Adolescent, Mature stages, 9 PC clusters and 11 NC clusters required. • Also detected a “Neonatal-Mature” cluster. • NC has 2 types of adolescent clusters; merged.

  22. PC NC -2 0 1 3 5 10 15 20 25 30 45 60 -2 0 1 3 5 10 15 20 25 30 45 60

  23. Time Effect: 4 Main Clusters PC GO / KEGG Terms, adj p ≤ 0.05, 2+ genes/term

  24. 2 Merged NC Adolescent Clusters

  25. Other PC Clusters

  26. Other NC Clusters

  27. Neighbor Pairs • Again, significant trending observed: χ2 p-value ≈ 0

  28. Neighbors with Distance ≤ 10k • Again, significant trending observed: χ2 p-value ≈ 0

  29. Datasets Available • All Genes+FPKMs • Genes+FPKMs per cluster • GO/KEGG Terms per cluster • Concordant neighbor pairs per cluster • For mouse, human source only

  30. GO Themes for Clusters • Human Drug: • 1 Generic Drug Response+Metabolism / * Metabolism • 2 Transcription / Cardiovascular Development? • 3 Cell Adhesion / Steroid Hormone Metabolism • 4 Cell Cycle • 5 Cytoskeleton / Organelle Organization • 6 "Cell Part Morphogenesis" / RNA processing? • Human Drug-Source: • 1 Generic Drug Response+Metabolism, * Metabolism • 2 Organ / Anatomical Structure Development, Cellular Stress Response, Protein Complex Assembly, DNA Metabolism • Human Dif-Pro: • 1 Signaling, Transport, Metabolism, Wnt, MapK • 2 Cell Adhesion/Migration/Localization • Human Source: • Too many • Mouse Time: • Too many

  31. More Human Correlations PC + NC NC PC Scale has been fixed across heatmaps. Within NCs, intronics correlate worse than intergenics.

  32. More Mouse Correlations PC + NC NC PC Scale has been fixed across heatmaps. PC+NC is only slightly better than PC-only.

More Related