90 likes | 231 Views
Modeling of complex systems: what is relevant?. Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS). Knowledge mining. Knowledge mining: Producing high-level insight by mining primary results in the context of existing knowledge.
E N D
Modeling of complex systems:what is relevant? Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS)
Knowledge mining • Knowledge mining: Producing high-level insight by mining primary results in the context of existing knowledge. • Integration of detailed findings with (publicly available) domain knowledge • Knowledge sources • annotations: GO, KEGG, … • basic details: genomic location, … • interaction networks • detailed pathways: Wikipathways • literature: MEDLINE
Wikipathways domain knowledge literature relevant concepts chromosomes biological processes pathways transcription factors domain specific… Data Mining mathematical modeling differential equations quantitative, time high resolution refined experimental setup selected genes metabolites external variables (O2, CO2, heat) Role of LIACS in the modeling process high-throughput data genomics transcriptomics proteomics metabolomics important entities genes proteins metabolites
Genomic location chromosome = 11 chromosome = X chr_arm = 11q chromosome = 17 GO-terms cell cycle Focal adhesion DNA replication Adherens junction Protein families Cadherin_2 Cadherin Kinesin HEAT Transcription factors NOVA2: Neuro-oncological ventral antigen 2 Camta1: Calmodulin-binding transcription activator 1 SELS Selenoprotein S Pathways (Wikipathways) Neoplastic processes Malignant Prolactinoma Anaplastic Oligoastrocytoma Adrenal neuroblastoma Example: Understanding Neuroblastoma
Literature as Background Knowledge • concepts taken from • Unified Medical Language System • gene dictionary • MEDLINE abstracts • Express detailed findings in terms of biological vocabulary
Literature as Background Knowledge • Biological process (4825) • Molecular function (3370) • Cellular component (1613) • Cell (1197) • Tissue (736) • Gene to gene associations (24217 other genes) • Disorder (26410) • Neoplastic process (4307) • Pathological function (1912) • … as GO, but not annotations
Tracking concepts over time • Which concepts are relevant at each timepoint? • Track (multiple) winners through time • Metabolic Syndrome, genes involved in fatty diet:
Contact dr. Arno Knobbe www.liacs.nl/~knobbe knobbe@liacs.nl prof. dr. Joost Kok www.liacs.nl/~joost joost@liacs.nl