430 likes | 748 Views
Laboratory of Parasitic Diseases, NIAID. MINING FOR MEANING: Data mining & Knowledge extraction. Experiment. Results. Knowledge. Conclusions. Data interpretation. Data interpretation. Experiment. Results. Knowledge. Knowledge. High publication rates. Conclusions. Results.
E N D
Laboratory of Parasitic Diseases, NIAID MINING FOR MEANING:Data mining &Knowledge extraction
Experiment Results Knowledge Conclusions Data interpretation
Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
Results • High-throughput screening Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
Data interpretation Experiment GENE EXPRESSION PROFILING Results Results Identify relevant genes Identify expression patterns Knowledge Knowledge Conclusions
Data interpretation Experiment GENE EXPRESSION PROFILING Results Identify relevant genes Identify expression patterns Knowledge Knowledge FUNCTIONAL PROFILING Identify functional implications Conclusions
MINING FOR MEANING: FUNCTIONAL PROFILING
Infection with 5 pathogens -overnight- IL-4 + GM-CSF DC RNA pools M-CSF Elutriated Human Monocytes U95 Mac 7 donors Intracellular Extracellular vs Protozoan vs Bacteria Brugia malayi Toxoplasma gondii Leishmania Mycobacterium tuberculosis Leishmania major vs Leishmania donovani
DC Mac Gene expression profiling Dataset Dataset Extraction Filter 1200 75 12.000 genes
Induced by Bm 50 DC Bm 5 Mtb Lm Ld Tg Pathogens intracellular pathogens Fold Change (log2) Fold Change (log2) Leishmania & TB Genes Toxoplasma & TB Fold Change (log2) Fold Change (log2) Toxoplasma
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH
http://array.ucsd.edu/hapi/ http://132.239.155.52/HAPI/TEST_377.HTML
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH WITH LITERATURE ABSTRACTS
Data interpretation Experiment Results Knowledge Knowledge 12 million references Conclusions
Data interpretation Experiment Results Results Knowledge Knowledge Conclusions 12 million references
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network Natural Language Processing
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network Natural Language Processing Literature Profiling
1.Gene - Literature indexation Gene A Gene B Gene C… Gene X Abstracts 2. Analysis of abstract contents 3.Term filtering Term occurrences in abstracts Discrimination Co-occurrence Analyze functional relationships Retrieve relevant literature for each gene Determine term occurrence in abstracts Select relevant terms
KNOWLEDGE - Identify functional relationships - Translate genelists into keywords - Interpret data Experimental system Gene List 12 million references
MINING FOR MEANING: FUNCTIONAL PROFILING LITERATURE MINING
Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING NLP
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING NLP LITERATURE PROFILING
Data interpretation Experiment GENE EXPRESSION PROFILING Results Results Identify relevant genes Identify expression patterns Knowledge Knowledge Conclusions