430 likes | 444 Views
This experiment involves data mining and knowledge extraction techniques to interpret experimental results for functional and gene expression profiling. The study focuses on high publication rates and utilizes high-throughput screening to identify relevant genes and their expression patterns. The experiment includes infection with 5 pathogens and analysis of gene expression profiles. The functional implications of the findings are determined using gene ontologies and literature profiling. This process involves co-citation networks, natural language processing, and literature clustering to analyze the abundance of relevant literature and identify functional relationships. The experiment aims to generate knowledge and conclusions from a dataset of 12 million references.
E N D
Laboratory of Parasitic Diseases, NIAID MINING FOR MEANING:Data mining &Knowledge extraction
Experiment Results Knowledge Conclusions Data interpretation
Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
Results • High-throughput screening Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
Data interpretation Experiment GENE EXPRESSION PROFILING Results Results Identify relevant genes Identify expression patterns Knowledge Knowledge Conclusions
Data interpretation Experiment GENE EXPRESSION PROFILING Results Identify relevant genes Identify expression patterns Knowledge Knowledge FUNCTIONAL PROFILING Identify functional implications Conclusions
MINING FOR MEANING: FUNCTIONAL PROFILING
Infection with 5 pathogens -overnight- IL-4 + GM-CSF DC RNA pools M-CSF Elutriated Human Monocytes U95 Mac 7 donors Intracellular Extracellular vs Protozoan vs Bacteria Brugia malayi Toxoplasma gondii Leishmania Mycobacterium tuberculosis Leishmania major vs Leishmania donovani
DC Mac Gene expression profiling Dataset Dataset Extraction Filter 1200 75 12.000 genes
Induced by Bm 50 DC Bm 5 Mtb Lm Ld Tg Pathogens intracellular pathogens Fold Change (log2) Fold Change (log2) Leishmania & TB Genes Toxoplasma & TB Fold Change (log2) Fold Change (log2) Toxoplasma
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH
http://array.ucsd.edu/hapi/ http://132.239.155.52/HAPI/TEST_377.HTML
MINING FOR MEANING: FUNCTIONAL PROFILING WITH GENE ONTOLOGIES - GO WITH LITERATURE ONTOLOGIES - MESH WITH LITERATURE ABSTRACTS
Data interpretation Experiment Results Knowledge Knowledge 12 million references Conclusions
Data interpretation Experiment Results Results Knowledge Knowledge Conclusions 12 million references
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network Natural Language Processing
MINING FOR MEANING: FUNCTIONAL PROFILING WITH LITERATURE ABSTRACTS Co-citation Network Natural Language Processing Literature Profiling
1.Gene - Literature indexation Gene A Gene B Gene C… Gene X Abstracts 2. Analysis of abstract contents 3.Term filtering Term occurrences in abstracts Discrimination Co-occurrence Analyze functional relationships Retrieve relevant literature for each gene Determine term occurrence in abstracts Select relevant terms
KNOWLEDGE - Identify functional relationships - Translate genelists into keywords - Interpret data Experimental system Gene List 12 million references
MINING FOR MEANING: FUNCTIONAL PROFILING LITERATURE MINING
Data interpretation Experiment Results Knowledge Knowledge • High publication rates Conclusions
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING NLP
MINING FOR MEANING: DATA MINING LITERATURE MINING DOCUMENT CLUSTERING NLP LITERATURE PROFILING
Data interpretation Experiment GENE EXPRESSION PROFILING Results Results Identify relevant genes Identify expression patterns Knowledge Knowledge Conclusions