An open-source toolkit for mining Wikipedia • Presenter: Chang, Chun-Chih • Authors: David Milne*, Ian H. Witten • 2012, AI
Outline • Motivation • Objectives • Methodology • Experiments • Conclusions • Comments
Motivation • The online encyclopedia Wikipedia is a vast, constantly evolving tapestry of interlinked articles. • For developers and researchers, it represents a giant multilingual database of concepts and semantic relations, and a potential resource for natural language processing.
Objectives • The paper introduces the Wikipedia Miner toolkit, an open-source software system that allows researchers and developers to integrate Wikipedia’s rich semantics into their own applications. • Wikipedia Miner is intended to be a platform for sharing data mining techniques.
Experiments • Impact of thresholds for disambiguation and detection
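To make the threshold experiment concrete, here is a minimal sketch of how a confidence cut-off typically trades precision against recall when candidates are kept or discarded. It is not the toolkit's code: the function name `precision_recall_at`, the rule of keeping candidates at or above the threshold, and the toy scores are all assumptions made for illustration.

```python
from typing import List, Tuple


def precision_recall_at(
    scored: List[Tuple[float, bool]], threshold: float
) -> Tuple[float, float]:
    """Return (precision, recall) after keeping candidates scored >= threshold.

    `scored` holds (confidence, is_correct) pairs. Both the pair layout and
    the >= rule are illustrative assumptions, not Wikipedia Miner's API.
    """
    kept = [ok for conf, ok in scored if conf >= threshold]
    total_correct = sum(1 for _, ok in scored if ok)
    true_pos = sum(1 for ok in kept if ok)
    # With nothing kept, no wrong output is produced, so precision is taken as 1.0.
    precision = true_pos / len(kept) if kept else 1.0
    recall = true_pos / total_correct if total_correct else 0.0
    return precision, recall


# Hypothetical toy data: (confidence, whether the candidate is actually correct).
candidates = [(0.95, True), (0.80, True), (0.60, False), (0.40, True), (0.20, False)]

if __name__ == "__main__":
    for t in (0.1, 0.5, 0.9):
        p, r = precision_recall_at(candidates, t)
        print(f"threshold={t:.1f} precision={p:.2f} recall={r:.2f}")
```

On this toy data, raising the threshold keeps fewer candidates, so precision rises while recall falls. The experiments slide refers to this kind of trade-off for the disambiguation and detection steps.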
Conclusions • Our aim in releasing this work open source is not to provide a complete and polished product, but rather a resource for the research community to collaborate around and continue building together.
Comments • Advantages • Applications - Wikipedia - Disambiguation - Annotation