1 / 18

RaJoLink: Creative Knowledge Discovery by Literature Outlier Detection

RaJoLink: Creative Knowledge Discovery by Literature Outlier Detection. Ingrid Petrič University of Nova Gorica Bojan Cestnik Temida, Ljubljana and Jozef Stefan Institute, Ljubljana Nad a Lavra č Jozef Stefan Institute, Ljubljana and University of Nova Gorica Tanja Urbančič

Download Presentation

RaJoLink: Creative Knowledge Discovery by Literature Outlier Detection

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. RaJoLink:Creative Knowledge Discovery by Literature Outlier Detection Ingrid Petrič University of Nova Gorica BojanCestnik Temida, Ljubljana and Jozef Stefan Institute, Ljubljana Nada Lavrač Jozef Stefan Institute, Ljubljana and University of Nova Gorica TanjaUrbančič University of Nova Goricaand Jozef Stefan Institute, Ljubljana

  2. Overview • Motivation: • to present a method that supportsknowledge discovering by connecting information from different contexts in a new way • Focus domain: • biomedical research • autism, a spectrum of pervasive developmental disorders • Knowledge discovery support tools: • RaJoLink (Petrič and Cestnik, 2007) • OntoGen (Fortuna et al., 2006) • Knowledge sources: • MEDLINE (http://www.ncbi.nlm.nih.gov/sites/entrez) • MeSH (http://www.nlm.nih.gov/mesh/)

  3. Swanson’s model of literature-based discovery(Swanson, 1986) Literature about Literature about magnesium (A) migraine (C) (Bi)

  4. Closed vs. open discovery process(Weeber et al., 2001)

  5. Combined open and closed discovery in RaJoLink(Petrič et al., 2009) Open discovery (generation of hypothesis) • Identifying rare terms r • Finding joint terms a Closed discovery (hypothesis testing) • Searching for linking terms b

  6. The RaJoLink method:open discovery Literature R3 Literature R1 Joint term A Literature R2 Rare term R1 Rare term R2 Rare term R3 Literature C

  7. The RaJoLink method:closed discovery Literature A Joint term A Linking term B1 Linking term B2 Linking term B3 Literature C

  8. The RaJoLink method’s procedures(Petrič et al., 2009)

  9. Steps of the RaJoLink method – step Ra

  10. Step Ra

  11. Steps of the RaJoLink method – step Jo

  12. Step Jo

  13. Steps of the RaJoLink method – step Link

  14. Step Link

  15. Step Link - alternative

  16. Conclusions • Open discovery: • RaJoLink represents a more interdisciplinary approach to hypotheses generation that bridges the overspecialization in the sciences. We provide connections between biomedical literature by analysis and explanation of rare terms. • Closed discovery: • With the combination of outlier detection and high frequency analysis approach we demonstrated that outlying documents could be used as a heuristic guidance to speed-up the search for the linking terms and alleviate the burden on the expert when hypotheses have to be tested. • Recent experiments: • Detection of published evidence of autism findings that coincide with specific calcineurin and NF-kappaB observations (Petrič et al., 2007, Urbančič et al., 2007). • The gold standard evaluation: RaJoLink led to the Swanson’s relation of magnesium with migraine and to other three discoveries important for migraine.

  17. Future work • Automated identification of semantic variants such as abbreviations, acronyms and synonyms. • Handling the language specifics for Slovenian and other languages. • Providing visualizations of results. • Implementing the similarity measure between documents in the Link step.

  18. RaJoLink - references • Petrič, I.; Urbančič, T.; Cestnik, B. Discovering hidden knowledge from biomedical literature. Informatica 31(1):15-20 (2007). • Petrič, I.; Urbančič, T.; Cestnik, B. Literature mining: potential for gaining hidden knowledge from biomedical articles, In: Bohanec, M.; Gams, M.; Rajkovič, V.; Urbančič, T.; Bernik, M.; Mladenić, D. et al., editors. IS-2006. Proceedings of the 9th International multi-conference Information Society; Ljubljana, Slovenia. 52-55 (2006). • Petrič, I.; Urbančič, T.; Cestnik, B.; Macedoni-Lukšič, M. Literature mining method RaJoLink for uncovering relations between biomedical concepts. Journal of Biomedical Informatics 42(2): 219-227 (2009). • Urbančič, T.; Petrič, I.; Cestnik, B.; Macedoni-Lukšič, M. Literature mining: towards better understanding of autism. In: Bellazzi R; Abu-Hanna A; Hunter J, editors. AIME 2007. Proceedings of the 11th Conference on Artificial Intelligence in Medicine in Europe; Amsterdam, The Netherlands. 217-226 (2007).

More Related