1 / 35

Topic Maps applied to PubMed

Topic Maps applied to PubMed. Giovani Rubert Librelotto Mirkos Martins Henrique Machado Franciscan University UNIFRA Brazil. José Carlos Ramalho Pedro Gabriel Ferreira Pedro R. Henriques Department of Informatics University of Minho Portugal. Outline. Motivation PubMed Topic Maps

neola
Download Presentation

Topic Maps applied to PubMed

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Topic Maps applied to PubMed Giovani Rubert Librelotto Mirkos Martins Henrique Machado Franciscan University UNIFRA Brazil José Carlos Ramalho Pedro Gabriel Ferreira Pedro R. Henriques Department of Informatics University of Minho Portugal

  2. Outline • Motivation • PubMed • Topic Maps • Metamorphosis • PubMed knowledge representation • Conclusion

  3. Motivation • Write down what you want to find! • Underline the key concepts • Consider how best to combine and refine the concepts

  4. Identify Key Concepts Is it riskyfor pregnantwomen to take antidepressants? Risky – (Synonyms) Harm, adverse effects Anti-depressants – Name of drug or drug(s)?

  5. PubMed • PubMed is a free search engine that provides very full coverage of the related biomedical sciences, such as biochemistry and cell biology. • It also offers access to the MEDLINE database with citations and abstracts of biomedical research articles.

  6. PubMed (2) • The PubMed core subject is medicine and its related fields. • It is offered by the United States National Library of Medicine as part of the Entrez information retrieval system.

  7. PubMed (3) • The inclusion of an article in PubMed does not endorse the article's contents, as other indexes. • Nevertheless, many PubMed citations contain links to full text articles which are freely available, often in the PubMed Central digital library.

  8. MEDLINE • MEDLINE database covers over 4.900 journals published around the world primarily from 1966 to the present and is composed of more than 17 millions of citations.

  9. Milestones • MEDLINE 35 years young • 15.8 million records • PubMed approximately 10 years old • 17 million records • MEDLINE database covers over 4.900 journals

  10. PubMed architecture

  11. PubMed web site

  12. Querying PubMed

  13. Query Results Formats

  14. Changing Results Format • Pull down menu on display bar • Select desired format • Change occurs automatically

  15. Information indexed in PubMed • Subject or journal title • Title Abbreviation • NLM ID (NLM'sunique journal identifier) • ISO abbreviation • Both the print and electronic International Standard Serial Numbers (pISSN and eISSN).

  16. PubMed data structure - CFG MedlineCitation ==> PMID, DateCreated, DateCompleted, Article, MedlineJournalInfo, ChemicalList, CitationSubset, MeshHeadingList Article ==> Journal, ArticleTitle, Pagination, Abstract, Affiliation, AuthorList, Language, PublicationTypeList Journal ==> ISSN, JournalIssue, Title JournalIssue ==> Volume, Issue, PubDate PubDate ==> Year, Month, Day, Hour?, Minute?, Second? MedlineJournalInfo ==> Country, MedlineTA, NlmUniqueID ChemicalList ==> Chemical+ Chemical ==> RegistryNumber, NameOfSubstance MeshHeadingList ==> MeshHeading+ MeshHeading ==> DescriptorName, QualifierName? AuthorList ==> Author+ Author ==> LastName, ForeName, Initials PublicationTypeList ==> PublicationType+

  17. PubMedXML Schema

  18. Publication Date Language Literature Type Journal Author Gender Age Journal Subset Available PubMed Concepts

  19. Available PubMed Association • Author writes article; • Keyword describes article; • Article was published in an year; • Article is published in a journal; • Article is written in a language; • Article refers to chemical substances; • Author publishes in an year; • Author writes paper in a language; • Journal refers to the keywords;

  20. What’s the idea? To create an ontology based on PubMed concepts and associations To use that ontology to present query results.

  21. How? Prebuild ontology’s structure level Populate the ontology with query results (occurrences) Structure level Catalog level

  22. Metamorphosis

  23. Metamorphosis

  24. XSTM editor

  25. Structure level through XSTM <?xml version="1.0" encoding="ISO-8859-1"?> <xstm> <topicTypes> <topicType> <id>Article</id> <instanceOf>Article</instanceOf> <name>Article</name> </topicType> <topicType> <id>Year</id> <instanceOf>Year</instanceOf> <name>Year</name> </topicType> ... <topics> <topic> <xpath id="PMID” name="Article/ArticleTitle"> //MedlineCitation</xpath> <type>Article</type> <resourceData scope="Abstract"> Article/Abstract </resourceData> <resourceData scope="PMID"> PMID</resourceData> <resourceData scope="Pagination"> Article/Pagination/MedlinePgn</resourceData> </topic>

  26. Topic Maps applied to PubMed

  27. New architecture Semantic Browser Qresults intersection Metamorphosis Extracting the base ontology

  28. One of the advantages of this approach is that no new database is created and no redundant data is produced; • Information is interconnected within a huge knowledge network navigable in any direction.

  29. Future work • Integration of Topic Maps and MeSH headings • Minimizing “false hits” and saving time • Identifying other useful – but frequently overlooked – features of the PubMed database.

  30. MeSH • Medical Subject Headings (MeSH) is a controlled vocabulary designed by the National Library of Medicine to search MEDLINE and other health sciences databases

  31. MeSH example by Wikipedia

  32. MeSH qualifiers • In addition to the descriptor hierarchy, MeSH contains a small number of standard qualifiers (also known as subheadings), which can be added to descriptors to narrow down the topic.

  33. MeSH qualifiers example • For example, "Measles" is a descriptor and "epidemiology" is a qualifier; "Measles/epidemiology" describes the subheading of epidemiological articles about Measles. • The "epidemiology" qualifier can be added to all other disease descriptors. Not all descriptor/qualifier combinations are allowed since some of them may be meaningless. In all there are 83 different qualifiers

  34. Conclusion • This paper described the integration of data from PubMed information system using the ontology paradigm, in order to generate an homogeneous view of this resources. • The proposal uses an environment, called Metamorphosis, for the automatic construction of Topic Maps with data extracted from the various data sources, and a semantic browser to navigate among the information resources.

  35. Questions? • Short announcement: • XATA2008: Portuguese XML conference, Feb. 2008 • Becoming international • English will be the language • We are seeking people for PC commitee...

More Related