Text Mining of Medical Documents


  1. Text Mining of Medical Documents
     • Michael Elhadad, Raphael Cohen
     • Dept of Computer Science

  2. Natural Language Processing
     • Analyze free text to extract “information”
     • Key challenges:
       • Ambiguity: heart, ברק
       • Variability: diabetes, dm, diab.
     • Applications:
       • Search
       • Text Mining: information extraction, relations
       • Summarization

  3. NLP for Medical Domain
     Opportunity
     • Availability of online textual documents
       • EHR: mostly textual (release notes)
       • Scientific literature (PubMed)
     Challenge
     • Methods developed on “regular language” fail on “medical language”

  4. Specific Interest
     • EHR
       • Exploit rich textual data in EHR.
       • In Hebrew!
     • Hebrew NLP
       • Complex morphology, no dictionaries, no UMLS
     • Domain Adaptation
       • Machine learning methods to port NLP models from one domain to the medical domain.

  5. Recent Work in Domain
     • Raphael Cohen, Michael Elhadad and Ohad S Birk, Analysis of free online physician advice services, PLOS ONE, 2013.
     • Raphael Cohen, Noemie Elhadad and Michael Elhadad, Redundancy in Electronic Health Record Corpora: Analysis, Impact on Text Mining Performance and Mitigation Strategies, BMC Bioinformatics, 2013.
     • Raphael Cohen and Michael Elhadad, Syntactic Dependency Parsers for Biomedical-NLP, AMIA Proceedings 2012, pp. 121-128.
     • Raphael Cohen, Yoav Goldberg and Michael Elhadad, Domain Adaptation of a Dependency Parser with a Class-Class Selectional Preference Model, ACL 2012, SRW.
     • Raphael Cohen, Avitan Gefen, Michael Elhadad and Ohad S Birk, CSI-OMIM - Clinical Synopsis Search in OMIM, BMC Bioinformatics 2011, 12:65.
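
The variability challenge named on slide 2 (diabetes, dm, diab.) can be illustrated with a small Python sketch. This is not the authors' method; it is a plain table lookup written for illustration. The three surface forms come from the slide, while the canonical label, the `VARIANTS` table, and the `normalize_terms` function are hypothetical names introduced here.

```python
import re

# Hypothetical variant table: the surface forms are taken from slide 2,
# the canonical label "diabetes" is an assumption made for illustration.
VARIANTS = {
    "diabetes": "diabetes",
    "dm": "diabetes",
    "diab": "diabetes",
}


def normalize_terms(text):
    """Return the canonical concept for each known variant found in text."""
    # Lowercase and keep alphabetic runs only, so "DM," and "diab." match.
    tokens = re.findall(r"[a-z]+", text.lower())
    return [VARIANTS[t] for t in tokens if t in VARIANTS]


print(normalize_terms("Pt with DM, father also had diab."))
```

A lookup of this kind only addresses variability. It does nothing for the ambiguity challenge on the same slide, since a short form such as "dm" is mapped to one concept regardless of context, and it presupposes a term dictionary, which slide 4 notes is not available for Hebrew.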
