130 likes | 653 Views
cTAKES: Demo Clinical Text Analysis and Knowledge Extraction System. James Masanz Mayo Clinic. UIMA CAS Visual Debugger (CVD). Provided by / part of UIMA Run a pipeline against free text With appropriate 1 st annotator, against XML such as CDA document
E N D
cTAKES: DemoClinical Text Analysis and Knowledge Extraction System James Masanz Mayo Clinic
UIMA CAS Visual Debugger (CVD) • Provided by / part of UIMA • Run a pipeline against free text • With appropriate 1st annotator, against XML such as CDA document • View annotations created (“debugger”) • Export annotations to XML (XCAS or XMI)
cTAKES: Components • Sentence boundary detection (OpenNLP technology) • Tokenization (rule-based) • Morphologic normalization (NLM’s LVG) • POS tagging (OpenNLP technology) • Shallow parsing (OpenNLP technology) • Named Entity Recognition • Negation and context identification (NegEx) • Dependency parser • Drug Profile module • Smoking status classifier • CEM normalization module
Extend Earlier Example Tamoxifen 20 mg po daily started on March 1, 2005 for 6 mo. Aspirin prn. Fx history of breast cancer. History of migraines.