Creating and Evaluating a Consensus for Negated and Speculative Words in a Swedish Clinical Corpus

Creating and Evaluating a Consensus for Negated and Speculative Words in a Swedish Clinical Corpus Hercules Dalianis Maria Skeppstedt Stockholm UniversityDepartment of Computer and Systems Sciences

Intro and Contents • An experiment with annotated clinical text • Background • Creation of a consensus • Automatic detection of cues and the class • Comparison with the BioScope Corpus • Conclusion and next step Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

What is special about clinical text? Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Example of clinical text (Swedish) Kvinna med hjärtsvikt, förmaksflimmer, angina pectoris. Ensamstående änka. Tidigare CVL med sequelae högersidig hemipares och afasi. Tidigare vårdad för krampanfall misstänkt apoplektisk. Inkommer nu efter att ha blivit hittad på en stol och sannolikt suttit så över natten. Inkommer nu för utredning. Sonen Johan är med. Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Example of clinical text Woman with heart failures, atrial fibrillation, and angina pectoris. Single, widow. Former CVL with sequele, right hemiparesis and aphasia. Prior hosp. care for seizures, apoplectic suspected. Arrive to hospital after being found in a chair and probably been sitting there over night. Arrive for further investigation and care. Accompanied by her son Johan. Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Related research: Negation and speculation detection in clinical text • Both rule-based systems and machine learning systems • Precision and recall from just above 80% to just below 100% • Most on English text Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

The Stockholm EPR Corpus • Clinics in Stockholm • 2006-2008 • >800 clinics, >1 million patients • In Swedish Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

The annotation • Three annotators • The assessment part of health records • 6 740 sentences Annotated: • Cues for negation and speculation • Classify the sentence as either certain or uncertain, or break it up the into sub-clauses Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

The annotation <Sentence> <Uncertain> <Speculative_words> <Negation>Not</Negation> really </Speculative_words> much worse than before </Uncertain> <Sentence> Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Construction of a consensus General idea: • Choose the majority annotation Discarded: • The first annotation rounds discarded (16%) • 2% too different to be resolved, also discarded In the resulting consensus: • 92% identically annotated by at least two persons • 6% identically annotated by at least two persons for class. (For cues, only identical when disregarding the scope. Ex. could perhaps) • 2% only identical for class, only when scope of class disregarded.

Differences between the individual annotations and the consensus • Fewer uncertain expressions • Fewer cues for speculation • Fewer sentences that were divided into sub-clauses Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

The BioScope Corpus • Cues for speculation and negation • The scope of speculation and negation <sentence id="S1345.2">Correlation with the patient's height and weight <xcope id="X1345.2.1"><cue type="speculation" ref="X1345.2.1">may</cue> be some value</xcope>.</sentence> Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Comparison between the BioScope Corpus and our corpus Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Our corpus/the BioScope Corpus • Not so detailed guidelines/More detailed guidelines • Consensus with majority decision/Resolving differences with chief annotator (also higher inter-annotator agreement) • Assessment part from many clinics/Radiology reports Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Experiment with the Stanford Named Entity Recognizer Based on Conditional Random Fields • Detections of cues and certain/uncertain • Comparison between our corpus and the BioScope Corpus Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Result of automatic detection of cues for negation Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Result of automatic detection of cues for speculation Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Result of automatic detection of class and scope Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Conclusion and next step • Low results for detecting cues for speculation and class in our constructed corpus • Simplifying the task can hopefully result in: • Higher inter-annotator agreement • Easier to automatically learn to detect speculation Dalianis & Skeppstedt, NeSp-NLP July 10, 2010

Thank you!Questions? Hercules Dalianishercules@dsv.su.se Maria Skeppstedtmariask@dsv.su.se

Creating and Evaluating a Consensus for Negated and Speculative Words in a Swedish Clinical Corpus

Creating and Evaluating a Consensus for Negated and Speculative Words in a Swedish Clinical Corpus

Presentation Transcript

Creating a portrait with words . . .

Creating and Evaluating Effective Educational Assessments

Compiling and Annotating a Syriac Corpus

Evaluating Sight Translation: A Corpus-based Approach

Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet

Learning English words and making use of a corpus

Creating , Monitoring and Evaluating a Master Schedule

CCSVI IN CANADA: A CALL FOR SCIENCE AND CONSENSUS

Evaluating a rating scale for assessing trainee clinical psychologists’ clinical skills in-vivo.

Animating Objects, Inspecting and Evaluating a presentation, Creating a template

What’s in a Corpus?

Evaluating a QTH for Contesting and DXing

Creating a Vocabulary from Consensus Syndrome Definitions

A corpus-based study of loan words in original and translated texts

Creating a Learning Environment in Clinical Education

Tools for Historical corpus research, and a corpus of Latin

CLINICAL CONSENSUS STATEMENT

Misresponse to Reversed and Negated Items in Surveys: A Review

A corpus-based study of loan words in original and translated texts

Modelling clinical goals A corpus of examples and a tentative ontology

Evaluating a QTH for Contesting and DXing