1 / 1

Sentence one/more emotion(s) no annotation = `no emotion’ or `no agreement’

2011 Medical NLP Challenge – Track 2 Shared task on sentiment detection in suicide nodes. DATA SET. 50%. Sentence one/more emotion(s) no annotation = `no emotion’ or `no agreement’. 50%. APPROACH. Re-annotation. on complex cases in training k eeping only clear-cut cases.

keaira
Download Presentation

Sentence one/more emotion(s) no annotation = `no emotion’ or `no agreement’

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 2011 Medical NLP Challenge – Track 2 Shared task on sentiment detection in suicide nodes DATA SET 50% Sentence one/more emotion(s) no annotation = `no emotion’ or `no agreement’ 50% APPROACH Re-annotation on complex cases in training keeping only clear-cut cases Fine-grained emotion detection in suicide notes:Multi-label classification with probability estimatesKim Luyckx, FrederikVaassen, Claudia Peersman & Walter DaelemansCLiPS Computational Linguistics Group, University of Antwerp, Belgium Phase 1: Calibration on single-label training data 15 emotions + no-emotion - feature selection - LibSVMparam optimization Lexical features Context features Emotion vocabulary Per fold 10-fold CV Single-label training set Single-label test set C parameter G parameter Phase 2: Thresholding Same data Optimal Assign multiple labels to a single sentence if label P exceeds probability threshold 3 classification scenarios: - 15 emotions (no threshold) - 15 emotions (1 threshold) - 15 emotions + no-emotion (2 thresholds) LibSVM probability estimates per class Single-label training set Original multi-label data + Multi-label test set threshold for 15 emotions threshold for no-emotion RESULTS during training Phase 1: Calibration Evaluated on single-label (=re-annotated) test data C=8.0, G=8.0 Is re-annotation a good idea? 15 emotions + no-emotion (optimized) Yes, so proceed with re-annotated training data Phase 2: Thresholding Evaluated on multi-label test data EVALUATION on test data ERROR ANALYSIS 15 EMO + NO-EMO with 2 thresholds 15% of sentences got multi-label prediction 80% of these were partially correct Confusion between instructions & information instructions & no-emotion hopelessness & no-emotion Higher than mean of all participants: 48.75 % Only just below median: 50.27 %

More Related