1 / 19

Extracting Why Text Segment from Web Based on Grammar-gram

Extracting Why Text Segment from Web Based on Grammar-gram. Iulia Nagy, Master student, 2010-02-27. Summary. Introduction Related work Rule Based Methods Machine Learning Approach “Bag of Function Words” method Method outline Adaptation of “Bag of Function Words” to English

chloe
Download Presentation

Extracting Why Text Segment from Web Based on Grammar-gram

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Extracting Why Text Segment from Web Based on Grammar-gram Iulia Nagy, Master student, 2010-02-27

  2. Summary • Introduction • Related work • Rule Based Methods • Machine Learning Approach • “Bag of Function Words” method • Method outline • Adaptation of “Bag of Function Words” to English • Experiments and Evaluation • Conclusion and Remarks

  3. Problem tremendous growth of the Internetinformation hard to find

  4. Solution • Create QA system system capable to give an exact answer to an exact question detect answer from arbitrary corpora • Purpose obtain viable information rapidly

  5. Purpose of our research Create a why-QA system with automatically-built classifier • Classifier • Use a model presented in Japanese Literature created using Machine learning based on Bag of Grammar approach Purpose of this paper

  6. Related word Two main trends • Rule Based methods • Machine Learning methods

  7. Rule based in why-QA Suzan Vererne’s Approach • Improve performance by re-ranking Method : • weight the score assigned to a QA-pair by QAP with a number of syntactic features.

  8. Machine Learning method Higashinaka and Isozaki’s Approach • Acquire causal expression from Japanese EDR dictionary Method : • train a ranker based on clause structures extracted from EDR

  9. Machine Learning method Tanaka’s Approach • Build why-classifier with function words as features Method : • Bag of function words

  10. Bag of function words Function words Machine learning approach to automatically build domain independent why-classifier based of function words Conditions to obtain domain-independence Class fulfilling conditions

  11. Bag of function words Ts 1 Create feature space Create feature vectors Extract function words Ts 2 … Ts n Mapping using “tf-idf” on function words Fv1 Vectors' format: Classification scheme Fv 2 Trainer for because at after in under which that why to therefore є … Fv n Loogit Boost weak learners Method – same baseline for Japanese and English

  12. Adaptation to English • Differences • Adjustments • Identify eligible function words in English

  13. Experiment • Data • Processing • Label all words with POS and extract function words • Calculatetf-idffor each function word • Map features from feature set into feature vectors

  14. Experiment • Classifier • Used Loogit Boost (Weka) with Decision stump • Created 5 classifiers (50, 100, 150, 200, 250 iterations) • Evaluation • 10-fold cross validation • Models trained on 9 folds and tested on 1 • Measured precision, recall and F-measure

  15. Results – why text segments No of iterations

  16. Results – non why text segments (NWTS) No of iterations

  17. Conclusion Method effective on English Type of TS • Results • 321 instances out of 432 correctly classified • 76.1% precision and 70.6% recall on WTS • 72.6% precision and 77.9% recall on NWTS

  18. Future works • Experiment with a increased dataset (> 5000) • Use Yahoo!Answers database to extract dataset • Interest • Include causative construction in the analysis

  19. Questions and remarks Thank you for your attention !

More Related