120 likes | 248 Views
A Concept-based Model for Enhancing Text Categorization. Presenter : Jiang-Shan Wang Authors : Shady Shehata, Fakhri Karray, Mohamed Kamel. 國立雲林科技大學 National Yunlin University of Science and Technology. SIGKDD 2008. Outline. Motivation Objective Methodology Experiments Conclusion
E N D
A Concept-based Model for Enhancing Text Categorization Presenter : Jiang-Shan Wang Authors : Shady Shehata, Fakhri Karray, Mohamed Kamel 國立雲林科技大學 National Yunlin University of Science and Technology SIGKDD 2008
Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments
Motivation • Most of text categorization techniques are based on word and/or phrase analysis of the text. • However, two terms can have the same frequency in their documents, but one term contributes more to the meaning of its sentences than the other term. • Example : • electronic techniques • defense effort
Objective To propose a new concept-based model that analyzes terms on the sentence and document levels.
Methods – Natural Language Processing We have noted how some electronic techniques, developed for the defense effort, have eventually been used in commerce and industry. [ARG0 We] [TARGET noted] [ARG1 how some electronic techniques developed for the defense effort have eventually been used in commerce and industry]. We have noted how [ARG1 some electronic techniques] [TARGET developed] [ARGM-PNC for the defense effort] have eventually been used in commerce and industry. We have noted how [ARG1 some electronic techniques developed for the defense effort] have [ARGM-TMP eventually] been [TARGET used] [ARGM-LOC in commerce and industry].
Conclusion This work bridges the gap between natural language processing and text categorization disciplines. The quality of the categorization results by the proposed model surpasses that of traditional approaches significantly.
Comments • Advantage • Considering about sentence semantics for text categorization. • Drawback • . • Application • Text categorization. • Document categorization • Web document categorization.