Presenter: Chun-Ping Wu

Unsupervised word sense disambiguation for Korean through the acyclic weighted digraph using corpus and dictionary
Presenter: Chun-Ping Wu
Authors: Yeohoon Yoon, Choong-Nyoung Seon, Songwook Lee, Jungyun Seo
National Yunlin University of Science and Technology
IPM 2007

Outline
• Motivation
• Objective
• Methodology
• Experiments
• Conclusion
• Comments

Motivation
• Word sense disambiguation (WSD) is a common problem in natural language processing.
• Traditional approaches consider only co-occurrence probability.
Sample: I deposit some money in the bank.
Options:
  bank = financial institution?
  bank = embankment; shore?
  bank = a row; a set?

Objective
• To construct a WSD system that is easy to implement, learns all polysemous words at once, and covers every polysemous word listed in the machine-readable dictionary (MRD).
• To consider the relation between each sense of the context words and the sense of the target word.
Sample: I deposit some money in the bank.
Answer: bank = financial institution

Methodology
• Learning step
  • Similarity matrix
  • Word vector
  • Vector representations of sense definitions in the MRD
• Disambiguation step
  • Definition of the acyclic weighted digraph
  • Selecting context words
  • Constructing the acyclic weighted digraph
  • Searching for the optimal path on the acyclic weighted digraph

Methodology
• Learning step
  • Similarity matrix
  • Word vector
  • Vector representations of sense definitions in the MRD
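
The slides name the three learning-step components but do not show how they are computed. As a rough illustration only, the sketch below assumes sentence-level co-occurrence counts for word vectors, cosine similarity for the similarity matrix, and a sense-definition vector formed by summing the vectors of the words in the definition. The function names, the toy corpus, and the toy definitions are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict
from math import sqrt

def cooccurrence_vectors(sentences):
    """Map each word to a Counter of the words it shares a sentence with (assumed scheme)."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.split()
        for i, word in enumerate(words):
            for j, other in enumerate(words):
                if i != j:
                    vectors[word][other] += 1
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def similarity_matrix(vectors):
    """Pairwise cosine similarity between all words, stored as {(a, b): score}."""
    words = sorted(vectors)
    return {(a, b): cosine(vectors[a], vectors[b]) for a in words for b in words}

def definition_vector(definition, vectors):
    """Represent a dictionary sense definition as the sum of its words' vectors."""
    total = Counter()
    for word in definition.split():
        total.update(vectors.get(word, Counter()))
    return total

# Toy corpus and toy sense definitions (hypothetical data).
corpus = ["deposit money bank", "bank loan money", "river bank water", "water flows river"]
vectors = cooccurrence_vectors(corpus)
matrix = similarity_matrix(vectors)
finance_vec = definition_vector("institution money deposit", vectors)
river_vec = definition_vector("land beside river water", vectors)

# A context word such as "loan" should sit closer to the financial sense.
finance_score = cosine(finance_vec, vectors["loan"])
river_score = cosine(river_vec, vectors["loan"])
```

In this toy setting the financial definition scores higher against "loan" than the river definition does, which is the kind of signal the disambiguation step would then combine across context words.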

Methodology
• Disambiguation step
  • Definition of the acyclic weighted digraph
  • Selecting context words
  • Constructing the acyclic weighted digraph
  • Searching for the optimal path on the acyclic weighted digraph
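
The path search over the digraph can be sketched as a Viterbi-style dynamic program. The sketch assumes a layered graph in which each word contributes one layer of candidate senses and edges run only between senses of adjacent words; the paper's actual graph construction and edge weights are not given in the slides. `best_sense_path`, the sense labels, and the weights are hypothetical.

```python
def best_sense_path(layers, weight):
    """Find the highest-scoring path through a layered acyclic weighted digraph.

    layers: non-empty list of lists of sense labels, one list per word, in order.
    weight(a, b): score of the edge from sense a to sense b in the next layer.
    Returns (score, path) with exactly one sense chosen per layer.
    """
    # best[s] holds the best score of any path ending at sense s, and that path.
    best = {s: (0.0, [s]) for s in layers[0]}
    for layer in layers[1:]:
        new_best = {}
        for s in layer:
            # Extend the best predecessor path into s; only one path per node is kept.
            score, path = max(
                ((prev_score + weight(p, s), prev_path)
                 for p, (prev_score, prev_path) in best.items()),
                key=lambda t: t[0],
            )
            new_best[s] = (score, path + [s])
        best = new_best
    return max(best.values(), key=lambda t: t[0])

# Toy example for "deposit ... money ... bank" with made-up edge weights.
layers = [
    ["deposit/pay-in", "deposit/sediment"],
    ["money/currency"],
    ["bank/finance", "bank/river"],
]
edge_weights = {
    ("deposit/pay-in", "money/currency"): 0.9,
    ("deposit/sediment", "money/currency"): 0.1,
    ("money/currency", "bank/finance"): 0.8,
    ("money/currency", "bank/river"): 0.2,
}
score, path = best_sense_path(layers, lambda a, b: edge_weights.get((a, b), 0.0))
```

Because only the best path into each node is kept, the work grows with the sum of the products of adjacent layer sizes, whereas enumerating every sense combination grows with the product of all layer sizes.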

Experiments • System results

Experiments
• Experiment on English
  • The accuracy of the system is 30.7% on average.
  • This result is very low, for the following reasons:
    • The selected context words were often inappropriate, even though context words are critical because they decide which sense of the target word is best.
    • Mapping English senses to Korean through an English-Korean dictionary loses some information.
    • Stemming errors made it difficult to find the correct verb root in the MRD.

Conclusion
• The method considers the relationship between each sense of the context words and the sense of the target word.
• It uses the Viterbi algorithm to reduce computational complexity.
• The system performed poorly on English (30.7% accuracy) but achieved reasonable performance, 76.4% accuracy, on semantically ambiguous Korean words.
• Future work: apply the method to other languages by studying their characteristics.

Comments
• Advantage
  • Considers the relationship between each sense of the context words and the sense of the target word.
  • Uses the Viterbi algorithm to reduce computational complexity.
• Drawback
  • The system performs noticeably better on Korean than on English.
• Application
  • Word sense disambiguation
