170 likes | 312 Views
A method of extracting malicious expressions in bulletin board systems by using context analysis. Presenter: Jun-Yi Wu Authors: Hiroshi Hanafusa , Kazuhiro Morita, Masao Fuketa , Jun- ichi Aoe. 國立雲林科技大學 National Yunlin University of Science and Technology. 2011 IPM. Outline. Motivation
E N D
A method of extracting malicious expressions in bulletin board systems by using context analysis Presenter: Jun-Yi Wu Authors: Hiroshi Hanafusa, Kazuhiro Morita, Masao Fuketa, Jun-ichiAoe 國立雲林科技大學 National Yunlin University of Science and Technology 2011 IPM
Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments
Motivation • The extracting scheme of the traditional method depends on words or a sequence of words without considering contexts of articles. • To takes a lot of human efforts to alert malicious articles. Non-malicious expression text Malicious expression text
Objective • To presents a new context filtering algorithm to reduce the effort of human and to improve the rate of false positive without degrading the rate of false negative.
Methodology • The presented system • Rule-based extracting knowledge • Multi-attribute matching
Methodology • The presented system
Methodology Non-malicious expression text Malicious expression text • The presented system • The outline of context analysis
Methodology • The presented system • Inadequate and crime expressions
Methodology • Rule-based extracting knowledge • Definition of multi-attribute rules • For example: “He kills someone”
Methodology • Multi-attribute matching • Construction of machines(MAPM) • Goto and out function
Methodology • Multi-attribute matching • Procedure • For example : “I get a strong sward”
Experiments • Time evaluation and error analysis
Conclusion • The presented method MULTI is a very useful approach for filtering services for inadequate expressions. • It is difficult task to register new words and expressions into dictionaries together with their categories and semantics. • The rules bases of the presented method MULTI is building for frequent expressions step by step, but there are difficult problems as shown in the following examples: • ‘‘RQJmcf2O” kill ‘‘Aaaaqqqbbb” 16
Comments • Advantage • Many examples • Drawback • Some mistakes • Application • Information retrieval • Context analysis 17