Elusive Vandalism Detection in Wikipedia
Deepika Sethi, Raga Sowmya T
Computer Science Department, University of Georgia
dsethi@uga.edu, sowmya@uga.edu
Introduction
• Vandalism: deliberate activity that compromises the integrity of Wikipedia.
• Problem: around 8% of Wikipedia edits are vandalism.
• Elusive vandalism: lacks the usual characteristics of vandalism and is therefore hard to detect.
• Examples: abusive words, changed dates.
• Use Google to estimate the co-occurrence probability of a word in a Wikipedia article and the article's page title.
• A very low probability may indicate that the word is out of context and therefore possibly vandalism.

Preliminary Results
• Observed an order-of-magnitude difference in co-occurrence probability between vandalized and non-vandalized words.
• Able to identify vandalism edits that human editors had not detected.

Objective
Because a large number of users rely on Wikipedia for useful information, we aim to detect vandalism on Wikipedia pages.

[Diagram: Extract Words]

Context Aware Approach
• Detects vandalism based on the context in which a word is used.
• Identifies words that are out of context with the existing words in an article.

Acknowledgements
We would like to express our appreciation to Professor Dr. Lakshmish Ramaswamy and Dr. Kang Li.
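
Illustrative sketch
The co-occurrence check described in the Introduction could look roughly like the Python sketch below. The poster does not give a formula, so everything here is an assumption: the probability is taken as hits(word and title) / hits(word), the search engine is hidden behind a hypothetical `hit_count` callable supplied by the caller, and a word is flagged when its probability falls at least `magnitude_gap` orders of magnitude below the median for the article. None of these names or thresholds come from the authors.

```python
import math
from typing import Callable, Iterable, List

# Hypothetical interface: maps a search query to a number of hits.
# How the counts are obtained (search engine, API, cache) is left open.
HitCounter = Callable[[str], int]


def cooccurrence_probability(word: str, title: str, hit_count: HitCounter) -> float:
    """Assumed definition: hits for word and title together / hits for word."""
    word_hits = hit_count(word)
    if word_hits == 0:
        return 0.0
    joint_hits = hit_count(f'"{word}" "{title}"')
    return joint_hits / word_hits


def flag_out_of_context(
    words: Iterable[str],
    title: str,
    hit_count: HitCounter,
    magnitude_gap: float = 2.0,  # assumed threshold, in orders of magnitude
) -> List[str]:
    """Return words whose probability is far below the article's median."""
    probs = {w: cooccurrence_probability(w, title, hit_count) for w in words}
    positive = sorted(p for p in probs.values() if p > 0)
    if not positive:
        return []  # no usable evidence, so flag nothing
    median = positive[len(positive) // 2]
    flagged = []
    for word, p in probs.items():
        # A zero probability or a large log-ratio marks the word as suspicious.
        if p == 0 or math.log10(median / p) >= magnitude_gap:
            flagged.append(word)
    return flagged
```

Comparing against the article's own median rather than a fixed cutoff is a design choice made for this sketch: it keeps the test relative, in the spirit of the order-of-magnitude observation in the Preliminary Results. A flagged word is only a candidate for review, not proof of vandalism.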