80 likes | 273 Views
Vandalism Detection in Wikipedia using Trustworthy Ranking and Semantic Context Analysis . Deepika Sethi Raga Sowmya Tummalapenta. Wikipedia Vandalism. Wikipedia Benefits Problems Vandalism Example. Approach. Approach 1 Trustworthy Ranking based on a Trustworthy search engine
E N D
Vandalism Detection in Wikipedia using Trustworthy Ranking and Semantic Context Analysis DeepikaSethi Raga Sowmya Tummalapenta
Wikipedia Vandalism • Wikipedia • Benefits • Problems • Vandalism • Example
Approach • Approach 1 • Trustworthy Ranking based on a Trustworthy search engine • Approach 2 • Semantic Context Analysis using an Ontology.
Approach 1 • Collection of top ranked documents using a trustworthy search engine • Co-occurrence probability of an edit and its corresponding page. • Probability too low might imply out of context and vandalism. • Feature extraction followed by data-trained classification.
Approach 2 • DBpediaextracts data from Wikipedia. • Finding Word-relationships using ontology. • Semantic Distance • Feature extraction followed by data-trained classification.
Evaluation • Feature 1 : Co-occurrence probability • Feature 2 : Semantic Distance • Feature 3 : Combination of both • 10-fold cross-validation
Related Work • Wikipedia Vandalism Detection : Combining Natural Language, Metadata, and Reputation Features. • Elusive Vandalism Detection in Wikipedia : A Text Stability Based Approach • Wikipedia Vandalism Detection through Machine Learning