90 likes | 244 Views
Simple Plagiarism Detection. Duy Nguyen Ramakrishna Bhandage. Problem Definition. Implement a simple application to determine if a suspect text file is plagiarized by comparing it with the set of corpora in the given library (set of text files).
E N D
Simple Plagiarism Detection Duy Nguyen Ramakrishna Bhandage
Problem Definition • Implement a simple application to determine if a suspect text file is plagiarized by comparing it with the set of corpora in the given library (set of text files). • The suspect file and corpora are in simple text format (*.txt) • Algorithms used to detect the plagiarism: • LCSS • KMP • Rabin-Karp • Compare the performance of algorithms used in the implementation
Major Components • SimplePlagiarism: • detect() • getRunningTime() • onAlgorithmStarted() and onAlgorithmEnded() • SimplePlagiarism subclasses: • LCS: getLCSSLength() • KMP: • tableBuild() • match() • RabinKarp: • match() • Utility: • getParagraphs() • getSentences() • normalizeText()
Running time comparison A chart will go here….