Focused Crawler. Ben Markines, Mira Stoilova, Fulya Erdinc. Introduction. Based on the paper presented the first week of class: Accelerated Focused Crawling through Online Relevance Feedback by Chakrabarti, presented by Mark Meiss.
Focused Crawler
Ben Markines, Mira Stoilova, Fulya Erdinc
Introduction
• Based on the paper presented the first week of class
  • Accelerated Focused Crawling through Online Relevance Feedback by Chakrabarti, presented by Mark Meiss
• Implemented a focused crawler and a focused crawler with an apprentice
  • The apprentice analyzes the words around a link
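The apprentice's input, the words around a link, can be sketched as follows. This is a minimal illustration and not the project's actual code: the class `LinkContextParser`, the function `link_contexts`, and the fixed token `window` are all hypothetical names and simplifications chosen for this example, and only the Python standard library is used.

```python
import re
from html.parser import HTMLParser


class LinkContextParser(HTMLParser):
    """Collects page tokens and records where each <a href> sits in the token stream."""

    def __init__(self):
        super().__init__()
        self.tokens = []   # lowercase word tokens in document order
        self.links = []    # (href, start_index, end_index) for each link
        self._open = None  # (href, start_index) of the link currently open

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._open = (href, len(self.tokens))

    def handle_endtag(self, tag):
        if tag == "a" and self._open is not None:
            href, start = self._open
            self.links.append((href, start, len(self.tokens)))
            self._open = None

    def handle_data(self, data):
        self.tokens.extend(re.findall(r"[a-z0-9]+", data.lower()))


def link_contexts(html, window=3):
    """Map each href to its anchor text plus `window` tokens on either side.

    The fixed token window is an assumption made for this sketch.
    """
    parser = LinkContextParser()
    parser.feed(html)
    parser.close()
    contexts = {}
    for href, start, end in parser.links:
        lo = max(0, start - window)
        hi = min(len(parser.tokens), end + window)
        contexts[href] = parser.tokens[lo:hi]
    return contexts


page = ('<p>Read our guide to web crawling '
        '<a href="/crawl">focused crawler tutorial</a> '
        'with python code samples here</p>')
print(link_contexts(page, window=2))
```

The token lists returned for each link could then be turned into feature vectors and passed to a classifier that predicts whether following the link is likely to lead to a relevant page.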
Crawler Implementation
• Feature extraction
  • Using document frequency and mutual information
• Baseline crawl using a classifier
  • Naïve Bayes
  • Cosine Similarity
  • Support Vector Machine
• Crawl with trained apprentice
  • Again using the same types of classifiers
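As a rough sketch of the feature extraction step, the code below filters terms by document frequency and ranks the survivors by mutual information with a binary relevance label, then shows a cosine similarity between sparse term-count vectors. The function names, the `min_df` and `top_k` thresholds, and the toy documents are assumptions made for illustration; the slides do not specify the project's actual parameters or data.

```python
import math
from collections import Counter


def document_frequency(docs):
    """Number of documents in which each term appears."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return df


def mutual_information(docs, labels, term):
    """Mutual information (in bits) between term presence and the relevance label."""
    n = len(docs)
    counts = Counter((term in tokens, bool(label))
                     for tokens, label in zip(docs, labels))
    mi = 0.0
    for t in (True, False):
        for c in (True, False):
            n_tc = counts[(t, c)]
            if n_tc == 0:
                continue  # the 0 * log(0) term is taken as 0
            n_t = counts[(t, True)] + counts[(t, False)]
            n_c = counts[(True, c)] + counts[(False, c)]
            mi += (n_tc / n) * math.log2(n * n_tc / (n_t * n_c))
    return mi


def select_features(docs, labels, min_df=2, top_k=3):
    """Drop rare terms, then keep the top_k terms by mutual information."""
    df = document_frequency(docs)
    candidates = [t for t, count in df.items() if count >= min_df]
    ranked = sorted(candidates,
                    key=lambda t: (-mutual_information(docs, labels, t), t))
    return ranked[:top_k]


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


# Toy corpus: label 1 = relevant to the crawl topic, 0 = not relevant.
docs = [["web", "crawler", "links"],
        ["crawler", "search", "web"],
        ["recipe", "cake", "web"],
        ["cake", "sugar", "recipe"]]
labels = [1, 1, 0, 0]

print(select_features(docs, labels))
print(cosine_similarity(Counter(docs[0]), Counter(docs[1])))
```

In this toy corpus "web" passes the document-frequency filter but ranks below the other candidates, since it appears in both relevant and non-relevant documents and so carries less information about the label.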