TwitterFeedRank is a tweet search tool that supports both keyword search and personalized search. A crawler collects data through the Twitter API at 20,000 requests per hour, Lucene handles indexing and searching, and results are sorted by FeedRank, a PageRank-style score computed from the link graph built during the crawl. Future plans include improved feed recommendation and tweet category prediction using a Naive-Bayes classifier.
TwitterFeedRank • Nick Flacco • Dalton Huynh • Abhishek Jha • Phong Lam
Overview • TwitterFeedRank provides an easier and more powerful way of searching tweets • Keyword Search • Personalized Search
Crawler • Crawl with Twitter API • 20,000 requests per hour • Weekly crawls • Link Graph for computing rank
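A minimal sketch of how a crawl under a request budget could build the link graph. The slides do not show the crawler's code, so `crawl`, `fetch_friends`, and the breadth-first order are assumptions for illustration; `fetch_friends` stands in for whatever Twitter API call returns the accounts a user follows.

```python
from collections import deque

REQUESTS_PER_HOUR = 20_000  # budget quoted on the slide


def crawl(seed, fetch_friends, budget=REQUESTS_PER_HOUR):
    """Breadth-first crawl that records a follower -> followee link graph.

    fetch_friends is a hypothetical callable: user -> iterable of users.
    One call is counted as one API request (an assumption).
    """
    graph = {}
    queue = deque([seed])
    requests = 0
    while queue and requests < budget:
        user = queue.popleft()
        if user in graph:
            continue  # already crawled, do not spend another request
        friends = fetch_friends(user)
        requests += 1
        graph[user] = set(friends)
        queue.extend(f for f in friends if f not in graph)
    return graph
```

With an in-memory stand-in such as `crawl("a", {"a": ["b"], "b": []}.get)`, the result maps each crawled user to the set of users they follow. That adjacency map is the form a rank computation would consume.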
Indexer • Lucene for indexing and searching • Feeds as documents, with fields: (UserId, screen name), (UserId, FeedRank), (UserId, TweetId, Tweet/Status, Date)
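To illustrate the "feeds as documents" idea without depending on Lucene, here is a toy inverted index in plain Python. It is a stand-in, not the Lucene API: `TweetDoc`, `TinyIndex`, and whitespace tokenization are all assumptions. The document fields mirror the (UserId, TweetId, Tweet/Status, Date) tuple from the slide.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class TweetDoc:
    user_id: int
    tweet_id: int
    text: str
    date: str


class TinyIndex:
    """Toy inverted index: term -> set of tweet ids containing it."""

    def __init__(self):
        self.postings = defaultdict(set)
        self.docs = {}

    def add(self, doc):
        self.docs[doc.tweet_id] = doc
        for term in doc.text.lower().split():
            self.postings[term].add(doc.tweet_id)

    def search(self, query):
        """Return documents containing every query term, ordered by tweet id."""
        terms = query.lower().split()
        if not terms:
            return []
        ids = set.intersection(*(self.postings.get(t, set()) for t in terms))
        return [self.docs[i] for i in sorted(ids)]
```

A real index would add analysis (stemming, stop words) and relevance scoring, which is what Lucene provides out of the box.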
Ranker • Computed from link graph created during crawl • PageRank for Twitter Feeds • Index each user with their computed rank in Lucene • Results are sorted using FeedRank
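The slide describes FeedRank as PageRank applied to Twitter feeds. A minimal power-iteration sketch of PageRank follows. The damping factor of 0.85, the fixed iteration count, the edge direction (follower to followee), and the name `feed_rank` are assumptions, since the slides do not give the actual parameters.

```python
def feed_rank(graph, damping=0.85, iterations=50):
    """PageRank by power iteration over a dict: user -> set of followed users."""
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    n = len(nodes)
    if n == 0:
        return {}
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        # Rank held by users with no outgoing links is spread over everyone.
        dangling = sum(rank[u] for u in nodes if not graph.get(u))
        new = {u: (1 - damping) / n + damping * dangling / n for u in nodes}
        for u, targets in graph.items():
            if targets:
                share = damping * rank[u] / len(targets)
                for v in targets:
                    new[v] += share
        rank = new
    return rank
```

Sorting search hits by this score, for example `sorted(hits, key=lambda d: ranks[d.user_id], reverse=True)` with hypothetical `hits` and `ranks`, would give the FeedRank ordering the slide mentions.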
Analyzer • Find tweets from your friends-of-friends • Use common friends among your friends
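One way to read the analyzer step is: collect the accounts followed by your friends, then order them by how many of your friends they have in common. The sketch below implements that reading. The function name and the tie-breaking rule are assumptions, not the authors' method.

```python
from collections import Counter


def friends_of_friends(graph, user):
    """Return (candidate, common_friend_count) pairs, most common first.

    graph maps each user to the set of users they follow.
    """
    direct = graph.get(user, set())
    counts = Counter()
    for friend in direct:
        for candidate in graph.get(friend, set()):
            # Skip the user themselves and anyone they already follow.
            if candidate != user and candidate not in direct:
                counts[candidate] += 1
    # Ties are broken alphabetically to keep the output deterministic.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
```

Tweets from the top candidates could then be pulled from the index to populate the personalized search results.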
Web UI • Features: Keyword Search, Personalized Search • Technologies: PHP, Lucene (Java)
Future Work • Improved Feed Recommendation: classify tweets using the 16 categories in OpenDirectory; train a Naive-Bayes classifier to predict the category of a given tweet • Improve FeedRank: factor in the number of retweets and/or mentions
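The planned category prediction could look roughly like the multinomial Naive Bayes sketch below, with add-one (Laplace) smoothing. This is an illustration of the technique only: the class name, whitespace tokenization, and the example category labels are assumptions, and no training data from the project is shown here.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes over whitespace tokens, add-one smoothing."""

    def __init__(self):
        self.class_counts = Counter()            # category -> number of tweets
        self.word_counts = defaultdict(Counter)  # category -> word -> count
        self.vocab = set()

    def train(self, text, category):
        self.class_counts[category] += 1
        for word in text.lower().split():
            self.word_counts[category][word] += 1
            self.vocab.add(word)

    def predict(self, text):
        """Return the category with the highest log-probability, or None."""
        total = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for category, n in self.class_counts.items():
            score = math.log(n / total)  # log prior
            denom = sum(self.word_counts[category].values()) + len(self.vocab)
            for word in text.lower().split():
                score += math.log((self.word_counts[category][word] + 1) / denom)
            if score > best_score:
                best, best_score = category, score
        return best
```

Working in log space avoids underflow when many small word probabilities are multiplied, and the smoothing term keeps unseen words from zeroing out a category.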