Today: Web – Millions of users. The Web is like a “laboratory” for studying millions of people at once. Users leave detailed traces of their social activity. Large online applications have hundreds of millions of users.
Connecting the dots… As we connect the dots into a network, patterns emerge.
Navigating the world’s social network. The network: 180M people, 1.3B undirected edges.
The 6 degrees of separation • Small-world experiment [Milgram ‘67] • 64 letters are forwarded from Nebraska to Boston • How many steps does it take? Average path length is 6.2, hence “6 degrees of separation”
Microsoft Instant Messenger (180M people, 1.3B undirected edges). Number of steps between 180 billion pairs of people: avg. path length 6.6. 90% of the people can be reached in < 8 hops.
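A minimal sketch of how such hop counts can be measured: breadth-first search from each source node gives shortest-path lengths in an unweighted, undirected graph, from which the average path length and the fraction of pairs within a hop limit follow. The toy graph, the function names `bfs_hops` and `path_length_stats`, and the adjacency-list layout are illustrative assumptions, not the setup used for the Messenger data. On a graph with 180M nodes, one would presumably run BFS from a sample of sources rather than from every node.

```python
from collections import deque


def bfs_hops(graph, source):
    """Return {node: hop count} for every node reachable from source."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:  # first visit is the shortest path in BFS
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def path_length_stats(graph, max_hops):
    """Average shortest-path length over connected ordered pairs, and the
    fraction of those pairs that are at most max_hops apart.
    Assumes the graph has at least one connected pair."""
    total = pairs = within = 0
    for s in graph:
        for t, d in bfs_hops(graph, s).items():
            if t == s:
                continue
            total += d
            pairs += 1
            if d <= max_hops:
                within += 1
    return total / pairs, within / pairs


# Hypothetical toy network: a chain a-b-c-d, plus e attached to b.
toy = {
    "a": ["b"],
    "b": ["a", "c", "e"],
    "c": ["b", "d"],
    "d": ["c"],
    "e": ["b"],
}

avg, frac = path_length_stats(toy, max_hops=2)
print(f"average path length: {avg:.2f}, within 2 hops: {frac:.0%}")
```

Each BFS costs time proportional to the number of nodes plus edges, so sampling sources keeps the total cost manageable while still estimating both statistics.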
Use machine learning to find the target person. The green bar is the probability that a node is good.
What does the web talk about? • 1.6 million news media and blog sites • 1. million articles a day • What do they talk about? Who is imitating/copying whom?
Info propagation on the web • News media write articles that refer (link) to other articles, and the information spreads • We can track the information as it spreads and mutates over millions of websites
Question… • I have 10 minutes. Which news sites should I read to be most up to date? • Who are the most influential bloggers?
Problem: Covering blogs • Given a budget (e.g., of 3 blogs) • Select blogs to cover the most of the blogosphere • Bad news: Solving this exactly is NP-hard • Good news: Theorem: Can do it in linear time and within a factor of 3 of optimal [Figure labels: “topics”, Blogosphere]
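To make the covering problem concrete, here is a sketch of the textbook greedy heuristic for maximum coverage: repeatedly pick the blog that adds the most not-yet-covered topics until the budget is spent. This is not necessarily the linear-time algorithm the theorem refers to. It is the standard greedy method, which for unit costs is known to reach at least a (1 - 1/e) fraction of the optimal coverage. The blog names, topic sets, and the function name `greedy_cover` are made up for illustration.

```python
def greedy_cover(blog_topics, budget):
    """Pick up to `budget` blogs, each time taking the blog with the
    largest marginal gain in newly covered topics."""
    covered = set()
    chosen = []
    remaining = dict(blog_topics)  # shallow copy so the input is untouched
    for _ in range(budget):
        if not remaining:
            break
        best = max(remaining, key=lambda b: len(remaining[b] - covered))
        if not remaining[best] - covered:
            break  # no blog adds anything new
        chosen.append(best)
        covered |= remaining.pop(best)
    return chosen, covered


# Hypothetical data: which topics each blog writes about.
blogs = {
    "blog_a": {"politics", "sports", "tech"},
    "blog_b": {"tech", "science", "health"},
    "blog_c": {"food"},
    "blog_d": {"politics", "sports"},
}

chosen, covered = greedy_cover(blogs, budget=2)
print(chosen, sorted(covered))
```

This naive version rescans every remaining blog in each round. Faster variants avoid most of those rescans by exploiting the fact that a blog's marginal gain can only shrink as more topics get covered.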
www.blogcascades.org So, who is influential? What should I read?