Why We Twitter : Understanding Microblogging Usage and Communities ( 과제 세미나 ) 2012.5.11

Why We Twitter: Understanding Microblogging Usage and Communities(과제 세미나) 2012.5.11 정보보증 연구실 석사 23기 윤수진

목차 이 논문에서 배울 점 내용 결론 – 과제와의 연계

이 논문에서 배울 점 • HITS 알고리즘 적용 • Post(tweet) 분류 제안 • User 분류 제안

Twitter와 Web 비교

자료 수집 • 2007.4.1~2007.5.30까지 자료를 수집 • 76,177명의 user, 1,348,543개의 tweet • User를 node로, following(friend)를 edge로 삼아 그래프를 그림

HITS 알고리즘 이용 • Many followers, less friends-> high authority, low hub score • Less followers, many friends-> high hub score, low authority

Community 분류에 사용한 알고리즘 • Modularity • The strength of the community structure 측정 • 단점 : 한 node의 overlap 허용 안 함 • CPM(Clique Percolation Method) • Overlapping communities in network를 찾아냄

Modularity

Twitter 적용 예

Tweet 분석

Post(Tweet) 분류 • Manually 분류 • Daily Chatter • Daily routine, currently doing • The largest and most common user • Conversations • @ • 21% of user use it • Sharing information/URLS • 13% of all posts • Reporting news • Latest news, comment about current events

User 분류 • Based on the link structure • Information Source • Hub, a large number of followers • (may) regularly post, have valuable post • Friends • Most relationships • Information Seeker • (might) post rarely • Many following

과제와의 연계 • 본 논문에서는 특정 topic 안에서 user를 role을 찾아 표현 했다. • 비록 사건의 시작점을 찾아야하는 것도 중요하지만, 어느 시점에서 폭발적으로 사건이 나타날 때는 information source를 거치는 가에 대해서 고려해볼만 하다

Why We Twitter : Understanding Microblogging Usage and Communities ( 과제 세미나 ) 2012.5.11