300 likes | 412 Views
TIE STRENGTH IN MICRO-BLOGS. He Yaxi 2012.4.26. INTRODUCTION. Micro-blogs A kind of person-based RSS service. Forming a tie through follow even does not need permission of the two.
E N D
TIE STRENGTH IN MICRO-BLOGS He Yaxi 2012.4.26
INTRODUCTION • Micro-blogs • A kind of person-based RSS service. • Forming a tie through follow even does not need permission of the two. • A recent study of Facebook showed that users only poke and message a small number of people while they have a large number of declared friends. • Both acquaintances and close friends appear in the same social list with same methods. • The declared social networking is incompetent to reflect the real social relation of users.
STRUCTURE • Introduction • Methodology • Experimental results • Network comparison
INTRODUCTION • Tie-strength • Mark Granovetter was the first to propose the notion of tie strength in “The Strength of Weak Ties”. • He highlighted the important role that tie strength plays in information exchange between people. • Modeling tie strength in SNS contributes to the understanding of the exchange and transmission of information and influence between users. • In fact, tie strength represents unequal contacts as the weight of links in real interactive social network instead of the one that forms through follow.
INTRODUCTION • Weak ties • In job finding practices, job opportunities were found mostly through word-of-mouth communication with weak ties, for the reason that acquaintances who traveled in different circles and had more access to different information than strong ties or close friends • Strong ties • When one may wish to use only close contacts to gather or acquire information, for instance, one may be interested in assembling a team or otherwise gathering information that is distributed in different parts of a social network using only strong ties. • Tie strength in Micro-blogs • One may follow lots of users such as friends in reality, celebrities or enterprises. • It is unreasonable for user to get the flow of information in the same way as not all of his followees appeal to him equally. • It means a lot to distinguish different ties in terms of strength in the static network forming through follow.
IDEA • As two users forming a tie is much more easier than maintaining a tie in SNS, we focus on the records of interaction behaviors. • Interaction behaviors • Comment • Retweet • Mention
CONTRIBUTIONS • Measurement of tie strength between users. • Key factors that matter tie strength significantly. • Acquisition of a weighted, dynamic network. • Property variation of network topology.
STRUCTUER • Introduction • Methodology • Experimental results • Network comparison
METHODOLOGY • Dimensions of tie strength • Data collection • Variables mapping • Model construction
DIMENSIONS OF TIE STRENGTH • Contact frequency • SNS providers try to present user with information in terms of stream to promote the amount of interactive behaviors. • Contact frequency promotes other dimensions. • Emotional intensity • The recognition of entities produces intrinsic emotions. • Stress more on cognition of the other. • Intimacy • The closeness of relationship • Deep affection between two entities acting as a sense of reliance and security. • They are willing to talk with open mind to get or provide recognition and support. • Reciprocity • the basic condition to establish and maintain a link. • measured by cost and profit including time, energy ,emotion, etc. • Cost less and gain more will increase the tie strength.
DATA COLLECTION: • Users from Sina Micro-blogs through API. • The original network is defined by users’ followees. • 10 users are defined as participants from BUPT. • Participants requirement • Account is created more than six months • With over 100 followers, 100 followees, 200 statuses • Followees requirement • NO Verified-users. • NO users with more than 2000 followers or 2000 statuses. • NO users with under 20 statuses or10 followers. • With 40~50 followees left randomly.
VARIABLES MAPPING Independent variables
VARIABLES MAPPING Dependent variables Participants mark the relationship with some of his followees in the questionnaire.
DATASET • 关于微博、粉丝、关注、评论、回复的几个统计图
MODEL CONSTRUCTION • Data normalization • Polynomial Regression • Native Bayesian classification
Data normalization • Data normalization for each participant • This linear transformation locates all data in [0, 1], and the numeric difference is relevant. • Notes • Every time calculating strength for a new tie, it depends on other ties the user owns. • It happens in reality, for the reason that relationship difference is relevant .
POLYNOMIAL REGRESSION • Estimate tie strength where variables affect tie strength as a linear combination. • With stepwise regression in three datasets to determine key variables affecting tie strength significantly.
NATIVE BAYESIAN CLASSIFICATION • Goals • Test the results of the polynomial regression. • Method • Define the normalized value of tie strength under mean value as weak ties and others are strong ties. • Test the result of classification with U1_Set. • Results • With variables obtained above, higher precision of classification could be obtained than with all of variables.
CORRELATION ANALYSIS • Analyze the correlation in U1_Set. • Correlation between each dimension with tie strength =》effective service to improve use experience • Correlation between variables with its dimension =》mapping reason
STRUCTUER • Introduction • Methodology • Experimental results • Network comparison
TOPOLOGICAL COEFFICIENTS • Density • The closeness of nodes in the network. • Actual links/all possible links • For directed graph: L/N(N-1) • For undirected graph: 2L/N(N-1) • Clustering coefficient • For a node • the coincidencedegree of friends. • between neighbor nodes: actual links /all possible links • For a network • average clustering coefficient of all nodes • higher in SNS than in random network
TOPOLOGICAL COEFFICIENTS • Shortest path length • The average number of steps along the shortest paths for all possible pairs of network nodes. • It is a measure of the efficiency of information or mass transport on a network. • Small world network theory predicts that the average path length changes proportionally to log n, where n is the number of nodes in the network. • Diameter • Maximal distance between any two nodes in the network. • Six-degree Thoery.
Distribution of in-degree Distribution of clustering coefficient Distribution of out-degree
APPLICATION • Display of information stream. • Access control for users in Micro-blogs. • …………