250 likes | 384 Views
An Algorithm for Measuring Optimal Connections in Large Valued Networks. Song Yang Henry Hexmoor Sociology Computer Science University of Arkansas Preparation of this presentation benefits from cogent comments from Jim Hollander. Binary Distance.
E N D
An Algorithm for Measuring Optimal Connections in Large Valued Networks Song Yang Henry Hexmoor Sociology Computer Science University of Arkansas Preparation of this presentation benefits from cogent comments from Jim Hollander
Binary Distance • In binary graphs, path distance is normally used to indicate the optimal connections between a pair of nodes. This solution assumes that intermediaries are costly.
Binary Distance • If more intermediaries are necessary to connect a pair of actors, they may extract higher commissions for their services, distort the information content exchanged, and increase the time required to complete a transaction.
VALUED GRAPHS • Valued graph is defined as a graph whose lines carry numerical values indicating the intensities of the relationships between all dyads. • For example, volumes of communications, levels of friendship and trust, or dollar amounts of economic transactions.
Optimal Connections in Valued Graphs • Previous researchers propose a solution to measure optimal connections in valued graphs. Peay (1980) states that path value, defined as the smallest value attached to any line in a path, indicates the optimal path between a pair of nodes.
Problems • The problems of Peay’s path value solutions • How to determine the path value/optimal connection when multiple paths/path values present between two nodes? • How to account for the transaction costs of exchanges involving many go-betweens?
Our Solution • We argue that including binary distance is especially crucial for measuring path strength in a valued graph • Because it takes into account the costs (in time, energy, or decay of information) required for indirectly connected dyads to reach one another through varying numbers of intermediaries.
APV • A measure of Average Path Value (APV) between nodes ni and nj is the ratio of path value to distance, indicated by
APV • Note that a pair of nodes may have multiple paths, thus containing multiple APVs. We suggest that the highest APV indicates the optimal connection between the pair of nodes.
APV • So optimal connection permits the highest volume of things such as transactions, messages, contracts, treaties or friendships after controlling for the binary distance between the two nodes.
Applications of APV • Full Network Data • Strategic Alliance Network among a set of firms under focus
The Algorithm • Step 1 involves identifying different connected components in a graph with Union Find Algorithm. A connected component consists of a set of nodes, in which each node can reach every other node in the set. • Step 2 involves calling of a subroutine called MAPVC to process optimal connections in each connected components • Step 3 ensures all the connected components are processed and results organized into a matrix for further analyses
MAPVC • MAPVC considers each node v one at a time and incrementally constructs a path from that node to all other nodes. MAPVC calls a subroutine Maximum APV (MAPV) to process each node
MAPV • Let us start with v (i) • First a node v (j) is picked so it has a maximum APV (path values/number of lines) with v (i). • The path linking v (i) and v (j) becomes the path for subsequent extension. • Suppose a node v (k) is picked extending the v (i) – v (j) path.
MAPV • If the path value of v (j) – v (k) path is smaller than the v (i) – v (j) path, the v (j) – v (k) path value will replace the original v (i) – v (j) path value to compute the APV for v (i) – v (k) path • For every extension, the algorithm picks up path with the largest APV and NEVER extended before.
MAPV • The process continues until every path in the connected component matrix was either extended or was a terminal path, which was because either no other nodes is reachable or circular path occurs (the path connects back to the beginning node) • In the end, the algorithm compares different APVs during each stage of path extension and picks up the largest APV to indicate the optimal connection between the node v (i) and node v (k)
MAPV-MAPVC-Union Find • MAPV for single node in a connected component • MAPVC calls on MAPV to process all the nodes in a connected components • Union Find calls on MAPVC to process all the connected components in a graph
A 1 3 • Example 3 1.5 B E 3 1 1.5 4 2 D C 6
Application and Limitation • Data have to be full network, instead of ego-centered network data • Does not account for signs of links, always assume positive relations • Does not account for directions, only for non-directional graphs. In other words, input and output matrices are symmetrical
Data • Data matrix are strategic alliances among 38 companies in the Informational Technology in 1998 • This dataset comes from a large database focusing on 145 IT companies from 1989 to 2002, collected by David Knoke and his associates.