350 likes | 500 Views
The Query-Cycle Simulator for Simulating P2P Networks. Mario T. Schlosser Tyson E. Condie Sepandar D. Kamvar Stanford University. For each peer i { -Repeat until convergence { - Compute . . . - Send . . . } }. Problem. Problem:
E N D
The Query-Cycle Simulator for Simulating P2P Networks Mario T. Schlosser Tyson E. Condie Sepandar D. Kamvar Stanford University
For each peer i { -Repeat until convergence { -Compute. . . -Send . . . } } Problem • Problem: • Accurately Simulate Real-World P2P Networks. • Motivation: • Testing P2P Algorithms.
Goals • P2P Simulator • Descriptive • Simple • Easily Extensible • Make it available on the web so that people can test and compare their algorithms on a standard platform.
Query Cycle Model Query Cycle 1
Query Cycle Model Query Cycle 2
Query Cycle Model Query Cycle 3
Properties to Model • Peer Content • Network Parameters • Peer Behavior
Properties to Model • Peer Content • How Much? • What Type? • Network Parameters • Peer Behavior
Data Volume • Observations • Model Saroiu,Gummandi,and Gribble. A Measurement Study of Peer-to-Peer File Sharing Systems, 2002. Simulator assigns # of files owned by peer i according to distribution.
Content Type: Observations • Content Categories • Zipf distribution on file popularity Punk Rock Hip-Hop Jazz Crespo and Garcia-Molina. Semantic Overlay Networks, 2002. Korfhage, Information Storage and Retrieval, 1997.
Content Type: Model • Modeling Content Categories: • Assume n content categories. C={c1,c2,…,cn} • A peer i is assigned content categories according to the Zipf distribution: • It is then assigned an interest level p(c|i) to each of the assigned content categories by a uniform random distribution.
Content Type: Model • Modeling Files: • Each distinct file f may be uniquely identified by {c,r} • A peer is assigned files by:
Recap on Content Assignment Assign Data Volume
Recap on Content Assignment {c1, c3, c4} Assign Content Categories
Recap on Content Assignment {c1=.5, c3=.3, c4=.2} Assign Interest Level to Content Categories
Recap on Content Assignment {c1=.5, c3=.3, c4=.2} {c,r}={c1,f7} {c,r}={c1,f1} . . . Assign Files
Properties to Model • Peer Content • Network Parameters • Topology • Bandwidth • Peer Behavior
Network Parameters • Topology: • Observation: Power Law Topology • Model: probability of connecting to a peer is proportional to the degree of that peer. • Bandwidth • Simple Bandwidth Model • Can be easily extended.
Properties to Model • Peer Content • Network Parameters • Peer Behavior
Query-Cycle Model • At each cycle, peer i may be: • active • inactive • or down
Query-Cycle Model • At each cycle, peer i may be: • active • inactive • or down • Issues a single query. • Waits for incoming responses. • Selects a source and downloads file. • Also: • Responds to queries. • Forwards query messages.
Query-Cycle Model • At each cycle, peer i may be: • active • inactive • or down • Responds to queries. • Forwards Query Messages.
Query-Cycle Model • At each cycle, peer i may be: • active • inactive • or down • Does nothing.
Properties to Model • Peer Content • Network Parameters • Peer Behavior • Uptime and Session Duration • Query Activity • Queries • Query Responses • Downloads
Uptime • Observations • Model Saroiu,Gummandi,and Gribble. A Measurement Study of Peer-to-Peer File Sharing Systems, 2002. At each query cycle, probability of being up is drawn from distribution in Saroiu et al.
Queries • Observations • None • Model • Based on the idea that peers query for files in the same categories that they own.
Responses and Downloads • Responses • If a peer receives a query for which it owns the file, it responds. • Source Selection • Random
Extensions • Different Types of Peers • i.e., Malicious Peers • Different Models for Different Situations • Reputation-based source selection. • Edutella: model distribution over markups rather than content categories. • Web Services: Change models for content distribution, query activity, etc. However, parameters are the same.
Future Work • Test predictions against observations in P2P networks “in the wild”. • Observations, observations, observations. • Model other networks.
The End • Code, demos will be available at http://www.stanford.edu/~sdkamvar/research.html next monday.
Motivation Affected algorithms Network or peer property • Structuring algorithms • Whatever • Stability of trust algorithms • Topology • Content distribution • Bandwidth, uptime of peers
Query Activity • Observations • Model Saroiu,Gummandi,and Gribble. A Measurement Study of Peer-to-Peer File Sharing Systems, 2002. At each query cycle, . . .