250 likes | 338 Views
Where in the World is Carmen BitDiego? And who is she, anyways…. Alexandru IOSUP A.Iosup@ewi.t u delft.nl. The 12th annual ASCI Computing Workshop. Introduction (1 of 3). Peer-2-Peer File sharing Everybody has the same rights. P2P average everybody ? Who? Where? When? How? Why?
E N D
Where in the World is Carmen BitDiego? And who is she, anyways… Alexandru IOSUP A.Iosup@ewi.tudelft.nl The 12th annual ASCI Computing Workshop
Introduction (1 of 3) • Peer-2-Peer • File sharing • Everybody has the same rights. • P2P average everybody? • Who? • Where? • When? • How? • Why? • Tons of studies over the past 5 years • Saroiu’02, Yazti’02, Yzal’04, Pouwelse’04 • We go for something else! (tbs)
Introduction (2 of 3) • BitTorrent • Most used P2P network today (53% traffic) • Attributes • 2nd gen. P2P network – no centralized servers; optimizes transfer speed; favors high-bandwidth users; files are split in chunks • Peers – Trackers – Web sites • Tit-for-tat sharing mechanism – everybody gives some; except when they don’t… • no search at peer level • Owners are called seeds, we are called leeches • So much to know:I want my BitTorrent today!
Enters Carmen… • Carmen BitDiego • Famous P2P network • Location: unknown • Likes: who knows? • Clues to where she is: some history, lightweight hints • Caught (?) • NO Multi-files studies • NO Country-per-file • NO Organizations • NO, NO, NO… • Carmen SanDiego • Famous spy • Location: unknown • Likes: to hide • Clues to where she is: history, complicated hints • Never caught Who is this Carmen, anyways…
Introduction (3 of 3) • We track Carmen BitDiego • Tracked data attributes • Users got 204,454,719,497,935B (ok, 204,5TB) • 40,000,000 contacts • 200,000 unique users (*) • 120 files • 9 specific media types • The firstaliased media view • 7 unique views • We got her now! Or is it…
Mission statement • We want to know about Carmen BitDiego • Where she goes • Continent, country, city, organization • When she goes • Time-patterns per country • Time-patterns in seeds/leeches ratio • How many file chunks at any time? • With whom she hangs out • Special users? Super-peers, collector peers • Is she a good companion? • How many users get what they want? We’re getting to this info in no time…
Outline of the presentation • Intro • Enters Carmen… • Mission statement • Our data looks like this… • Methods, or how to catch her • Results, or how we caught her • Conclusions (done) (done) (done) (we are here) (coming up next)
Our data looks like this… • We track 120 files • 120 trace files • Time stamp, IP, port, # of chunks = record = 1observation • 12 big traces (+500,000 observations/trace) • December 2003 – January 2004 • 108 small traces • March 2004 • 3 global categories • All, Big, Small • 9 special categories • Movies, Games, Music, Applications • Alias media • Same contents, different names • Same language • Different language
Outline of the presentation • Intro • Enters Carmen… • Mission statement • Our data looks like this… • Methods, or how to catch her • Results, or how we caught her • Conclusions (done) (done) (done) (done) (we are here) (coming up next)
Methods, or how to catch her • We want to know about Carmen BitDiego • Where she goes • Un-DNS(*): continent (1), country (2), city (3), organization (4) • When she goes (5) • Parse and correlate Time-patterns per country • Parse and correlate Time-patterns in seeds/leeches ratio • Parse and correlate How many file chunks at any time? • With whom she hangs out (6) • Super-peers = nodes that own more than one complete file • Collector peers = nodes that try to get more than one file • Is she a good companion? (7) • How many users get what they want? * Thanks MaxMind (GeoIP lib, database) and WebLog Expert (databases)
Outline of the presentation • Intro • Enters Carmen… • Mission statement • Our data looks like this… • Methods, or how to catch her • Results, or how we caught her • Conclusions (done) (done) (done) (done) (done) (we are here) (coming up next) WARNING! We show only a selection of our results!
Results, or how we caught her • Where she goes • continent Europe is now the biggest BitTorrent consumer (not NA) Tit-for-tat discourages low-bandwidth users!
Results, or how we caught her • Where she goes • continent Not the same distribution for different sets of files! Europe is now the biggest BitTorrent consumer (not NA) Coarse media locality property Asia > North America (themed game) Tit-for-tat discourages low-bandwidth users!
Results, or how we caught her • Where she goes • country US still the biggest overall BitTorrent consumer – continent view can be misleading! NL is only 6th!
Results, or how we caught her Hong Kong, Chile: soccer management sim Israel: action movie Japan: animes • Where she goes • country Fine media locality!Countries have habits! Localized versions of the files attract local users! The Nederlands 6thRomania ~50th Themed files attract very specific audiences! What about a marketing study based on BitTorrent file ranks? Not the same distribution for different sets of files! US still the biggest overall BitTorrent consumer – continent view can be misleading!
Results, or how we caught her • Where she goes • city Dispersed locations Oldenburg, Eschborn, Herndon …Internet nodes placed outside major cities – cannot use this to track real users! 30% unknown – not reliable!
Results, or how we caught her • Where she goes • organization We’d like to thank:The Walt Disney Company,Sony Corporation, SANYO Electric Software Co. Ltd.,and Merrill Lynchfor actively supporting BitTorrent! Academic institutions < 10% users! Not the same distribution for different sets of files! 1 ISP covers +60% users 10 ISPs cover <50% users ISP caching policy different for different files and communities!
Results, or how we caught her • When she goes • Time-patterns per country Europe guides the time-patterns! 8:30AM, 1PM, 6-9PM, 12-1AMmostly at work, during slow hours?
Results, or how we caught her • When she goes • How many file chunks at any time? Causes:- trackers down- users interest down- others The network is not robust all the time – attacks at these precise moments could be fatal!
Results, or how we caught her • When she goes • Time-patterns per no. of chunks/seeders/leeches ratio users:seeds ~ 10:1leeches:seeds ~ 9:1chunks:seeds ~ 1000:1
Results, or how we caught her • With whom she hangs out • Super-peers = nodes that own more than one complete file • Collector peers = nodes that try to get more than one file Group Small:Collectors (n files) ~ 2x Superpeers (n files) # users / # files decreases exponentially!
Results, or how we caught her 113 81 81 81 YES! 1 Point = 1% of any file • Is she a good companion? Aliased Media results in exponential drop!people drop after getting 1/many Group Small users download whole files! Group Small Avg. (any) ~ 81 points Avg. (1 file) ~ 113 pointsAliased Media Avg. (any) ~ 52 points Avg. (1 file) ~ 109 points Users download 1 file then disconnect!
Outline of the presentation • Intro • Enters Carmen… • Mission statement • Our data looks like this… • Methods, or how to catch her • Results, or how we caught her • Conclusions (done) (done) (done) (done) (done) (done) (we are here)
Conclusions • Carmen BitDiego • Famous P2P network • Location: known • Likes: established(study per specific file groups) • Clues to where she is: complete hints • Multi-files study • Continents, Country, Cities, Organizations, global and per-file • Time-patterns in the users/seeds/leeches behavior (also country) • Super-nodes / collector nodes analysis • Carmen BitDiego almost caught! • Trivial and Non-trivial locality properties • Alias media hints • Need a full study w/ these methods to catch her!
Thank you… Questions? Remarks? Observations? All welcome! Alexandru IOSUP TU Delft A.Iosup@ewi.tudelft.nl http://www.pds.ewi.tudelft.nl/~iosup/index.html I would like to thank Johan Pouwelse and Pawel Garbacki for all their help in creating this study. Thank you, Johan! Thank you, Pawel! Their previous work:http://www.theregister.co.uk/2004/12/18/bittorrent_measurements_analysis/