Realtime Location Privacy Via Mobility Prediction: Creating Confusion at Crossroads • Joseph Meyerowitz, Undergraduate Senior, ECE and Physics, Duke University • Romit Roy Choudhury, Asst. Professor, Dept. of ECE and CS, Duke University
Context Better localization technology + Pervasive wireless connectivity = Location-based pervasive applications
Location-Based Apps • For example: • GeoLife shows grocery list on phone when near WalMart • Micro-Blog allows querying people at a desired region • Location-based ad: Phone gets coupon at Starbucks • … • Location expresses context of user • Facilitating content delivery • It's as if location is the IP address for content
Double-Edged Sword • While location drives this new class of applications, it also violates the user’s privacy • The sharper the location, the richer the app, the deeper the violation • Moreover, a range of apps are PUSH-based and require continuous location information • Phone detected at Starbucks: PUSH a coffee coupon • Phone located on highway: query traffic congestion
Location Privacy • Problem: Continuous location exposure is a serious threat to privacy • Research: Preserve privacy without sacrificing the quality of continuous location-based apps
Just Call Yourself “Freddy” • Pseudonyms • Effective only when location exposure is infrequent • Else, spatio-temporal patterns are enough to deanonymize … think breadcrumbs • [Figure labels: Leslie, Jack, John, Susan, Alex, Romit’s Office]
Add Noise • K-anonymity • Convert location to a space-time bounding box • Ensure K users in the box • Location apps reply to boxed region • Issues • Poor quality of location • Degrades in sparse regions • Not real-time • [Figure: bounding box around “You” with K=4]
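The bounding-box idea above can be illustrated with a minimal sketch. It is not the method of any particular K-anonymity system: the names `Box` and `cloak`, the square shape, the fixed growth step, and the purely spatial treatment (no time dimension) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box (hypothetical helper, units are arbitrary)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def contains(self, point):
        x, y = point
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max

    def area(self):
        return (self.x_max - self.x_min) * (self.y_max - self.y_min)


def cloak(me, others, k, step=10.0, max_half_width=10_000.0):
    """Grow a square box around `me` until it covers at least k users.

    Assumption: the box is centered on the user, which is itself a leak;
    this is kept only to make the sketch short.
    Returns None if no box within max_half_width holds k users.
    """
    half = step
    while half <= max_half_width:
        box = Box(me[0] - half, me[1] - half, me[0] + half, me[1] + half)
        count = 1 + sum(box.contains(p) for p in others)  # 1 counts `me`
        if count >= k:
            return box
        half += step
    return None
```

With K=4, three neighbors within a few units yield a small box, while neighbors hundreds of units away force a box thousands of times larger in area. That growth is the "degrades in sparse regions" issue: the application can only answer for the whole box.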
Confuse Via Mixing • Path intersections are an opportunity for privacy • If users intersect in space-time, an adversary cannot tell who is who afterward • Issues • Users may not be collocated in space and time • Mixing is still possible, at the expense of delay
Existing solutions seem to suggest: Privacy and Quality of Localization (QoL) are a zero-sum game • Need to sacrifice one to gain the other
Our Goal • Break away from this tradeoff • Target: • Spatial accuracy • Real-time updates • Privacy guarantees • Even in sparse populations • We design: CacheCloak
CacheCloak Intuition • Exploit mobility prediction to create future path intersections • Users’ paths are like crossroads of breadcrumbs • App knows precise locations, but doesn’t know the user
CacheCloak • Assume trusted privacy provider • Reveal location to CacheCloak • CacheCloak exposes anonymized location to Loc. App • [Figure: CacheCloak sits between the user and Loc. App1 through Loc. App4]
CacheCloak Design • User A drives down path P1 • P1 is a sequence of locations • CacheCloak has cached response for each location • User A takes a new turn (no cached response) • CacheCloak predicts mobility • Deliberately intersects predicted path with another path P2 • Exposes predicted path to application • Application replies to queries for entire path • CacheCloak always knows user’s current location • Forwards cached responses for that precise location
CacheCloak Design • Adversary confused • New path intersects paths P1 and P2 (crossroads) • Not clear where the user came from or turned onto Example …
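The design steps above can be sketched as a toy proxy on a grid. This is a sketch under stated assumptions, not the authors' implementation: the class name `CacheCloakSketch`, the `app` callable, and the straight-line "predictor" are hypothetical stand-ins (the slides say only that CacheCloak predicts mobility, not how).

```python
class CacheCloakSketch:
    """Toy trusted proxy: answers from cache, predicts a path on a miss."""

    def __init__(self, app, max_len=50):
        self.app = app          # hypothetical location app: cell -> response
        self.cache = {}         # cell -> cached response
        self.max_len = max_len  # cap on how far a prediction may extend
        self.exposed = []       # batches of cells revealed to the app

    def _extend(self, cell, heading):
        """Walk from `cell` along `heading` until an already-cached cell.

        Assumption: straight-line extrapolation stands in for real
        mobility prediction.
        """
        path = []
        x, y = cell
        dx, dy = heading
        for _ in range(self.max_len):
            x, y = x + dx, y + dy
            if (x, y) in self.cache:
                break  # the predicted path now touches an existing path
            path.append((x, y))
        return path

    def request(self, cell, heading):
        """Return the response for the user's true cell."""
        if cell not in self.cache:
            # Cache miss: build a path that joins existing cached paths at
            # both ends, then query the app for the whole path at once.
            back = self._extend(cell, (-heading[0], -heading[1]))
            fwd = self._extend(cell, heading)
            predicted = back[::-1] + [cell] + fwd
            self.exposed.append(predicted)
            for c in predicted:
                self.cache[c] = self.app(c)
        # Cache hit (or freshly filled): the app learns nothing new here.
        return self.cache[cell]
```

If two north-south paths are already cached at x=0 and x=5, a user who turns east at (1, 3) triggers one batch of queries covering (1, 3) through (4, 3). The application sees a segment joining the two known paths, while later requests along that segment are served from cache without contacting the application again.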
Benefits • Real-time • Response ready when user arrives at predicted location • High QoL • Responses can be specific to location • Overhead on the wired backbone (caching helps) • Entropy guarantees • Entropy increases at traffic intersections • In low regions, desired entropy possible via false branching • Sparse population • Can be handled with dummy users
Quantifying Privacy • City converted into grid of small squares (pixels) • Users are located at a pixel at a given time • Each pixel associated with 8x8 matrix • Element (x, y) = probability that user enters x and exits y • Probabilities diffuse • At intersections • Over time • Privacy = entropy • [Figure: a pixel with x and y marked]
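To make "privacy = entropy" concrete, here is a minimal sketch that computes Shannon entropy in bits over the pixels where a user might be. The slide does not spell out the formula, so the choice of Shannon entropy, the function names, and the step of summing each pixel's 8x8 matrix into a single presence probability are assumptions.

```python
import math


def pixel_probability(matrix):
    """Total probability that the user is in a pixel.

    Assumption: summing all (enter, exit) entries of the pixel's 8x8
    matrix gives the probability of presence in that pixel.
    """
    return sum(sum(row) for row in matrix)


def entropy_bits(pixel_probs):
    """Shannon entropy (bits) of the user's presence across pixels."""
    total = sum(pixel_probs)
    if total <= 0:
        raise ValueError("need positive probability mass")
    entropy = 0.0
    for p in pixel_probs:
        if p > 0:
            q = p / total  # normalize so the distribution sums to 1
            entropy -= q * math.log2(q)
    return entropy
```

Under this definition, a user known to be in one pixel has 0 bits of entropy, and a user equally likely to be in any of four pixels has 2 bits: the adversary's uncertainty is equivalent to a uniform guess among 2^2 locations.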
Diffusion • Probability of user’s presence diffuses • Diffusion gradient computed based on history • e.g., what fraction of users take a right turn at this intersection • [Figure: diffusion at a road intersection at times t1, t2, t3]
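The diffusion step can be illustrated with a small sketch that splits probability mass at an intersection according to historical turn counts and then measures the resulting entropy. The function names and the turn counts are made up for illustration, and Shannon entropy is again an assumed definition.

```python
import math


def diffuse(mass, turn_counts):
    """Split probability mass across outgoing roads at an intersection.

    `turn_counts` maps each outgoing road to how many past users took it
    (hypothetical history); the mass is divided in the same proportions.
    """
    total = sum(turn_counts.values())
    return {road: mass * n / total for road, n in turn_counts.items()}


def entropy_bits(probs):
    """Shannon entropy in bits; zero-probability entries are skipped."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


# Before the intersection the user is on one road with certainty.
before = {"approach": 1.0}
# Illustrative history: 25 went left, 50 straight, 25 right.
after = diffuse(before["approach"], {"left": 25, "straight": 50, "right": 25})
gain = entropy_bits(after.values()) - entropy_bits(before.values())
```

With these illustrative counts the mass splits into 0.25, 0.5, and 0.25, so `gain` is 1.5 bits. A more lopsided history (say, nearly everyone going straight) would yield a smaller gain, since the adversary's best guess stays good.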
Evaluation • Trace-based simulation • VanetMobiSim + US Census Bureau trace data • Durham map with traffic lights, speed limits, etc. • Vehicles follow Google map paths • Performs collision avoidance • [Figure: 6 km x 6 km map, 10 m x 10 m pixels, 1000 cars]
Results • High average entropy • Quite insensitive to user density (good for sparse regions) • Minimum entropy reasonably high
Results • Per-user entropy • Increases quickly over time • No user starves of location privacy
Issues and Limitations • CacheCloak overhead • Application replies to lots of queries • However, overhead on wired infrastructure • Caching reduces this overhead significantly • CacheCloak assumes same, indistinguishable query • Different queries can deanonymize • Need more work • Per-user privacy guarantee not yet supported • Adaptive branching & dummy users
Closing Thoughts Two nodes may intersect in space but not in time Mixing not possible, without sacrificing timeliness Mobility prediction creates space-time intersections Enables virtual mixing in future
Closing Thoughts CacheCloak Implements the prediction and caching function Significant entropy attained even under sparse population Spatio-temporal accuracy remains uncompromised
Final Take Away Chasing a car is easier on highways … Much harder in Manhattan crossroads CacheCloak tries to turn a highway into a virtual Manhattan … Well, sort of …
Thank You For more related work, visit: http://synrg.ee.duke.edu
Emerging trends in content distribution • Content delivered to a location / context • As opposed to a destination address • Thus, “location” is a key driver of content delivery IP address : Internet = Location : CDN • New wave of applications
Location Privacy • Problem: Continuous location exposure deprives user of her privacy.
Location Frequency • Some location apps are reactive / infrequent • E.g., List Greek restaurants around me now (PULL) • But, many emerging apps are proactive • E.g., Phone detected at Starbucks, PUSH a coffee coupon • Proactive apps require continuous location • Opportunity for Big Bro to track you over space and time
Categorizing Apps • Some location apps are reactive • You ask, App answers • E.g., Pull all Greek restaurants around your location • But, many emerging apps are proactive • E.g., Phone detected at Starbucks, PUSH a coffee coupon • Proactive apps require continuous location