220 likes | 357 Views
Query Processing in WSN: The Dilemma of Suppressions and Failures. Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento Communication Networks and Database Systems Computing Science Department Graduate Student’s Workshop on Network’s Research, Nov 17, 2009.
E N D
Query Processing in WSN: The Dilemma of Suppressions and Failures Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento Communication Networks and Database Systems Computing Science Department Graduate Student’s Workshop on Network’s Research, Nov 17, 2009
In-Network Query Processing in WSN In-network Processing Query Result Decision Support system
Background • Queries • Variations of Top-k (Min, Max, Avg.) • Nearest Neighbor • Join queries • Techniques • Exploit the wireless broadcast nature of WSN • Aggregation, Pruning, Filtering …. • Schedules • Sensors should not be in ‘idle listening mode” • Contentious our slotted time • Failures • Unreliable communication • ?
Top-k Query : Problem Definition Given a set of N sensors, S = {si : i =1, 2,….., N} Sp,jbe the set of sensors that produced the pth highest value during the jthround v(Sp,j) be the pth highest value The problem is to find: Top-k values: Di = {v(Sp,j): p =1, 2,….., k} Top-k sensors: Sp,j
Top-k Query Processing Sink = Root A Shortest Path Tree (SPT)
Convergecast + Aggregation = TAG* Sink = Root A *TAG, Madden et. al., OSDI’02, 2002
Convergecast + Aggregation Sink = Root A Total # of messages used = n-1
Convergecast + Aggregation + Filtering Sink = Root A τ (threshold) = min of top-k
Threshold (τ) Broadcast Sink = Root A
Convergecast + Aggregation + Filtering Sink = Root A send iff v(si) ≥ τ; or if top-k node
Broadcast via SPT Sink = Root A Total # of messages used = 8
Broadcast via DST Sink = Root A Dominating Set Tree (DST) Total # of messages used = 4
Some Results Varying k Tree topology makes a difference EXTOK performs better on both synthetic and Intel dataset Synthetic data Real data
Convergecast Scheduling Sink = Root A TDMA slots => Precedence Constraints
Convergecast Scheduling Sink = Root A 7 6 8 5 4 5 3 2 3 1 TDMA slots => Conflict Free + Precedence Constraints
Some Results Varying Ψ (node density) SDA, Chen et. al, LNCS, vol. 3799, 2005 SAS, Wan, et. al., MOBIHOC, 2009 DAS, B. Yu et. al., INFOCOM, 2009 PAS, X. Yu et. al., SSDB, 2007 Synthetic data Real data
Convergecast Scheduling + Filtering Sink = Root A 7 6 8 5 4 5 3 2 3 1 send iff v(si) ≥ τ; or if top-k node
Convergecast Scheduling + Filtering + Failure Sink = Root A 7 6 8 5 4 5 3 2 3 1 send iff v(si) ≥ τ; or if top-k node
Convergecast Scheduling + Failure Recovery Sink = Root A 7 6 8 5 4 5 3 2 3 1
Some Results Application Perspective Just Brodcast
Conclusions • Infrastructure • Clusters and Routing Trees • Dominating Sets • Techniques • Aggregation, Pruning, and Filtering • How to use them in the best possible way for a particular problem ? • Can we use one single infrastructure for every thing ? • Schedules • Every task must be done in a systematic fashion while minimizing the response time • Failures • What happens when parts of our infrastructure breaks down ? • How and When to fix these problems ?
Acknowledgements • This research is partially supported by NSERC, i-Core, FGSR, VLDB, GSA, and Walter John Scholarship. • Contact baljeet@cs.ualberta.ca for more details. THANK YOU !