Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento

Query Processing in WSN: The Dilemma of Suppressions and Failures Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento Communication Networks and Database Systems Computing Science Department Graduate Student’s Workshop on Network’s Research, Nov 17, 2009

In-Network Query Processing in WSN In-network Processing Query Result Decision Support system

Background • Queries • Variations of Top-k (Min, Max, Avg.) • Nearest Neighbor • Join queries • Techniques • Exploit the wireless broadcast nature of WSN • Aggregation, Pruning, Filtering …. • Schedules • Sensors should not be in ‘idle listening mode” • Contentious our slotted time • Failures • Unreliable communication • ?

Top-k Query : Problem Definition Given a set of N sensors, S = {si : i =1, 2,….., N} Sp,jbe the set of sensors that produced the pth highest value during the jthround v(Sp,j) be the pth highest value The problem is to find: Top-k values: Di = {v(Sp,j): p =1, 2,….., k} Top-k sensors: Sp,j

Top-k Query Processing Sink = Root A Shortest Path Tree (SPT)

Convergecast + Aggregation = TAG* Sink = Root A *TAG, Madden et. al., OSDI’02, 2002

Convergecast + Aggregation Sink = Root A Total # of messages used = n-1

Convergecast + Aggregation + Filtering Sink = Root A τ (threshold) = min of top-k

Threshold (τ) Broadcast Sink = Root A

Convergecast + Aggregation + Filtering Sink = Root A send iff v(si) ≥ τ; or if top-k node

Broadcast via SPT Sink = Root A Total # of messages used = 8

Broadcast via DST Sink = Root A Dominating Set Tree (DST) Total # of messages used = 4

Some Results Varying k Tree topology makes a difference EXTOK performs better on both synthetic and Intel dataset Synthetic data Real data

Convergecast Scheduling Sink = Root A TDMA slots => Precedence Constraints

Convergecast Scheduling Sink = Root A 7 6 8 5 4 5 3 2 3 1 TDMA slots => Conflict Free + Precedence Constraints

Some Results Varying Ψ (node density) SDA, Chen et. al, LNCS, vol. 3799, 2005 SAS, Wan, et. al., MOBIHOC, 2009 DAS, B. Yu et. al., INFOCOM, 2009 PAS, X. Yu et. al., SSDB, 2007 Synthetic data Real data

Convergecast Scheduling + Filtering Sink = Root A 7 6 8 5 4 5 3 2 3 1 send iff v(si) ≥ τ; or if top-k node

Convergecast Scheduling + Filtering + Failure Sink = Root A 7 6 8 5 4 5 3 2 3 1 send iff v(si) ≥ τ; or if top-k node

Convergecast Scheduling + Failure Recovery Sink = Root A 7 6 8 5 4 5 3 2 3 1

Some Results Application Perspective Just Brodcast

Conclusions • Infrastructure • Clusters and Routing Trees • Dominating Sets • Techniques • Aggregation, Pruning, and Filtering • How to use them in the best possible way for a particular problem ? • Can we use one single infrastructure for every thing ? • Schedules • Every task must be done in a systematic fashion while minimizing the response time • Failures • What happens when parts of our infrastructure breaks down ? • How and When to fix these problems ?

Acknowledgements • This research is partially supported by NSERC, i-Core, FGSR, VLDB, GSA, and Walter John Scholarship. • Contact baljeet@cs.ualberta.ca for more details. THANK YOU !

Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento

Baljeet Malhotra Supervisors Ioanis Nikolaidis Mario A. Nascimento

Presentation Transcript

SUPERVISORS

Mario

MARIO

Nascimento et al. A Summary

Supervisors

N. Vassiliadis, A. Chormoviti, N. Kavvadias, S. Nikolaidis

Mario

Alexandru Coman, Mario A. Nascimento, Jorg Sander CIKM 2005

Supervisors

SUPERVISORS