1 / 29

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces. Jian Pei # Wen Jin # Martin Ester # Yufei Tao + # Simon Fraser University, Canada + City University of Hong Kong. VLDB’05 at Trondheim, Let’s Go! .

totie
Download Presentation

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces Jian Pei# Wen Jin# Martin Ester# Yufei Tao+ # Simon Fraser University, Canada + City University of Hong Kong

  2. VLDB’05 at Trondheim, Let’s Go! • Flights to Trondheim? Price, travel-time and # stops all matter! • A (long) list of all feasible flights? • It is boring to review many flights • A better idea: presenting only some selected flights – how? • Vancouver  Seattle  Munich  London  Oslo  Trondheim, $7200, 38 hours, 4 stops (bad) • Vancouver  Amsterdam  Trondheim, $2200, 14 hours, 1 stops (good) • Vancouver  Amsterdam  Oslo  Trondheim $1600, 18 hours, 2 stops (also good) • Only the skyline routes are interesting – all possible trade-offs among price, travel-time and # stops superior to the others J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  3. Domination and Skyline • A set of objects S in an n-dimensional space D=(D1, …, Dn) • D1, …, Dn are in the domain of numbers • Can be extended to other domains • For u, vS, u dominates v if u.Di ≤ v.Di for 1 ≤ i ≤ n, and on at least one dimension Dj, u.Dj< v.Dj • u  S is a skyline object if u is not dominated by any other objects in S J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  4. Finding the Skyline in Full Space • Many existing methods • Divide-and-conquer and block nested loops by Borzsonyi et al. • Sort-first-skyline (SFS) by Chomicki et al. • Using bitmaps and the relationships between the skyline and the minimum coordinates of individual points, by Tan et al. • Using nearest-neighbor search by Kossmann et al. • The progressive method by Papadias et al. J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  5. Full Space Skyline Is Not Enough! • Skylines in subspaces • Mr. Richer does not care about the price, how can we derive the superior trade-offs between travel-time and number of stops from the full space skyline? • Sky cube – computing skylines in all non-empty subspaces (Yuan et al., VLDB’05) • Any subspace skyline queries can be answered (efficiently) J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  6. Even Sky Cube May Not Be Enough! • Understanding skyline objects • Both Wilt Chamberlain and Michael Jordan are in the full space skyline of the Great NBA Players, which merits, respectively, really make them outstanding? • How are they different? • Finding the decisive subspaces – the minimal combinations of factors that determine the (subspace) skyline membership of an object? • Total rebounds for Chamberlain, (total points, total rebounds, total assists) and (games played, total points, total assists) for Jordan J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  7. Intuition • a, b and c are in the skyline of (X, Y) • Both a and c are in some subspace skylines • b is not in any subspace skyline • d and e are not in the skyline of (X, Y) • d is in the skyline of subspace X • e is not in any subspace skyline • Why and in which subspaces is an object in the skyline? J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  8. Observations • Is subspace skyline membership monotonic? • x is in the skylines in spaces ABCD and A, but it is not in the skyline in ABD – it is dominated by y in ABD • x and y collapse in AD, x and y are in the skylines of the same subspaces of AD J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  9. Coincident Groups • How to capture groups of objects that share values in subspaces? • (G, B) is a coincident group (c-group) if all objects in G share the same values on all dimensions in B • GB is the projection • A c-group (G, B) is maximal if no any further objects or dimensions can be added into the group • Example: (xy, AD) J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  10. C-Group Lattices • All coincident groups form a lattice (c-group lattice) • All maximal c-groups form a lattice (maximal c-group lattice) • Maximal c-group lattices are quotient lattices of c-group lattice • Where are the (multidimensional) skyline objects in the (maximal) c-group lattice? • Are they also in some good structure? J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  11. Skyline Groups • A maximal c-group (G, B) is a skyline group if GB is in the subspace skyline of B • How to characterize the subspaces where GB is in the skyline? • (x, ABCD) is a skyline group • If the set of subspaces are convex, we can use bounds J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  12. Decisive Subspaces • A space CB is decisive if • GC is in the subspace skyline of C • No any other objects share the same values with objects in G on C • C is minimal – no C’C has the above two properties • (x, ABCD) is a skyline group, AC, CD are decisive J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  13. Semantics • Problem: In which subspaces an object or a group of objects are in the skyline? • The skyline membership of skyline groups are established by their decisive subspaces • For skyline group (G, B), if C is decisive, then G is in the skyline of any subspace C’ where CC’B • Signature of skyline group Sig(G, B)=(GB, C1, …, Ck) where C1, …, Ck are all decisive subspaces J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  14. Example • The skyline membership of an object is determined by the skyline groups in which it participates • An object u is in the skyline of subspace C if and only if there exists a skyline group (G, B) and its decisive subspace C’ such that uG and C’CB J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  15. Subspace Skyline Analysis • All skyline projections form a lattice (skyline projection lattice) • A sub-lattice of the c-group lattice • All skyline groups form a lattice (skyline group lattice) • A quotient lattice of the skyline projection lattice • A sub-lattice of the maximal c-group lattice J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  16. Relationship Among Lattices quotient C-group lattices Maximal c-group lattices sub-lattice sub-lattice quotient Skyline projection lattices Skyline group lattices J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  17. OLAP Analysis on Skylines • Subspace skylines • Relationships between skylines in subspaces • Closure information J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  18. Full Space vs. Subspace Skylines • For any skyline group (G, B), there exists at least one object uG such that u is in the full space skyline • Can use u as the representative of the group • An object not in the full skyline can be in some subspace skyline only if it collapses to some full space skyline objects • All objects not in the full space skyline and not collapsing to any full space skyline object can be removed from skyline analysis • If only the projections are concerned, only the full space skyline objects are sufficient for skyline analysis J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  19. Subspace Skyline Computation • Compute the set of skyline groups and their signatures • Top-down enumeration of subspaces • Similar ideas in skyline cube computation • For each subspace, find skyline groups and decisive subspaces • Find (subspace) skylines by sorting • Share sorting and use merge-sorting as much as possible J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  20. Enumerating Subspaces • Using a top-down enumeration tree • Each child explores a proper subspace with one dimension less • All objects not in the skyline of the parent subspace and not collapsing to one skyline object of the parent subspace can be removed J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  21. Computing Skylines by Sorting • Sort all objects in lexicographic ascending order • a-d-b-e-c • Check objects in the sorted list, an object is in the skyline if it is not dominated by any skyline objects before it in the list • {a, b, c} are skyline objects J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  22. Efficient Local Sorting • Not necessary to sort for each subspace • A sorted list in subspace (A, B, C, D) can be used in subspaces (A), (A, B), (A, B, C) • To generate a sorted list in subspace (B, C, D), we can use merging sort to merge the sublists of different values on A • If a non-skyline object collapses to a skyline object, the skyline object “absorbs” the non-skyline object by taking the non-skyline object’s id • A non-skyline object may be “absorbed” by multiple skyline objects • Recursively reduce the number of objects and shorten the sorted lists J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  23. Results on Great NBA Players’ • 17,266 records • 4 attributes are selected • 67 skyline records in the full space, 146 decisive subspaces J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  24. # Skyline Groups vs. Dimensionality • Dimensionality: the complexity of subspaces • A 1-d subspace has only one skyline group • A high-dimensional subspace many have many skyline groups • # skyline groups tends to increase when dimensionality increases • Number of subspaces • An n-d data set has n 1-d subspaces, 1 n-d (sub-)space, and n!/[(n/2)!(n/2)!] n/2-d subspaces (if n is even) • The number of skyline groups in subspaces of dimensionality k depends on the joint-effect of the two factors • When k < n/2, the two factors are consistent • When k > n/2, the two factors are contrasting J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  25. About the Synthetic Data Sets • Independent: attribute values are uniformly distributed • Correlated: if a record is good in one dimension, likely it is also good in others • Anti-correlated: if a record is good in one dimension, it is unlikely to be good in others J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  26. Scalability w.r.t Database Size Independent Correlated Anti-correlated J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  27. Scalability w.r.t. Dimensionality J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  28. Conclusions • Skyline analysis is important in many applications • Only skyline objects in the full space may not be enough • Skyline cube is powerful to answer subspace skyline queries • But it is interesting to ask why an object is in the subspace skylines, and more • Skyline groups and decisive subspaces – capturing the semantics of subspace skylines • OLAP subspace skyline analysis • An efficient algorithm to compute skyline groups J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

  29. Thank You! Hong Kong http://lambcutlet.org/gallery/Day_6/Hong_Kong_Island_ skyline_on_a_cloudy_night_around_Central Vancouver, BC, Canada http://members.virtualtourist.com/m/822f5/dc80f/ Trondheim, Norway By Gerold Jung J. Pei, W. Jin, M. Ester, and Y. Tao: Catching the Best Views of Skyline

More Related