260 likes | 271 Views
Explore a novel approach for efficient kNN queries on road networks using SPIE trees and nd Indexing. The method simplifies network topology and boosts performance compared to traditional methods. Experimental results demonstrate significant improvements.
E N D
Fast Nearest Neighbor Search on Road Networks Haibo Hu, Dik Lun Lee, and Jianliang Xu Hong Kong Univ. of Science & Technology Hong Kong Baptist University
About Myself • Sagar Uplanchiwar • MS Computer Science • Graduating Dec 2008
Presentation Outline • Problem • Existing Solutions • Motivation for new work • Network Reduction, SPH, SPIE • nd (nearest descendants) Index • Updates • Cost Models • Performance • Conclusions
Existing Solutions • Voronoi • Dijkstra’s
Motivation • Voronoi • Unwieldy for denser/vast data • Dijkstra’s • Too many node visits on large/sparser data
Network Reduction • Objectives • Reduce the number of edges while preserving network distances • Replace complex graph topology with simpler structures (trees).
Network ReductionThe Elements of reduction • Shortest Path Trees (SPT) • Distance between root and other nodes is minimized
Network ReductionThe Elements of reduction • Are Shortest Path Tree (SPT) networks inefficient for road networks? • Degree of vertices in a road network are typically >= 3. • The length of the shortest circuits are still usually long • These reasons justify the reduction of road networks to SPT pieces
SPH • SPH means Shortest Path Trees with Horizontal Edges Specified to reduce number of connected trees • Like SPT but with another condition • Allow sibling-sibling connections (horizontal edges) within trees
SPIE • An SPIE is an SPH with another condition • SPIE SPH with ‘Triangular Inequality’ Edges Shortest Path between two nodes in a tree is guaranteed to contain exactly one horizontal edge between ancestors of the two nodes
nd Index – nearest descendant • Very simple operation • For every node in the tree, extract the nearest descendant data node (point of interest) down the tree representation of the road.
Updates • Node Insertion • Insert into SPIE containing adjacent node • Node Deletion • Rebuild local SPIE • Edge Insertion/Deletion non-trivial depending on specifics of the edge, but is still relatively inexpensive • Edge re-weighting is like above • Data Point Insertion/Deletion only requires change of nd Index of local SPIE tree
Cost Models • I will just provide an overview of insights • Even in a 2D uniform grid (city blocks) there is still a 25% benefit by the reduction model • Nearest Neighbor search by traditional means is exponential while SPIE NN search is linear to the average distance from a node to a NN • Number of node accesses in nd index is much less than in the traditional approach
Performance • Experimented with the algorithms on two sets of data • Artificial network with ~180K nodes, exponential distribution of node degrees, edge weights random 1 through 10 • Digital Chart of the World (DCW) containing ~600K railroads and roads in the Americas. ~400K nodes • Test system: C++ on Win32 platform, 2.4 Ghz P4, 512 MB RAM, 4Kb page size
PerformanceNetwork Reduction • With ~430K nodes, only 1571 SPIEs made
Performancend Index Construction • p represents density of random datasets • Ignores one-time construction of SPIE graph • ~8 MB, created in ~300 seconds • Almost constant construction time of nd Index
PerformanceNN Search Result • From average of 2000 trials
PerformanceKNN Search Result • For p=0.01 dataset on real road network
PerformanceSummary • Network Reduction and nd Indexing • Simplify network topology in a decent one-time cost • Create light-weight (CPU and mem) nd Index • Perform well on (k)NN queries of varying data • Perform well on kNN for various k values
Conclusions • Overview • New network kNN search technique created • Reduction of network to a set of interconnected tree structures (SPIE) • nd index created per SPIE to make kNN search on SPIE follow predetermined path, and faster • Cost Models and Experimental Results both show improvement upon network-expansion (Dijkstra’s) and solution-based (Voronoi) system for most network topologies and distributions • Future plans are to redesign structure in place of SPIE trees