Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web

Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web Yu Zheng, Like Liu, Xing Xie yuzheng@microsoft.com Microsoft Research Asia

Outline • Introduction • Framework • Methodology • Experiment • Conclusion & future work

Background Percentage of GPS-enabled handset among mobile phone (Gartner Dataqueste: Forecast: GPS-enabled device 2004-2011)

Introduction • What we do: Infer transportation modes from users’ GPS logs GPS log Infer model

Introduction • Motivation • Differentiate GPS trajectory of different transportation modes • Learning knowledge from raw GPS data • enable people to absorb more knowledge from others’ life experience • Trigger people’s memory about their past • Understand people’s life pattern • Understanding user behavior • Context-aware computing • Modeling traffic condition • Discover social pattern • … • Difficulty • A trajectory may contain more than two kinds of transportation modes • Pure velocity-based method may suffer from congestion

Introduction Distribution of mean velocity (m/s) of different transportation modes Distribution of maximum velocity (m/s) of different transportation modes

Introduction • Contributions • We propose • A change point-based segmentation method • An inference model based on supervised learning • A post-processing algorithm based on conditional probability • Significance • A step toward mining knowledge from raw GPS data for geographic applications on the Web • A step toward understanding user behavior based on GPS data • Evaluation results • Large-scale data collected by 45 people over a period of 6 months • Almost 70 percent accuracy

Framework • Preliminary

Framework • Inference strategy

Framework • Post-Processing Segment[i].P(Bike) = Segment[i].P(Bike) * P(Bike|Car) Segment[i].P(Walk) = Segment[i].P(Walk) * P(Walk|Car)

Framework • CRF-Based Inference

Methodology Transition matrix of transportation modes • Commonsense knowledge from real world • Typically, people need to walk before transferring transportation modes • Typically, people need to stop and then go when transferring modes

Methodology • Change point-based Segmentation Algorithm • Step 1: distinguish all possible Walk Points, non-Walk Points. • Step 2: merge short segment composed by consecutive Walk Points or non-Walk points • Step 3: merge consecutive Uncertain Segment to non-Walk Segment. • Step 4: end point of each Walk Segment are potential change points

Experiments • Framework of experiment • Feature Extraction • length • mean velocity • expectation of velocity • variance of velocity • top three velocities • top three accelerations

Experiment • Devices • Data

Experiment • Evaluation method • Precision of inference a segment • Accuracy by Length • Accuracy by Duration • Change Point • Precision of change point • Recall of change point

Experiment: Result • Inference performance Inferring accuracy of transportation mode over change point-based segmentation method

Experiment • Inference performance of change point Recall of change point using change point based segmentation method Precision of change point using change point based segmentation method

Experiment: Result Comparison of different segmentation methods using Decision Tree

Experiment: Result Comparison of inference results of CRF over different segmentation methods

Conclusion Segmentation method Inference method SVM Change Point based Bayesian Net Uniform Duration based Decision Tree Uniform Length based CRF

Future work • Identify more valuable features • Location-constraint conditional probability • Improving prediction performance of CRF-based approach

Thanks! Q&A Yu Zheng @ Microsoft

Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web

Learning Transportation Mode from Raw GPS Data for Geographic Applications on the Web

Presentation Transcript

The Railroad Mode of Transportation

Mode Transportation

GPS Applications for Precision Agriculture

GPS Applications

Data in Web Applications

Raw Data

SilverLine: Preventing Data Leaks from Compromised Web Applications

Climate Applications of the Precipitable Water Data from Ground-based GPS Measurements

Web Mining: Machine Learning for Web Applications

Learning From the Data

Raw Data

A Geographic Knowledge Base for Semantic Web Applications

From Raw Data to Physics Results

Introduction to RINEX, GPS Raw Data

Tracking: From Raw Data to Analysis

Deducing Mode and Purpose from GPS Data

RAW DATA

From raw data to rich data

Framework for Raw Data

Raw Data for MUON