290 likes | 310 Views
Explore spatial time series anomaly detection, problem formulation, detection methods, outbreak scenarios, application in agriculture, and conclusions of Dr. Hesam Izakian's October 2014 study. Learn about structure, clustering, anomaly visualization, and more.
E N D
Cluster-Centric Anomaly Detection and Characterization in Spatial Time Series Dr. Hesam Izakian October 2014
Outline • Spatial time series • Problem formulation • Anomaly detection in spatial time series- questions • Overall scheme of the proposed method • Time series segmentation • Spatial time series clustering • Assigning anomaly scores to clusters • Visualizing the propagation of anomalies • An outbreak detection scenario • Application • Conclusions
Spatial time series • Structure of data • A set of spatial coordinates • One or more time series for each point • Examples • Daily average temperature in different climate stations • Stock market indexes in different countries • Number of absent students in different schools • Number emergency department visits in different hospitals • Measured signals in different parts of brain
Problem formulation There are N spatial time series Objective: Find a spatial neighborhood of data In a time interval Containing a high level of unexpected changes
Anomaly detection in spatial time series- questions • Spatial neighborhood of data • Size of neighborhood • Overlapping neighborhoods • Unexpected changes (anomalies) • What kind of changes are expected/not expected • How to evaluate the level of unexpected changes • Anomaly visualization • Anomaly characterization • What was the source of anomaly • How the anomaly is propagated over time
Overall scheme of the proposed method Spatial time series data Spatial time series data Sliding window Anomaly scores Spatial time series clustering Fuzzy relations Revealing the structure of data in various time intervals Comparing the revealed structures
Time series part segmentation • Sliding window • Spatio-temporal subsequences • Local view of time series part
Overall scheme of the proposed method Spatial time series data Sliding window Anomaly scores Spatial time series clustering Fuzzy relations Revealing the structure of data in various time intervals Comparing the revealed structures
Fuzzy C-Means clustering… • Partitions N data • Into clusters • Result: • Objective function: • Minimization:
Spatial time series clustering • Reveals available structure within data • In form of partition matrices • Challenges • Different sources: Spatial part vs. temporal part • Different dimensionality in each part • Different structure within each part
Spatial time series clustering… • In spatial time series, we define • Adopted FCM objective function • Characteristics • When λ=0: Only spatial part of data in clustering • A higher value of λ : a higher impact of time series part in clustering • Optimal value of λ: Optimal impact of each part in clustering
Overall scheme of the proposed method Spatial time series data Sliding window Anomaly scores Spatial time series clustering Fuzzy relations Revealing the structure of data in various time intervals Comparing the revealed structures
Assigning anomaly scores to clusters in different time windows Assign an anomaly score to each single subsequence based on historical data Aggregating anomaly scores inside revealed clusters
Overall scheme of the proposed method Spatial time series data Sliding window Anomaly scores Spatial time series clustering Fuzzy relations Revealing the structure of data in various time intervals Comparing the revealed structures
Visualizing the propagation of anomalies- Fuzzy relations • Objective: quantifying relations between clusters
Visualizing the propagation of anomalies… • Objective function to construct relation • Optimization
Example • An outbreak • In southern part of Alberta • Using NAADSM for 100 days
Example… • A sliding window is used • Length : 20 • Movement: 10 • Generated spatio-temporal subsequences:
Application • Implemented for Agriculture and Rural Development (Government of Alberta) • Using KNIME (Konstanz Information Miner) • Animal health surveillance in Alberta • Anomaly detection • Data visualization
Conclusions • A framework for anomaly detection and characterization in spatial time series is developed • A sliding window to generate a set of spatio-temporal subsequences is considered • Clustering is used to discover the available structure within the spatio-temporal subsequences • An anomaly score assigned to each revealed spatio-temporal cluster • A fuzzy relation technique is proposed to quantify the relations between clusters in successive time steps