1 / 12

Mining Time-Series Databases

Learn about the extraction of nontrivial knowledge from time-series data and its importance in artificial intelligence, computer science, statistics, and information retrieval.

coxk
Download Presentation

Mining Time-Series Databases

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Mining Time-Series Databases

  2. Data Mining I – Introduction • The extraction of nontrivial, implicit and usefulknowledge from the data Data Knowledge Data Mining • Artificial Intelligence • Computer Science • Statistics • Information Retrieval

  3. Data Mining goals I – Introduction To find “structure” in the large amount of information available from different sources To organize the data To identify patterns that translate into new understandings and viable predictions To discover relationships between data and phenomena that ordinary operations and routine analysis would otherwise overlook

  4. Time Series I – Introduction • People measure things: • Oil price • Sócrates popularity • Blood pressure, etc. and things change over time, creating a time series

  5. Introduction • A Time-Series Database is a database that contains data for each point in time. • Examples: • Weather Data • Stock Prices

  6. What to Mine? • Full Periodic Patterns • Every point in time contributes to the cyclic behavior of the time-series for each period. • e.g., describing the weekly stock prices pattern considering all the days of the week. • Partial Periodic Patterns • Describing the behavior of the time-series at some but not all points in time. • e.g., discovering that the stock prices are high every Saturday and small every Tuesday.

  7. Time Series definition I – Introduction A (numeric) time series is a sequence of observations of a numeric property over time

  8. Motivation to Work in Time Series I – Introduction • Time series are ubiquitous • Most of the information (data) produced in a variety of areas are time series • e.g. about 50% of all newspaper graphics are time series • Other types of data can be converted to time series Image from E. J. Keogh. A decade of progress in indexing and mining large time series databases. In VLDB, page 1268, 2006.

  9. Time Series Examples I – Introduction Images from a variety of papers by E. J. Keogh. Available at: www.cs.ucr.edu/~eamonn motion data physiology (muscle activation) electroencephalogram sensors historical archives ECG

  10. Time Series Examples (cont.) I – Introduction Image from E. J. Keogh. A decade of progress in indexing and mining large time series databases. In VLDB, page 1268, 2006. sales stocks data goods consumption animal ECG images motion capture handwritten character recognition DNA sequences

  11. Time Series data characteristics I – Introduction • Analysis is hard, as we are typically dealing with massive data-sets: • One hour EEG: 1 GB of data • Typical weblog: 5 GB / week • MACHO database: 5 TB (growing 3 GB a day) • Stanford Linear Accelerator database: 500 TB • Quadratic complexity algorithms are insufficient • The data also present some distortions (noise, scaling effects, etc.) that make the analysis more difficult

  12. Time Series Data Mining Tasks I – Introduction Image from E. J. Keogh. A decade of progress in indexing and mining large time series databases. In VLDB, page 1268, 2006.

More Related