290 likes | 483 Views
On-line Learning From Streaming Data. ACM CIKM October 31, 2013 Jeff Hawkins jhawkins@GrokSolutions.com. Industrial Research Track. 1) Discover operating principles of neocortex. Anatomy, Physiology. Theoretical principles. Software. Build systems based on these principles.
E N D
On-line Learning From Streaming Data ACM CIKMOctober 31, 2013 Jeff Hawkins jhawkins@GrokSolutions.com
Industrial Research Track 1) Discover operating principles of neocortex Anatomy,Physiology Theoreticalprinciples Software • Build systems based on these principles Cortical algorithms Anomaly detection in high velocity data
The neocortex is a memory system. The neocortex learns a model from sensory data - predictions - anomalies - actions retina data stream cochlea somatic The neocortex learns a sensory-motor model of the world
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea somatic data stream
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic data stream
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic 3) Sequence memory - inference - motor data stream
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic 3) Sequence memory data stream 4) Sparse Distributed Representations
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic 3) Sequence memory data stream 4) Sparse Distributed Representations 5) All regions are sensory and motor Motor
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic x x x 3) Sequence memory data stream x x x x 4) Sparse Distributed Representations x x x x x x 5) All regions are sensory and motor 6) Attention
Principles of Neocortical Function retina 1) On-line learning from streaming data cochlea 2) Hierarchy of memory regions somatic 3) Sequence memory data stream 4) Sparse Distributed Representations 5) All regions are sensory and motor 6) Attention These six principles are necessary and sufficientfor biological and machine intelligence. - All mammals from mouse to human have them
Dense Representations • Few bits (8 to 128) • All combinations of 1’s and 0’s • Example: 8 bit ASCII • Individual bits have no inherent meaning • Representation is assigned by programmer Sparse Distributed Representations (SDRs) 01101101 = m • Many bits (thousands) • Few 1’s mostly 0’s • Example: 2,000 bits, 2% active • Each bit has semantic meaning • Meaning of each bit is learned, not assigned 01000000000000000001000000000000000000000000000000000010000…………01000
1) Similarity: shared bits = semantic similarity A Few SDR Properties 2) Store and Compare: store indices of active bits Indices12345|40 subsampling is OK Indices12|10
Sequence Memory (for inference and motor) Coincidence detectors How does a layer of neurons learn sequences?
Each cell is one bit in our Sparse Distributed Representation SDRs are formed via a local competition between cells.
Cell forms connections to subsample of previously active cells.Predicts its own future activity.
Multiple Predictions Can Occur at Once With one cell per column, 1st order memoryWe need a high order memory
High Order Sequence Memory Enabled by Columns of Cells Cortical Learning Algorithm (CLA)Distributed sequence memoryHigh order High capacity Multiple simultaneous predictions Semantic generalization
Three Current Directions • NuPIC Open Source Project
NuPIC Open Source Project www.Numenta.org Single source tree (used by GROK) GPLv3 Steady community growth • 67 contributors (+26 since July) • 245 mailing list subscribers • 1621 total messages eBook from community member OS community joining KaggleCompetitions Fall Hackathon: 70 attendees
Three Current Directions • NuPIC Open Source Project • Custom CLA Hardware • Needed for scaling research and commercial applications • DARPA “Cortical Processor” • IBM, Seagate, Sandia Labs • Commercialization
Data: Past and Future Past 1. Store data 2. Look at data 3. Build models Problem: - Doesn’t scale with velocity and #models Solution: - Automated model creation - Continuous learning - Temporal inference PredictionsAnomaliesActions Stream data Automated model creation Continuous learning Temporal inference Future
Anomaly Detection Using Predictive Cortical Models Cortical Memory Prediction Point anomaly score Time average Distribution of averages Metric anomaly score Encoder Metric 1 SDR Cortical Memory Prediction Point anomaly score Time average Distribution of averages Metric anomaly score Encoder SystemAnomalyScore ... Metric N SDR SDR
Largely predictable Metric value Anomalyscore Largely unpredictable Metric value Anomalyscore
Grok for IT Monitoring Breakthrough Science for Anomaly Detection Reinventing UX for IT Monitoring • Detects problems thresholds miss • Continuous learning • Automated model building • State-of-the art neocortical model • Smartphone-centric • Ranks anomalous instances • Rapid drill down • Continuously updated • User-controlled notifications In private beta for Amazon AWS cloud users grokbeta@GrokSolutions.com
Extensible Architecture • Custom metrics for any application/server • Web interface and mobile client source code available under no-cost license • Engine API to be published • NuPIC open source community