A Road Sign Recognition System Based on a Dynamic Visual Model

C. Y. Fang, Department of Information and Computer Education, National Taiwan Normal University, Taipei, Taiwan, R. O. C.
C. S. Fuh, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, R. O. C.
S. W. Chen, Department of Computer Science and Information Engineering, National Taiwan Normal University, Taipei, Taiwan, R. O. C.
P. S. Yen, Department of Information and Computer Education, National Taiwan Normal University, Taipei, Taiwan, R. O. C.
violet@ice.ntnu.edu.tw

Outline
• Introduction
• Dynamic visual model (DVM)
• Neural modules
• Road sign recognition system
• Experimental results
• Conclusions

Introduction -- DAS
• Driver assistance systems (DAS)
• Methods to improve driving safety
• Passive methods: seat belts, airbags, anti-lock braking systems, and so on
• Active methods: DAS
• Driving is a sophisticated process
• The better the environmental information a driver receives, the more appropriate his or her expectations will be.

Introduction -- VDAS
• Vision-based driver assistance systems (VDAS)
• Advantages:
• High resolution
• Rich information
• Road border detection or lane marking detection
• Road sign recognition
• Difficulties of VDAS
• Weather and illumination
• Daytime and nighttime
• Vehicle motion and camera vibration

Subsystems of VDAS
• Road sign recognition system
• System to detect changes in driving environments
• System to detect motion of nearby vehicles
• Lane marking detection
• Obstacle recognition
• Drowsy driver detection
• ...

Introduction -- DVM
• DVM: dynamic visual model
• A computational model for visual analysis that uses video sequences as input data
• Two ways to develop a visual model
• Biological principles
• Engineering principles
• Artificial neural networks

Dynamic Visual Model
[Figure: DVM flowchart. Labels in order: video images, data transduction, sensory component, episodic memory, information acquisition, spatial-temporal information, perceptual component, STA neural module, focuses of attention (yes/no), feature detection, categorical features, conceptual component, CART neural module, category, pattern extraction, patterns, CHAM neural module, action.]

Human Visual Process
[Figure: processing chain. Physical stimuli; transducer (data compression); sensory analyzer (low-level feature extraction); perceptual analyzer (high-level feature extraction); conceptual analyzer (classification and recognition); class of input stimuli.]

Neural Modules
• Spatial-temporal attention (STA) neural module
• Configurable adaptive resonance theory (CART) neural module
• Configurable heteroassociative memory (CHAM) neural module

STA Neural Network (1)
[Figure: two-layer STA network. Labels: output layer (attention layer) with neurons n_i, n_k and activations a_i, a_k; input layer with neuron n_j and input x_j; excitatory connection w_ij; inhibitory connection.]

STA Neural Network (2)
[Figure: the linking strengths between the input and the attention layers. Labels: Gaussian function G, attention layer, n_i, r_k, n_k, corresponding neurons, w_kj, n_j, input neuron.]
• The input to attention neuron n_i due to input stimuli x:
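The slide's formula did not survive in this transcript. As a minimal sketch of the idea the figure labels suggest (linking strengths that fall off as a Gaussian of distance), the following assumes a 1-D layer in which input neuron j corresponds to attention neuron j; the function names, the 1-D layout, and `sigma` are illustrative assumptions, not the authors' definitions.

```python
import math


def gaussian(r, sigma=1.0):
    """Gaussian falloff G(r) for distance r (sigma is an assumed width)."""
    return math.exp(-(r * r) / (2.0 * sigma * sigma))


def attention_input(x, i, sigma=1.0):
    """Input to attention neuron i from a 1-D stimulus list x.

    Assumption: input neuron j corresponds to attention neuron j, so the
    linking strength between input j and attention neuron i is G(|i - j|).
    """
    return sum(gaussian(abs(i - j), sigma) * xj for j, xj in enumerate(x))


# A single active input in the middle of a five-neuron layer.
stimulus = [0.0, 0.0, 1.0, 0.0, 0.0]
inputs = [attention_input(stimulus, i) for i in range(len(stimulus))]
```

The attention neuron directly above the stimulus receives the strongest input, and its neighbours receive symmetric, weaker inputs.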

STA Neural Network (3)
[Figure: "Mexican-hat" function of lateral interaction; axes: interaction, lateral distance.]
• The input to attention neuron n_i due to lateral interaction:
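The lateral-interaction formula is also missing from the transcript. A common way to obtain a Mexican-hat profile is a difference of two Gaussians, which is what this sketch assumes; the widths and gains (`sigma_e`, `sigma_i`, `k_e`, `k_i`) are illustrative values, not parameters taken from the slides.

```python
import math


def mexican_hat(dist, sigma_e=1.0, sigma_i=3.0, k_e=2.0, k_i=1.0):
    """Difference-of-Gaussians approximation of a Mexican-hat function.

    Nearby neurons excite each other (positive value); neurons farther
    away inhibit each other (negative value); very distant ones have
    almost no effect.
    """
    excite = k_e * math.exp(-(dist ** 2) / (2.0 * sigma_e ** 2))
    inhibit = k_i * math.exp(-(dist ** 2) / (2.0 * sigma_i ** 2))
    return excite - inhibit


def lateral_input(activations, i):
    """Input to attention neuron i from the other attention neurons (1-D layer)."""
    return sum(
        mexican_hat(abs(i - k)) * a_k
        for k, a_k in enumerate(activations)
        if k != i
    )


# One active attention neuron in the middle of the layer.
activations = [0.0, 0.0, 1.0, 0.0, 0.0]
near = lateral_input(activations, 1)  # adjacent neuron: excited
far = lateral_input(activations, 4)   # neuron two steps away: inhibited
```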

STA Neural Network (4)
• The net input to attention neuron n_i: [equation not recovered], which includes a threshold to limit the effects of noise, where 0 < d < 1

STA Neural Network (5)
[Figure: the activation of an attention neuron in response to a stimulus. Axes: stimulus and activation against time t; marked values: 1, p, pd.]

ART2 Neural Network (1): CART
[Figure: ART2 architecture. Labels: orienting subsystem, attentional subsystem, category representation field F2 (y), signal generator S, reset signal, input representation field F1 (units w, x, v, u, p, q, r and G, joined by excitatory (+) and inhibitory (-) connections), input vector i.]

ART2 Neural Network (2)
• The activities on each of the six sublayers on F1, where I is an input pattern and the Jth node on F2 is the winner:
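The equations on this slide were not captured in the transcript. For orientation, the standard textbook ART2 formulation of the six F1 sublayer activities is given below; it is assumed, not confirmed, that the slide showed this form. Here a, b, d, e and θ are the usual ART2 parameters and z_ji is the top-down weight from F2 node j to F1 unit i.

```latex
\begin{aligned}
w_i &= I_i + a\,u_i, & x_i &= \frac{w_i}{e + \lVert \mathbf{w} \rVert},\\
v_i &= f(x_i) + b\,f(q_i), & u_i &= \frac{v_i}{e + \lVert \mathbf{v} \rVert},\\
p_i &= u_i + \sum_j g(y_j)\,z_{ji}, & q_i &= \frac{p_i}{e + \lVert \mathbf{p} \rVert},
\end{aligned}
\qquad
f(x) =
\begin{cases}
x, & x \ge \theta\\
0, & \text{otherwise}
\end{cases}
\qquad
g(y_j) =
\begin{cases}
d, & j = J\\
0, & \text{otherwise}
\end{cases}
```

With the Jth node on F2 as the winner, the sum in p_i reduces to p_i = u_i + d z_Ji.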

ART2 Neural Network (3)
• Initial weights:
• Top-down weights:
• Bottom-up weights:
• Parameters:
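The values after each bullet are missing from the transcript. The textbook ART2 choices are listed below as a reference point only; whether the CART module uses exactly these is an assumption. M is the dimension of the input vector, ρ is the vigilance parameter, and J is the winning F2 node.

```latex
\begin{aligned}
\text{Initial weights:}\quad & z_{ji}(0) = 0 \ \text{(top-down)}, \qquad
  z_{ij}(0) \le \frac{1}{(1-d)\sqrt{M}} \ \text{(bottom-up)}\\
\text{Top-down weights (fast learning):}\quad & z_{Ji} = \frac{u_i}{1-d}\\
\text{Bottom-up weights (fast learning):}\quad & z_{iJ} = \frac{u_i}{1-d}\\
\text{Parameters:}\quad & a, b > 0, \quad 0 < d < 1, \quad \frac{c\,d}{1-d} \le 1, \quad
  0 < \theta < 1, \quad 0 < \rho < 1, \quad 0 < e \ll 1
\end{aligned}
```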

HAM Neural Network (1): CHAM
[Figure: two-layer HAM network. Labels: input layer with neuron j and input x_j; excitatory connection w_ij; output layer (competitive layer) with neuron i and outputs v_1, v_2, ..., v_i, ..., v_n.]

HAM Neural Network (2)
• The input to neuron n_i due to input stimuli x:
• n_c: the winner after the competition
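The formula itself is missing from the transcript. A minimal sketch of a competitive layer, assuming the input to each output neuron is the weighted sum of the stimuli and the winner n_c is the neuron with the largest input; `ham_winner` and the example weights are hypothetical.

```python
def ham_winner(weights, x):
    """Return (index, net input) of the winning output neuron.

    weights[i][j] is the excitatory weight w_ij from input j to output
    neuron i. Assumption: net input is the dot product of w_i and x, and
    the competition is a plain winner-take-all.
    """
    nets = [sum(w * xj for w, xj in zip(row, x)) for row in weights]
    c = max(range(len(nets)), key=nets.__getitem__)
    return c, nets[c]


# Three stored patterns (rows) and one input vector.
weights = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 1.0],
    [0.5, 0.5, 0.0],
]
x = [0.2, 0.9, 0.8]
winner, net = ham_winner(weights, x)
```

In a heteroassociative memory the winning index would then select the stored output pattern associated with that neuron.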

Road Sign Recognition System
• Objectives
• Obtain information about the road
• Warn drivers
• Enhance traffic safety
• Support other subsystems

Problems
• Backlighting (contrary light)
• Signs placed side by side
• Camera shaking
• Occlusion

Information Acquisition
• Color information
• Example: red color
• Shape information
• Example: red color edge
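The slides do not say how red pixels or red edges are computed. One plausible sketch, assuming an HSV hue test for "red" and a 4-neighbour boundary test for "red edge"; the thresholds and function names are illustrative assumptions.

```python
import colorsys


def is_red(r, g, b, min_sat=0.5, min_val=0.3, hue_tol=20 / 360):
    """True if an 8-bit RGB pixel is strongly red (thresholds are assumed)."""
    h, s, v = colorsys.rgb_to_hsv(r / 255, g / 255, b / 255)
    near_red = h <= hue_tol or h >= 1 - hue_tol  # hue wraps around at red
    return near_red and s >= min_sat and v >= min_val


def red_mask(image):
    """Color information: boolean mask of red pixels for a 2-D list of RGB tuples."""
    return [[is_red(*px) for px in row] for row in image]


def red_edges(mask):
    """Shape information: red pixels with at least one non-red 4-neighbour.

    Positions outside the image count as non-red.
    """
    rows = len(mask)
    cols = len(mask[0]) if rows else 0

    def red_at(r, c):
        return 0 <= r < rows and 0 <= c < cols and mask[r][c]

    steps = ((-1, 0), (1, 0), (0, -1), (0, 1))
    return [
        [mask[r][c] and not all(red_at(r + dr, c + dc) for dr, dc in steps)
         for c in range(cols)]
        for r in range(rows)
    ]


# A 3x3 red square on a grey 5x5 background.
RED, GREY = (200, 20, 20), (128, 128, 128)
image = [[RED if 1 <= r <= 3 and 1 <= c <= 3 else GREY for c in range(5)]
         for r in range(5)]
mask = red_mask(image)
edges = red_edges(mask)
```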

Results of STA Neural Module: Adding Pre-attention

Locate Road Signs: Connected Component
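The slide names connected components as the way candidate signs are located but gives no details. A minimal sketch of 4-connected component labelling that returns one bounding box per region; the `min_pixels` filter and the box format are assumptions made for illustration.

```python
from collections import deque


def connected_components(mask, min_pixels=1):
    """Return bounding boxes (top, left, bottom, right) of 4-connected regions.

    mask is a 2-D list of truthy/falsy values, e.g. the focus-of-attention
    map. Regions with fewer than min_pixels pixels are dropped.
    """
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r0 in range(rows):
        for c0 in range(cols):
            if not mask[r0][c0] or seen[r0][c0]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            seen[r0][c0] = True
            queue = deque([(r0, c0)])
            top, left, bottom, right, count = r0, c0, r0, c0, 0
            while queue:
                r, c = queue.popleft()
                count += 1
                top, bottom = min(top, r), max(bottom, r)
                left, right = min(left, c), max(right, c)
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and mask[nr][nc] and not seen[nr][nc]):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            if count >= min_pixels:
                boxes.append((top, left, bottom, right))
    return boxes


example = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 1],
]
boxes = connected_components(example)
large_only = connected_components(example, min_pixels=3)
```

Each box can then be cropped from the frame and passed on as a candidate road sign.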

Categorical Feature Extraction
• Normalization: 50 × 50 pixels
• Remove the background pixels
• Features:
• Red color horizontal projection: 50 elements
• Green color horizontal projection: 50 elements
• Blue color horizontal projection: 50 elements
• Orange color horizontal projection: 50 elements
• White and black color horizontal projection: 50 elements
• Total: 250 elements in a feature vector
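The feature layout above can be sketched as follows. The sketch assumes each pixel of the normalized 50 × 50 image has already been assigned a color class (or `None` for background), and that a horizontal projection counts the pixels of one class in each row; the class names and the row-wise reading are assumptions.

```python
COLOR_CLASSES = ("red", "green", "blue", "orange", "white_black")
SIZE = 50


def horizontal_projection(labels, color):
    """Count the pixels of one color class in each row (one value per row)."""
    return [sum(1 for c in row if c == color) for row in labels]


def feature_vector(labels):
    """Concatenate the five 50-element projections into a 250-element vector.

    labels is a SIZE x SIZE grid of color-class names; background pixels
    are None and therefore contribute to no projection.
    """
    if len(labels) != SIZE or any(len(row) != SIZE for row in labels):
        raise ValueError("expected a 50 x 50 grid of color labels")
    vec = []
    for color in COLOR_CLASSES:
        vec.extend(horizontal_projection(labels, color))
    return vec


# Toy sign: first row red, second row background, the rest white/black.
labels = ([["red"] * SIZE] + [[None] * SIZE]
          + [["white_black"] * SIZE for _ in range(SIZE - 2)])
features = feature_vector(labels)
```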

Conceptual Component: Classification Results of the CART
[Figure panels: Training Set, Test Set]

Conceptual Component: Training and Test Patterns for the CHAM

Conceptual Component: Another Set of Training Patterns for the CHAM

Experimental Results of the CHAM

Experimental Results

Other Examples

Discussion
• Vehicle and camcorder vibration
• Incorrect recognitions
[Figure columns: input patterns, recognition results, correct patterns]

Conclusions (1)
• Test data: 21 sequences
• Detection rate (CART): 99%
• Misdetection: 1% (11 frames)
• Recognition rate (CHAM): 85% of detected road signs
• Because the system outputs only one result per input sequence, this rate is sufficient for it to recognize road signs correctly.

Conclusions (2)
• A neural-based dynamic visual model
• Three major components: sensory, perceptual, and conceptual
• Future research
• Potential applications
• Improvement of the DVM structure
• DVM implementation
