10 likes | 134 Views
S 1. a 11. a 12. t. t. t. S 2. a 22. a 13. S 3. a 33. Audio-Visual Speech Recognition .
E N D
S1 a11 a12 t t t S2 a22 a13 S3 a33 Audio-VisualSpeech Recognition In this paper, we present a macro-cuboïd based representation together with a probabilistic matching function to recognise lip movements from pronounced digits. Our model (1) automatically selects spatio-temporal features extracted from 10 digit model templates and (2) matches them with probe video sequences. Spatio-temporal features embed lip movements from pronouncing digits and contain more discriminative information than spatial features alone (see Figure 1). A model template for each digit is represented by a set of spatio-temporal macro-cuboïds at multiple scales. A probabilistic sequence matching function automatically segments a probe video sequence and matches the most likely sequence of digits recognised in the probe sequence. We demonstrate the proposed approach using the CUAVE \cite{Patterson2002} database and compare our representational scheme with three alternative methods, based on optical flow, intensity gradient and block matching, respectively. Our evaluation shows that the proposed approach outperforms the others in recognition accuracy and is … MFCC Vectors AUDIO Audio Feature Extraction Segmentation Audio Front End Fusion Spatio-Temporal Descriptors VIDEO Face Detection Visual Feature Extraction ROI Selection Hidden Markov Model Visual Front End Front End Bimodal Model Decision Saliency Detection THE MULTIDISCIPLINARY POWER OFCOMPUTER VISION Input Image Colour BY RG Luminance Orientation Center - Surround Combine Maps Normalize Saliency Map Milan Verma, Samuel Pachoud & BushraAkhtar Cellular Mechanics Analysis KneeJoint Cytoskeleton Mitochondria Nucleus Various Intracellular Organelles Imaged and Analysed 0.1 cm 2 cm 4 cm 6 cm 8 cm 10 cm Membrane Strain Map Articular Cartilage High Resolution Microscopy Imaging Motile Organelle Detection Pressure Applied to Cartilage Cells Under the Microscope