110 likes | 275 Views
Overview. Valid DB Results Complexity Analysis New database Protocol Requirements Analysis. Valid DB Results. Valid DB Description 106 Person. 2 sentences with frontal view in 5 different environments. 1 with fixed head motion. Resolution: 768 X 576 @ 25 f/s.
E N D
Overview • Valid DB Results • Complexity Analysis • New database Protocol • Requirements Analysis
Valid DB Results Valid DB Description 106 Person. 2 sentences with frontal view in 5 different environments. 1 with fixed head motion. Resolution: 768 X 576 @ 25 f/s. Duration: 45 sec per person
Valid DB Results Visual lip features extracted @ 25 f/s. Half of DB, for 3rd sentence. Features • Area contained in lip boundary. • Length of major and minor axis. • Eccentricity of lip boundary. Arranged in matlab tables, for each video file
Complexity Analysis • Video to Image • Currently being done by hand. • Valid videos around 30 MB for 2 sentences. • 200 Images of 20 MB, JPEG @ 90 % quality. • Cut and Resize mouth to 45 X 90 pixels • Bicubic Interpolation, 32 ops per pixel. • 400 KB, Color • Color Transform • 4 ops per pixel. • 200 KB , BW • Calculate threshold • Otsu Thresholding, O(n2) • 200 KB , BW
Complexity Analysis • Canny edge detection • 130 ops per pixel • 200 KB , BW • Mathematical morphology • Erosion, dilation with square mask of size 2 pixels. • 200 KB , BW • Calculating blob areas • Based on run length codes, image scan and consolidation. • Takes few milliseconds • 200 KB , BW • Calculating convex hull • O(nlogn) • 200 KB , BW • Calculate area, major/minor axis. • 200 KB , BW • Write table file • 2 KB , Matlab Table
New Database Protocol • Contents • Text dependent speech with well illuminated frontal view of face. • Video resolution • 640X480 pixels • Temporal Resolution • 25 f/s • Distance between eyes • 40-60 pixels • Video compression • Preferably none • Video Format • avi • Color • 16 bit or higher • Number of Subjects • 40-60 • Length required for enrollment • 5 mins • Length required for recognition • 10-15 secs
Requirements Analysis Device Description Color LCD Screen Normal PDA 480 X 320 GUI 570 X 480 Input device Keypad / touch screen Video Camera 640X480, Color 16 bit RAM….?? Permanent memory 2 KB per video per person
Requirements Analysis GUI Requirements Window like environment Message Box Push Buttons Video Display frame
Requirements Analysis Application Functionality Enrollment Phase • Camera turns on and displays video. • User aligns face in template • User pushes recording button when ready • Software prompts sentences to speak • User pushes recording button again to finish • Software displays option window • User pushes enroll • Software extracts video parameters and saves them in permanent memory.
Requirements Analysis Application Functionality Identification Phase • Similar to enrolment except • User pushes Identify button • Software extracts video parameters and uses them to identify person