Overview

alina

Presentation Transcript


  1. Overview
     • Valid DB Results
     • Complexity Analysis
     • New Database Protocol
     • Requirements Analysis

  2. Valid DB Results
     Valid DB Description
     • 106 persons.
     • 2 sentences with frontal view in 5 different environments.
     • 1 with fixed head motion.
     • Resolution: 768 X 576 @ 25 f/s.
     • Duration: 45 sec per person.

  3. Valid DB Results
     Visual lip features extracted @ 25 f/s. Half of DB, for 3rd sentence.
     Features
     • Area contained in lip boundary.
     • Length of major and minor axis.
     • Eccentricity of lip boundary.
     Arranged in MATLAB tables, one for each video file.
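The three features listed on this slide can be illustrated with a minimal sketch. It assumes the lip region is already available as a binary mask (a list of rows of 0/1 values) and that the axes are taken from the ellipse with the same second-order moments as the region. Both are assumptions for illustration; the slides do not say how the axes were measured. The function name `lip_shape_features` is hypothetical.

```python
import math


def lip_shape_features(mask):
    """Return (area, major_axis, minor_axis, eccentricity) of a binary mask.

    Assumption: axes are those of the ellipse that has the same
    second-order central moments as the foreground region.
    """
    pts = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    area = len(pts)  # area in pixels
    if area == 0:
        raise ValueError("mask contains no foreground pixels")

    # Centroid of the region.
    cx = sum(x for x, _ in pts) / area
    cy = sum(y for _, y in pts) / area

    # Central second moments; 1/12 is the second moment of a unit pixel.
    mxx = sum((x - cx) ** 2 for x, _ in pts) / area + 1 / 12
    myy = sum((y - cy) ** 2 for _, y in pts) / area + 1 / 12
    mxy = sum((x - cx) * (y - cy) for x, y in pts) / area

    # Eigenvalues of the 2x2 moment matrix give the variance along each axis.
    common = math.sqrt((mxx - myy) ** 2 + 4 * mxy ** 2)
    lam_major = (mxx + myy + common) / 2
    lam_minor = max((mxx + myy - common) / 2, 0.0)

    major = 4 * math.sqrt(lam_major)  # full axis length, not semi-axis
    minor = 4 * math.sqrt(lam_minor)
    eccentricity = math.sqrt(max(0.0, 1 - (minor / major) ** 2))
    return area, major, minor, eccentricity


# Example: a 3 x 9 filled rectangle standing in for a wide, closed mouth.
area, major, minor, ecc = lip_shape_features([[1] * 9 for _ in range(3)])
```

An eccentricity near 1 indicates a wide, flat shape; a value near 0 indicates a nearly circular one.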

  4. Complexity Analysis
     • Video to Image
        • Currently being done by hand.
        • Valid videos around 30 MB for 2 sentences.
        • 200 Images of 20 MB, JPEG @ 90 % quality.
     • Cut and Resize mouth to 45 X 90 pixels
        • Bicubic Interpolation, 32 ops per pixel.
        • 400 KB, Color
     • Color Transform
        • 4 ops per pixel.
        • 200 KB, BW
     • Calculate threshold
        • Otsu Thresholding, O(n^2)
        • 200 KB, BW
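As an illustration of the thresholding step, here is a minimal sketch of Otsu's method in its common histogram-based form. It assumes the mouth crop has already been converted to 8-bit grayscale and flattened into a list of integers in 0..255. The function name `otsu_threshold` is hypothetical, and this formulation is not necessarily the one whose cost is quoted on the slide.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes between-class variance.

    Pixels with value <= t form one class, pixels > t the other.
    Assumes integer pixel values in range(levels).
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in the lower class
    sum0 = 0  # sum of gray levels in the lower class
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue  # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break     # upper class empty from here on
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        between = w0 * w1 * (mean0 - mean1) ** 2
        if between > best_var:
            best_t, best_var = t, between
    return best_t


# Example: a synthetic bimodal "image" with dark and bright pixels.
pixels = [10] * 50 + [200] * 50
t = otsu_threshold(pixels)
binary = [1 if p > t else 0 for p in pixels]
```

In this form the work is one pass over the pixels to build the histogram, followed by one scan over the gray levels.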

  5. Complexity Analysis
     • Canny edge detection
        • 130 ops per pixel
        • 200 KB, BW
     • Mathematical morphology
        • Erosion, dilation with square mask of size 2 pixels.
        • 200 KB, BW
     • Calculating blob areas
        • Based on run length codes, image scan and consolidation.
        • Takes a few milliseconds
        • 200 KB, BW
     • Calculating convex hull
        • O(n log n)
        • 200 KB, BW
     • Calculate area, major/minor axis.
        • 200 KB, BW
     • Write table file
        • 2 KB, MATLAB Table
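The convex hull and area steps can be sketched as follows. This uses Andrew's monotone chain algorithm, which runs in O(n log n) because of the initial sort, followed by the shoelace formula for the enclosed area. The slides give only the complexity, not the algorithm, so the choice of monotone chain is an assumption; the function names are hypothetical.

```python
def convex_hull(points):
    """Return hull vertices in counter-clockwise order (monotone chain)."""
    pts = sorted(set(points))  # the sort dominates: O(n log n)
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        # Positive when o -> a -> b makes a counter-clockwise turn.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)

    upper = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)

    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def polygon_area(poly):
    """Area of a simple polygon via the shoelace formula."""
    twice_area = 0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2


# Example: boundary points of a 4 x 2 box plus two interior points.
hull = convex_hull([(0, 0), (4, 0), (4, 2), (0, 2), (2, 1), (1, 1)])
area = polygon_area(hull)
```

Interior points are discarded by the hull, so the area depends only on the outer boundary of the blob.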

  6. New Database Protocol
     • Contents: Text dependent speech with well illuminated frontal view of face.
     • Video resolution: 640 X 480 pixels
     • Temporal resolution: 25 f/s
     • Distance between eyes: 40-60 pixels
     • Video compression: Preferably none
     • Video format: avi
     • Color: 16 bit or higher
     • Number of subjects: 40-60
     • Length required for enrollment: 5 mins
     • Length required for recognition: 10-15 secs
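Because the protocol prefers uncompressed video, it may help to estimate the raw storage these settings imply. The sketch below assumes exactly 16 bits per pixel, no audio track, and no container overhead; the function name `raw_video_bytes` is hypothetical.

```python
def raw_video_bytes(width, height, bits_per_pixel, fps, seconds):
    """Size in bytes of uncompressed video (no audio, no container overhead)."""
    bytes_per_frame = width * height * bits_per_pixel // 8
    return bytes_per_frame * fps * seconds


# Protocol values: 640 x 480, 16-bit color, 25 f/s.
per_second = raw_video_bytes(640, 480, 16, 25, 1)
enrollment = raw_video_bytes(640, 480, 16, 25, 5 * 60)   # 5 minutes
recognition_min = raw_video_bytes(640, 480, 16, 25, 10)  # 10 seconds
recognition_max = raw_video_bytes(640, 480, 16, 25, 15)  # 15 seconds

print(f"{per_second / 1e6:.2f} MB per second")
print(f"{enrollment / 1e9:.2f} GB per enrollment")
print(f"{recognition_min / 1e6:.1f}-{recognition_max / 1e6:.1f} MB per recognition clip")
```

Under these assumptions the raw stream is about 15.4 MB per second, so one 5-minute enrollment recording is about 4.6 GB and a 10-15 second recognition clip is about 154-230 MB.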

  7. Requirements Analysis
     Device Description
     • Color LCD Screen: Normal PDA 480 X 320
     • GUI: 570 X 480
     • Input device: Keypad / touch screen
     • Video Camera: 640 X 480, Color 16 bit
     • RAM: ….??
     • Permanent memory: 2 KB per video per person

  8. Requirements Analysis
     GUI Requirements
     • Window-like environment
     • Message Box
     • Push Buttons
     • Video Display frame

  9. Requirements Analysis

  10. Requirements Analysis
     Application Functionality
     Enrollment Phase
     • Camera turns on and displays video.
     • User aligns face in template.
     • User pushes recording button when ready.
     • Software prompts sentences to speak.
     • User pushes recording button again to finish.
     • Software displays option window.
     • User pushes Enroll button.
     • Software extracts video parameters and saves them in permanent memory.

  11. Requirements Analysis
     Application Functionality
     Identification Phase
     • Similar to enrollment, except:
     • User pushes Identify button.
     • Software extracts video parameters and uses them to identify the person.
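The two phases share everything up to the final button press, which can be sketched as one store with two entry points. The slides do not specify how stored parameters are compared, so the nearest-neighbour match on Euclidean distance used here is purely an assumption for illustration. The class `FeatureStore`, its methods, and the example feature values are all hypothetical.

```python
import math


class FeatureStore:
    """In-memory stand-in for the device's permanent memory."""

    def __init__(self):
        self._templates = {}  # person id -> enrolled feature vector

    def enroll(self, person_id, features):
        """Enrollment phase: save the extracted video parameters."""
        self._templates[person_id] = list(features)

    def identify(self, features):
        """Identification phase: return the closest enrolled person.

        Assumption: closeness is Euclidean distance between vectors.
        """
        if not self._templates:
            raise LookupError("no one has been enrolled yet")
        return min(
            self._templates,
            key=lambda pid: math.dist(self._templates[pid], features),
        )


# Example with made-up feature vectors (area, major axis, minor axis).
store = FeatureStore()
store.enroll("subject_01", [900.0, 52.0, 21.0])
store.enroll("subject_02", [1400.0, 70.0, 25.0])
match = store.identify([920.0, 53.0, 20.5])
```

A real system would compare whole feature sequences over the spoken sentence and would also need a rejection threshold for people who are not enrolled; neither is covered by this sketch.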
