80 likes | 209 Views
Analysis and Knowledge Extraction from Video & Audio. Rick Parent Jim Davis Raghu Machiraju Deleon Wang. Department of Computer and Information Science Ohio State University. Overview. Motivation. Streaming data from video & audio. Problem. Human operators & large data sets. Solution.
E N D
Analysis and Knowledge Extraction from Video & Audio Rick Parent Jim Davis Raghu Machiraju Deleon Wang Department of Computer and Information Science Ohio State University
Overview Motivation Streaming data from video & audio Problem Human operators & large data sets Solution Focus on human behavior Extract important events Use multimodal approach Security (real-time processing) Annotating recorded video Processing archival material Applications
Objectives Detect and track people to extract audio-visual events Present graphical summaries to human operator via secure web-based interface Build prototype system • 3 level system • Person/action detection • Sequential long-term tracking • Multi-modal identification Incrementally constructs event model to focus attention and resources to track and recognize people across sequences
Person Detection and Activity Recognition(Jim Davis) Thermal-based image analysis and person detection Framework for recognizing basic human activities
Sequential-frame tracking(Raghu Machiraju, Rick Parent) Monitor across sequences Tack human figure poses Capture appearance Characterize motions
+ = Robust Speaker Recognition(Deleon Wang) Usable speech extraction from multiple speaker audio By tracking pitch and extracting voiced segments
Deliverables Demonstration subsystems Person detection Long-term tracking Speech recognition 6 mos: review of basic work 12 months: demo of capabilities, summary report
Expenditures 6 Student-quarters of support over 12 months 2 Qtrs: Person detection (Davis) 3 Qtrs: Tracking (Machiraju & Parent) 1 Qtr: Speech (Wang)