1 / 19

Speech Processing

Speech Processing. Applications of Images and Signals in High Schools. AEGIS RET All-Hands Meeting Florida Institute of Technology July 6, 2012. Contributors. Dr . Veton Këpuska , Faculty Mentor, FIT vkepuska@fit.edu Jacob Zurasky , Graduate Student Mentor, FIT jzuraksy@my.fit.edu

geri
Download Presentation

Speech Processing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Speech Processing Applications of Images and Signals in High Schools AEGIS RET All-Hands Meeting Florida Institute of Technology July 6, 2012

  2. Contributors Dr. VetonKëpuska, Faculty Mentor, FIT vkepuska@fit.edu Jacob Zurasky, Graduate Student Mentor, FIT jzuraksy@my.fit.edu Becky Dowell, RET Teacher, BPS Titusville High dowell.jeanie@brevardschools.org

  3. Motivation • Timeline / Background – need to add this • Difficulties • Siri demo

  4. Motivation • Speech audio processing has increased in its usefulness. • Applications • Siri on iPhone 4S • Automated telephone systems • Voice transcription (e.g. dictation software) • Hands-free computing (e.g., OnStar) • Video games (e.g., XBOX Kinect) • Military applications (e.g., aircraft control) • Healthcare applications

  5. Motivation • Speech recognition requires speech to first be characterized by a set of “features”. • Features are used to determine what words are spoken. • Our project implements the feature extraction stage of a speech processing application.

  6. Speech Recognition Front End: Pre-processing Back End: Recognition Features Recognized speech Speech Large amount of data. Ex: 256 samples Reduced data size. Ex: 13 features • Front End – reduce amount of data for back end, but keep enough data to accurately describe the signal. Output is feature vector. • 256 samples ------> 13 features • Back End - statistical models used to classify feature vectors as a certain sound in speech

  7. Front-End Processing of Speech Recognizer • Pre-emphasis • Window • FFT • Mel-Scale • log • IFFT

  8. Speech Analysis Project • Added GUI • Allow user to record audio or input audio from a sound file • Displays graph of the audio • User can click on graph to select speech frame • Processes speech frame and displays output for each state of processing • Displays spectrogram

  9. GUI Components

  10. GUI Components Plotting Axes

  11. Buttons GUI Components Plotting Axes

  12. Future Work • Improve GUI • Audio Effects • Noise Filtering

  13. References • Ingle, Vinay K., and John G. Proakis. Digital signal processing using MATLAB. 2nd ed. Toronto, Ont.: Nelson, 2007. • Oppenheim, Alan V., and Ronald W. Schafer. Discrete-time signal processing. 3rd ed. Upper Saddle River: Pearson, 2010. • Weeks, Michael. Digital signal processing using MATLAB and wavelets. Hingham,Mass.: Infinity Science Press, 2007.

  14. Thank you! Questions?

  15. Unit Plan

More Related