1 / 13

Isolated word, speaker independent speech recognition

Isolated word, speaker independent speech recognition. Kaustubh R. Kale Guide: Dr. John G. Harris. Project Goals. To make appliances smart Use Dynamic Time Warping algorithm 13 Mel frequency Cepstral coefficients as the extracted features Gui development and hardware interface.

Download Presentation

Isolated word, speaker independent speech recognition

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Isolated word, speaker independent speech recognition Kaustubh R. Kale Guide: Dr. John G. Harris EEL 6825 project

  2. Project Goals • To make appliances smart • Use Dynamic Time Warping algorithm • 13 Mel frequency Cepstral coefficients as the extracted features • Gui development and hardware interface EEL 6825 project

  3. Description • Schematic Diagram • Endpoint detection in Java • DTW analysis in Matlab • Parallel Port operations via C++ • Demo FOR MORE INFO... http://www.dcs.shef.ac.uk/~stu/com326/ EEL 6825 project

  4. Schematic Diagram Java Matlab C++ Appliance EEL 6825 project

  5. Endpoint detection in Java. • Utterances are of unequal lengths • Preceded by silence • Use of signal power p[i..j] = k=i..js[k]2 EEL 6825 project

  6. DTW analysis in Matlab • Two basic concepts to be understood: 1. Feature extraction from the time dependant signal 2. Distance calculation: a.Local distance between features b.Global distance between signals EEL 6825 project

  7. DTW Flow • To obtain a global distance, time alignment must be done D(I,j)=min[D(I-1,j-1),D(I-1,j),D(I,j-1)] +d(I,j) EEL 6825 project

  8. C++ interface with the port • The matlab passes on the a parameter to the C++ program • The C++ program drives the respective pins on the parallel port The Parameters: 1 = lights and fan off 2 = lights on and fan off 3 = fan on and lights off 4 = lights on and fan on EEL 6825 project

  9. Classification Errors • For speaker dependent operation the classification errors were 20% • For speaker independent operation the classification errors were 30%-40% EEL 6825 project

  10. Demonstration End to end operation EEL 6825 project

  11. Future work • Making the DTW more robust to ambient noise • Achieving speaker independent word recognition • Efficient inter component communication EEL 6825 project

  12. Conclusion • Via this program the goal of having voice operated smart appliance was achieved • The error rate was around 20% EEL 6825 project

  13. Thanks! Question time… EEL 6825 project

More Related