Chairman: Hung-Chi Yang Presenter: Yue-Fong Guo Advisor: Dr. Yeou-Jiunn Chen Date: 2013.3.20

Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling • V. Karjigi, P. Rao • Dept. of Electrical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India • Received 8 December 2011; received in revised form 12 March 2012; accepted 23 April 2012 • Available online 1 June 2012

Outline • Introduction • MFCC • 2D-DCT • Polynomial surface

Outline • GMM • Results • Conclusion

Introduction • Automatic speech recognition (ASR) system • The goal is to convert the lexical content of human speech into computer-readable input • Speaker recognition, by contrast, attempts to identify or verify the speaker who produced the voice rather than the words it contains

Introduction • Automatic speech recognition (ASR) system • Acoustic features • Signal processing and feature extraction • Mel frequency cepstral coefficients (MFCC) • Acoustic model • Statistical speech model • Gaussian mixture model (GMM)

MFCC • Mel frequency cepstral coefficients (MFCC) • MFCCs take the frequency sensitivity of human hearing into account and are therefore well suited to speech and speaker recognition.

MFCC • Pre-emphasis • The speech signal s(n) is passed through a high-pass filter • Frame blocking • Hamming windowing • Each frame is multiplied by a Hamming window to preserve continuity between the first and last points in the frame
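The three front-end steps on this slide can be sketched roughly as follows. This is a minimal illustration, not the presenter's implementation: the pre-emphasis coefficient 0.97, the frame length of 400 samples, and the hop of 160 samples are assumed values (typical for 16 kHz speech), and all function names are hypothetical.

```python
import math

def pre_emphasize(signal, alpha=0.97):
    """First-order high-pass filter y[n] = s[n] - alpha * s[n-1] (alpha is assumed)."""
    if not signal:
        return []
    return [signal[0]] + [signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))]

def frame_blocks(signal, frame_len, hop):
    """Split the signal into overlapping frames; a trailing partial frame is dropped."""
    return [signal[i:i + frame_len] for i in range(0, len(signal) - frame_len + 1, hop)]

def hamming(frame_len):
    """Hamming window w[n] = 0.54 - 0.46 * cos(2*pi*n / (N - 1)), for N >= 2."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
            for n in range(frame_len)]

def windowed_frames(signal, frame_len=400, hop=160, alpha=0.97):
    """Pre-emphasize, block into frames, and taper each frame with a Hamming window."""
    window = hamming(frame_len)
    emphasized = pre_emphasize(signal, alpha)
    return [[x * w for x, w in zip(frame, window)]
            for frame in frame_blocks(emphasized, frame_len, hop)]
```

The window tapers both ends of every frame toward a small value, which is what reduces the discontinuity between the first and last points before the spectrum is computed.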

MFCC • Fast Fourier transform (FFT) • Converts the time-domain signal into the frequency domain • Triangular bandpass filters • Smooth the magnitude spectrum so that the harmonics are flattened, leaving the envelope of the spectrum • Discrete cosine transform (DCT)
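A rough sketch of how these three steps fit together for a single windowed frame is given below. Several choices are assumptions rather than details from the slides: the common 2595 * log10(1 + f/700) mel formula, 20 filters, 12 cepstral coefficients, and a small floor inside the logarithm. A naive DFT stands in for the FFT to keep the example dependency-free; it produces the same values, only more slowly.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (an FFT gives the same result faster)."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(num_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(top * i / (num_filters + 1)) for i in range(num_filters + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for m in range(1, num_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))   # rising slope
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))   # falling slope
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def dct_ii(values, num_coeffs):
    """Unnormalized DCT-II, keeping the first num_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
            for k in range(num_coeffs)]

def mfcc_from_frame(frame, sample_rate, num_filters=20, num_coeffs=12):
    """Spectrum -> mel filterbank energies -> log -> DCT for one windowed frame."""
    spectrum = magnitude_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    log_energies = [math.log(sum(w * s for w, s in zip(row, spectrum)) + 1e-10)
                    for row in bank]
    return dct_ii(log_energies, num_coeffs)
```

Each triangular filter sums the spectrum over a band that is wider than the harmonic spacing at higher frequencies, which is why the filterbank output follows the spectral envelope rather than the individual harmonics.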

MFCC • Log energy • The energy within a frame is also an important feature and is easy to obtain • Delta cepstrum • In practical speech recognition, delta (differential) cepstral parameters are usually appended to show how the cepstral parameters change over time
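Both features can be illustrated in a few lines. The delta below uses the widely used regression formula over a window of plus or minus two frames, with the end frames repeated at the edges; the slides do not say which variant was used, so the window width and edge handling are assumptions.

```python
import math

def log_energy(frame, floor=1e-10):
    """Log of the sum of squared samples in one frame (floor avoids log(0))."""
    return math.log(sum(x * x for x in frame) + floor)

def delta(features, width=2):
    """Regression-based delta of a list of per-frame feature vectors.

    d[t] = sum_{k=1..width} k * (c[t+k] - c[t-k]) / (2 * sum_{k=1..width} k^2),
    with indices clamped to the first and last frames.
    """
    num_frames = len(features)
    denom = 2 * sum(k * k for k in range(1, width + 1))
    out = []
    for t in range(num_frames):
        row = []
        for c in range(len(features[t])):
            acc = 0.0
            for k in range(1, width + 1):
                nxt = features[min(t + k, num_frames - 1)][c]
                prv = features[max(t - k, 0)][c]
                acc += k * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out
```

For a feature that rises by exactly one unit per frame, the delta in the interior of the sequence is 1.0, which matches the intuition that it estimates the local slope over time.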

2D-DCT • 2D-DCT modeling
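The slide's figure is not preserved in this transcript, so the following is only a generic sketch of the idea: treat a spectro-temporal patch (assumed here to be a matrix of log filterbank energies, frequency bands by frames) as a 2D surface, apply a separable 2D-DCT, and keep the low-order coefficients as features. The function names and the 3 x 3 truncation are hypothetical.

```python
import math

def dct_1d(values):
    """Unnormalized DCT-II of a sequence."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n) for i, v in enumerate(values))
            for k in range(n)]

def dct_2d(patch):
    """Separable 2D-DCT: transform each row (time), then each column (frequency)."""
    along_time = [dct_1d(row) for row in patch]
    along_freq = [dct_1d(list(col)) for col in zip(*along_time)]
    # Transpose back so the result is indexed [frequency order][time order].
    return [list(row) for row in zip(*along_freq)]

def low_order_features(patch, keep_freq=3, keep_time=3):
    """Keep the lowest-order coefficients, which describe the smooth surface shape."""
    coeffs = dct_2d(patch)
    return [coeffs[u][v] for u in range(keep_freq) for v in range(keep_time)]
```

A perfectly flat patch puts all of its energy in the single coefficient at index [0][0]; variation along time or frequency shows up in the higher-order terms.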

Polynomial surface • Polynomial surface modeling
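As with the 2D-DCT slide, the original figure is missing, so this is a generic sketch rather than the paper's exact formulation: fit a low-order bivariate polynomial to the spectro-temporal patch by least squares and use the fitted coefficients as features. The order of 2, the scaling of both axes to [-1, 1], and the function names are assumptions; the patch needs at least order + 1 points along each axis.

```python
def poly_terms(t, f, order):
    """Monomials t^i * f^j with i + j <= order, e.g. [1, f, f^2, t, t*f, t^2]."""
    return [(t ** i) * (f ** j) for i in range(order + 1) for j in range(order + 1 - i)]

def solve_linear(a, b):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x

def fit_polynomial_surface(patch, order=2):
    """Least-squares polynomial coefficients for patch[frequency][time]."""
    num_f, num_t = len(patch), len(patch[0])
    rows, targets = [], []
    for fi in range(num_f):
        for ti in range(num_t):
            # Scale both axes to [-1, 1] to keep the normal equations well conditioned.
            f = 2.0 * fi / (num_f - 1) - 1.0
            t = 2.0 * ti / (num_t - 1) - 1.0
            rows.append(poly_terms(t, f, order))
            targets.append(patch[fi][ti])
    k = len(rows[0])
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    atb = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(k)]
    return solve_linear(ata, atb)
```

With order 2 the fit yields six coefficients, a much more compact description of the surface than the raw patch values.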

GMM • Gaussian mixture model (GMM) • An effective tool for data modeling and pattern classification • The speaker's acoustic characteristics are clustered, and each cluster is then described by a Gaussian density distribution
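To show how such a model is used for classification, the sketch below scores a feature vector against one GMM per class and picks the class with the highest log-likelihood. It assumes diagonal covariances and already-trained parameters (training, typically by EM, is omitted); the model layout and the class labels in the usage example are hypothetical.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance."""
    return -0.5 * sum(math.log(2 * math.pi * v) + (xi - m) ** 2 / v
                      for xi, m, v in zip(x, mean, var))

def gmm_log_likelihood(x, weights, means, variances):
    """log sum_k w_k * N(x; mean_k, var_k), computed with the log-sum-exp trick."""
    logs = [math.log(w) + log_gaussian_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)]
    peak = max(logs)
    return peak + math.log(sum(math.exp(l - peak) for l in logs))

def classify(x, class_models):
    """Return the label whose GMM assigns x the highest log-likelihood.

    class_models maps label -> (weights, means, variances).
    """
    return max(class_models, key=lambda label: gmm_log_likelihood(x, *class_models[label]))
```

For example, with hypothetical two-dimensional models `{"p": ([0.5, 0.5], [[0, 0], [1, 1]], [[1, 1], [1, 1]]), "t": ([1.0], [[5, 5]], [[1, 1]])}`, a vector near the origin is assigned to "p" and a vector near (5, 5) to "t".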

Databases • Evaluated on two distinct datasets • American English continuous speech from the TIMIT database • A Marathi word database created specially for this purpose

Results

Conclusion • A comparison of performance with published results on the same task revealed that the spectro-temporal feature systems tested in this work improve upon the best previous systems’ performances in terms of classification accuracies on the specified datasets.

The End

