50 likes | 162 Views
Selected topics from. 40 years of research on speech and speaker recognition. Sadaoki Furui. Tokyo Institute of Technology Department of Computer Science furui@cs.titech.ac.jp. Generations of ASR technology. 1950. 1960. 1970. 1980. 1990. 2000. 2010. 1952. 1968. 1G.
E N D
Selected topics from 40 years of research on speech and speaker recognition Sadaoki Furui Tokyo Institute of Technology Department of Computer Science furui@cs.titech.ac.jp
Generations of ASR technology 1950 1960 1970 1980 1990 2000 2010 1952 1968 1G Heuristic approaches (analog filter bank + logic circuits) 1980 2G 1968 Pattern matching (LPC, FFT, DTW) 3G 1980 1990 Statistical framework (HMM, n-gram, neural net) 3.5G 1990 Discriminative approaches, robust training, Prehistory normalization, adaptation, spontaneous speech, rich transcription 4G ? Extended knowledge processing Our research NTT Labs (+Bell Labs), Tokyo Tech Collaboration with other labs
ATTENTION! TRIAL LIMITATION - ONLY 3 SELECTED PAGES MAY BE CONVERTED PER CONVERSION. PURCHASING A LICENSE REMOVES THIS LIMITATION. TO DO SO, PLEASE CLICK ON THE FOLLOWING LINK: https://www.pdfconverter.com/purchase/