Evolution of Speech and Speaker Recognition Technology: Insights from 40 Years of Research

Selected topics from 40 years of research on speech and speaker recognition Sadaoki Furui Tokyo Institute of Technology Department of Computer Science furui@cs.titech.ac.jp

Generations of ASR technology 1950 1960 1970 1980 1990 2000 2010 1952 1968 1G Heuristic approaches (analog filter bank + logic circuits) 1980 2G 1968 Pattern matching (LPC, FFT, DTW) 3G 1980 1990 Statistical framework (HMM, n-gram, neural net) 3.5G 1990 Discriminative approaches, robust training, Prehistory normalization, adaptation, spontaneous speech, rich transcription 4G ? Extended knowledge processing Our research NTT Labs (+Bell Labs), Tokyo Tech Collaboration with other labs

Japanese traditional cuisine “Kaiseki-ryori”

ATTENTION! TRIAL LIMITATION - ONLY 3 SELECTED PAGES MAY BE CONVERTED PER CONVERSION. PURCHASING A LICENSE REMOVES THIS LIMITATION. TO DO SO, PLEASE CLICK ON THE FOLLOWING LINK: https://www.pdfconverter.com/purchase/

Evolution of Speech and Speaker Recognition Technology: Insights from 40 Years of Research

Evolution of Speech and Speaker Recognition Technology: Insights from 40 Years of Research

Presentation Transcript

Selected Topics in Propagation

Overview of Selected Topics

STAFFING – SELECTED TOPICS

Surveillance: Selected Topics

Selected Design Topics

Skeletal System - Selected Topics

Selected Topics From Chapter6 Iteration

Unit 4 Selected Topics

Selected Advanced Topics

Selected Research Topics

Selected topics in Transcription

Nuclear Chemistry (selected topics)

Selected Topics in

CSC590 Selected Topics

Selected Topics from Philippians

Selected Topics first

Selected Research Topics

Microcontroller Interfacing: Selected Topics

Selected Advanced Topics

Selected Topics from Philippians

Selected Topics in Propagation

DAQ Overview + selected Topics