Major components in a Speech Recognition System, excerpted from Lecture 1. (Image by James Glass and Victor Zue.)
6.345 is a course in the department's "Bioelectrical Engineering" concentration. This course offers a full set of lecture slides
with accompanying speech samples, as well as homework assignments
and other materials used in the course.
6.345 introduces students to the rapidly developing field of automatic speech recognition. Its content is divided into three parts. Part I deals with background material in the acoustic theory of speech production, acoustic-phonetics, and signal representation. Part II describes algorithmic aspects of speech recognition systems including pattern classification, search algorithms, stochastic modelling, and language modelling techniques. Part III compares and contrasts the various approaches to speech recognition, and describes advanced techniques used for acoustic-phonetic modelling, robust speech recognition, speaker adaptation, processing paralinguistic information, speech understanding, and multimodal processing.
Media player software, such as Quicktime® Player, RealOne™ Player, or Windows Media® Player, is required to run the .wav files found on this course site.
RealOne™ is a trademark or a registered trademark of RealNetworks, Inc.
QuickTime® is a trademark of Apple Computer, Inc., registered in the U.S. and other countries.
Windows Media® is a registered trademark or trademark of Microsoft Corporation in the U.S. and/or other countries.
*Some translations represent previous versions of courses.