While learning for the exam of this semester’s Pattern & Speech Recognition course by Prof. Klakow (highly, highly recommended), we (a couple of people, look for the names in the document itself) put together a summary with a couple a topics from the course.
- Feature Extraction from Sound
- Bayesian Decision Theory
- Maximum Likelihood Estimation
- Nonparametric Techniques
- Gaussian Mixture Models
- Decision Trees
The text is put together from various sources, but mostly based on slides and notes from the lectures. Some sections are pending (Hidden Markov Models, Bayesian Networks, Markov Random Fields), other topics from the lecture are plainly missing (HMMs in Speech Recognition, Acoustic Modeling, Speaker Adaptation, Normal Distributions). However, to my knowledge, nobody has been examined in any of the missing speeach-recognition related missing sections.
Get the PDF version of the summary.
The LaTeX source file, along with pictures, is kept in a Mercurial repository. To get the source files, do:
$ hg clone static-http://diotavelli.net/files/psr0708-summary/repository psr
The source file is named summ.tex and should build on most LaTeX installations without requiring additional packages.
Please notify me of any bugs or errors!