[ home ] [ people ] [ projects ] [ courses ] [ meetings ]


Robust Speech Recognition: the Algonquin Algorithm


Monday, Feb. 25th -- Trausti Thor Kristjansson


Abstract:
 

Robustness to ambient noise and channel distortion is crucial to real world speech recognition applications.  I will present a probabilistic graphical model of noisy speech generation and discuss an approximate method for inference in this model, called Algonquin. Algonquin uses an iterative variational method to find an approximation to the posterior distribution of a clean speech vector, given the noisy observed speech vector. Any model based cleaning method will perform poorly if the parameters of the environment model (noise and channel) are poorly estimated.  The framework allows us to adapt the environment parameters online using a Generalized EM method. I will present recognition results for the basic and adaptive algorithms on the Aurora 2 database.