Repository hosted by TU Delft Library

Home · Contact · About · Disclaimer ·

Channel-dependent GMM and multi-class logistic: Regression models for language recognition

Publication files not online:

Author: Leeuwen, D.A. van · Brümmer, Niko
Institution: TNO Defensie en Veiligheid
Source:Speaker recognition Odyssey 2006
Identifier: 16365
Keywords: Informatics · automatic speech recognition


This paper describes two new approaches to spoken language recognition. These were both successfully applied in the NIST 2005 Language Recognition Evaluation. The first approach extends the Gaussian Mixture Model technique with channel dependency, which results in actual detection costs (CDET) of 0.095 in NIST LRE-2005, and which should be compared to a traditional 2-gender dependency of GMM language models achieving 0.120. The second approach is a Multi-class Logistic Regression system, which operates similarly to a Support Vector Machine (SVM), but can be trained for all languages simultaneously. This new approach resulted in a CDET of 0.198. The joint TNO-Spescom Datavoice (TNO-SDV) submission to NIST LRE-2005 contained two more systems and obtained a result of 0.0958.