Publication Details

Title: GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition
Author: Y. Konig, N. Morgan, and C. Chandra
Group: ICSI Technical Reports
Date: December 1991
PDF: http://www.icsi.berkeley.edu/pubs/techreports/tr-91-071.pdf

Overview:
Conventional speaker-independent speech recognition systems do not consider speaker-dependent parameters in the probability estimation of phonemes. These recognition systems are instead tuned to the ensemble statistics over many speakers. Most parametric representations of speech, however, are highly speaker dependent, and probability distributions suitable for a certain speaker may not perform as well for other speakers. It would be desirable to incorporate constraints on analysis that rely on the same speaker producing all the frames in an utterance. Our experiments take a first step towards this speaker consistency modeling by using a classification network to help generate gender-dependent phonetic probabilities for a statistical recognition system. Our results show a good classification rate for the gender classification net. Simple use of such a model to augment an existing larger network that estimates phonetic probabilities does not help speech recognition performance. However, when the new net is properly integrated in an HMM recognizer, it provides significant improvement in word accuracy.

Bibliographic Information:
ICSI Technical Report TR-91-071

Bibliographic Reference:
Y. Konig, N. Morgan, and C. Chandra. GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition. ICSI Technical Report TR-91-071, December 1991