GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition

TitleGDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition
Publication TypeTechnical Report
Year of Publication1991
AuthorsKonig, Y., Morgan N., & Chandra C.
Other Numbers701

Conventional speaker-independent speech recognition systems do not consider speaker-dependent parameters in the probability estimation of phonemes. These recognition systems are instead tuned to the ensemble statistics over many speakers. Most parametric representations of speech, however, are highly speaker dependent, and probability distributions suitable for a certain speaker may not perform as well for other speakers. It would be desirable to incorporate constraints on analysis that rely on the same speaker producing all the frames in an utterance. Our experiments take a first step towards this speaker consistency modeling by using a classification network to help generate gender-dependent phonetic probabilities for a statistical recognition system. Our results show a good classification rate for the gender classification net. Simple use of such a model to augment an existing larger network that estimates phonetic probabilities does not help speech recognition performance. However, when the new net is properly integrated in an HMM recognizer, it provides significant improvement in word accuracy.

Bibliographic Notes

ICSI Technical Report TR-91-071

Abbreviated Authors

Y. Konig, N. Morgan, and C. Chandra

ICSI Publication Type

Technical Report