GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition
Title | GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition |
Publication Type | Technical Report |
Year of Publication | 1991 |
Authors | Konig, Y., Morgan N., & Chandra C. |
Other Numbers | 701 |
Abstract | Conventional speaker-independent speech recognition systems do not consider speaker-dependent parameters in the probability estimation of phonemes. These recognition systems are instead tuned to the ensemble statistics over many speakers. Most parametric representations of speech, however, are highly speaker dependent, and probability distributions suitable for a certain speaker may not perform as well for other speakers. It would be desirable to incorporate constraints on analysis that rely on the same speaker producing all the frames in an utterance. Our experiments take a first step towards this speaker consistency modeling by using a classification network to help generate gender-dependent phonetic probabilities for a statistical recognition system. Our results show a good classification rate for the gender classification net. Simple use of such a model to augment an existing larger network that estimates phonetic probabilities does not help speech recognition performance. However, when the new net is properly integrated in an HMM recognizer, it provides significant improvement in word accuracy. |
URL | http://www.icsi.berkeley.edu/pubs/techreports/tr-91-071.pdf |
Bibliographic Notes | ICSI Technical Report TR-91-071 |
Abbreviated Authors | Y. Konig, N. Morgan, and C. Chandra |
ICSI Publication Type | Technical Report |