An Investigation of Tandem MLP Features for ASR
Publication Type | Technical Report |
Year of Publication | 2007 |
Authors | Faria, A. |
Other Numbers | 2212 |
Abstract | This project explores speech feature representations produced by discriminatively trained multi-layer perceptrons. Previous research has demonstrated that such a tandem approach can be successfully exploited for large-vocabulary automatic speech recognition systems. The principal aim of this work is to empirically evaluate some variants of these features. While experimental results validate some of the design choices of the standard implementation, other evidence suggests alternatives that may improve performance. From this exploratory investigation, we hypothesize which of the various modifications are most promising; applied to a Mandarin broadcast news task, the new configuration demonstrates significant improvement. Alongside a novel best-case scenario and other cheating experiments, we discuss an interpretation of these results with the hope of guiding future directions of research. |
URL | http://www.icsi.berkeley.edu/pubs/techreports/faria_icsitr.pdf |
Bibliographic Notes | ICSI Technical Report TR-07-003 |
Abbreviated Authors | A. Faria |
ICSI Research Group | Speech |
ICSI Publication Type | Technical Report |