| Global Posterior Probability Estimates as Decision Confidence Measures in an Automatic Speech Recognition System | W. Warren | Ph.D. Dissertation, University of California at Berkeley | December 2000 | Speech | |
| Using Mutual Information to Design Feature Combinations | D. Ellis and J. Bilmes | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Decoding Speech in the Presence of Other Sound Sources | J. Barker, M. Cooke, and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Using Acoustic Condition Clustering to Improve Acoustic Change Detection on Broadcast News | J.F. Lopez and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, Vol. 4, pp. 568-571 | October 2000 | Speech | [PDF]
|
| Consonant Discrimination in Elicited and Spontaneous Speech: A Case for Signal-Adaptive Front Ends in ASR | K. Sönmez, M. Plauché, E. Shriberg, and H. Franco | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| On Data-Derived Temporal Processing in Speech Feature Extraction | M. Shire and B. Chen | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Automatic Phonetic Transcription of Spontaneous Speech (American English) | S. Chang, L. Shastri, and S. Greenberg | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| A Comparison of Data-Derived and Knowledge-Based Modeling of Pronunciation Variation | M. Wester and E. Fosler-Lussier | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Automatic Labeling of Semantic Roles | D. Gildea and D. Jurafsky | The 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong, pp. 512-520 | October 2000 | Speech | [PDF]
|
| Prosody-Based Automatic Segmentation of Speech into Sentences and Topics | E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür | Speech Communication, T. Robinson and S. Renals, eds., Vol. 32, Issue 1-2, pp. 127-154 | September 2000 | Speech | |
| Tandem Connectionist Feature Stream Extraction for Conventional HMM Systems | H. Hermansky, D. Ellis, and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1635-1638 | June 2000 | Speech | [PDF]
|
| Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora Database | S. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120 | June 2000 | Speech | [PDF]
|
| Data-driven RASTA Filters in Reverberation | M. Shire and B. Chen | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630 | June 2000 | Speech | [PDF]
|
| Improved Recognition by Combining Different Features and Different Systems | D.P.W. Ellis | Proceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, California | May 2000 | Speech | [PDF]
|
| An Introduction to the Diagnostic Evaluation of the Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg, S. Chang, and J. Hollenback | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| Prosodic Stress Revisited: Reassessing the Role of Fundamental Frequency | R. Silipo and S. Greenberg | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| The Uninvited Guest: Information's Role in Guiding the Production of Spontaneous Speech | S. Greenberg and E. Fosler-Lussier | Proceedings of the Crest Workshop on Models of Speech Production: Motor Planning and Articulatory Modelling, Kloster Seeon, Germany | May 2000 | Speech | [PDF]
|
| Relevance of Time-Frequency Features for Phonetic and Speaker-Channel Classification | H.H. Yang, S. Sharma, S. van Vuuren, and H. Hermansky | Speech Communication, Vol. 31, No. 1, pp. 35-50 | May 2000 | Speech | [PDF]
|
| Linguistic Dissection of Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg and S. Chang | Proceedings of the ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, France | 2000 | Speech | [PDF]
|
| Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition | M. Shire | Ph.D. Dissertation, University of California at Berkeley, Fall 2000 | 2000 | Speech | [PDF]
|
| Search for Information Bearing Components in Speech | H.H. Yang and H. Hermansky | Advances in Neural Information Processing Systems, Vol. 12, S.A. Solla, T.K. Leen and K.-R. Müller, eds., MIT Press | 2000 | Speech | |
| Contextual Word and Syllable Pronunciation Models | E. Fosler-Lussier | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Combined Speech and Speaker Recognition With Speaker-adapted Connectionist Models | D. Genoud, D. Ellis, and N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Frequency on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Speech Communication, Vol. 29, No. 2-4, pp. 137-158 | November 1999 | Speech | [PDF]
|
| Natural Statistical Models for Automatic Speech Recognition | J. Bilmes | Ph.D. Thesis, University of California at Berkeley, Fall 1999. Also ICSI Technical Report TR-99-016 | October 1999 | Speech | [PDF]
|
| Multi-Level Decision Trees for Static and Dynamic Pronunciation Models | E. Fosler-Lussier | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. I-463-466 | September 1999 | Speech | [PDF]
|
| Multi-stream Speech Recognition: Ready for Prime Time? | A. Janin, D. Ellis, and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. II-591-594 | September 1999 | Speech | [PDF]
|
| Speech/music Discrimination Based on Posterior Probability Features | G. Williams and D. Ellis | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. II-687-690 | September 1999 | Speech | [PDF]
|
| Temporal Constraints on Speech Intelligibility as Deduced From Exceedingly Sparse Spectral Representations | R. Silipo, S. Greenberg, and T. Arai | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. VI-2687-2690 | September 1999 | Speech | [PDF]
|
| Data-Driven Modulation Filter Design Under Adverse Acoustic Conditions and Using Phonetic and Syllabic Units | M.L. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. III-1123-1126 | September 1999 | Speech | [PDF]
|
| Topic-Based Language Models Using EM | D. Gildea and T. Hofmann | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. V-2167-2170 | September 1999 | Speech | [PDF]
|
| Sooner or Later: Exploring Asynchrony in Multi-Band Speech Recognition | N. Mirghafori and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, Vol. 2, pp. 595-598 | September 1999 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D. Thesis, UC Berkeley, Fall 1999, ICSI Technical Report TR-99-015 | September 1999 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D. Dissertation, University of California at Berkeley | August 1999 | Speech | [PDF]
|
| Forms of English Function Words - Effects of Disfluencies, Turn Position, Age and Sex, and Predictability | A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand, and D. Gildea | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 395-398 | August 1999 | Speech | [PDF]
|
| Incorporating Contextual Phonetics Into Automatic Speech Recognition | E. Fosler-Lussier, S. Greenberg, and N. Morgan | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 611-614 | August 1999 | Speech | [PDF]
|
| Statistical Acoustic Indications of Coarticulation | K. Kirchhoff and J. Bilmes | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1729-1732 | August 1999 | Speech | [PDF]
|
| Syllable Detection and Segmentation Using Temporal Flow Neural Networks | L. Shastri, S. Chang, and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1721-1724 | August 1999 | Speech | [PDF]
|
| Automatic Transcription of Prosodic Stress for Spontaneous English Discourse | R. Silipo and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 2351-2354 | August 1999 | Speech | [PDF]
|
| Syllable Intelligibility for Temporally-Filtered LPC Cepstral Trajectories | T. Arai, M. Pavel, H. Hermansky, and C. Avendano | Journal of the Acoustical Society of America, Vol. 105, No. 5, pp. 2783-2791 | May 1999 | Speech | [PDF]
|
| Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF]
|
| Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech Recognition | D. Ellis and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016 | March 1999 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| Using Boosting to Improve a Hybrid HMM/Neural Network Speech Recognizer | H. Schwenk | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012 | March 1999 | Speech | [PDF]
|
| Temporal Patterns (TRAPS) in ASR of Noisy Speech | H. Hermansky and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Not Just What, But Also When: Guided Automatic Pronunciation Modeling for Broadcast News | E. Fosler-Lussier and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Reducing Errors by Increasing the Error Rate: MLP Acoustic Modeling for Broadcast News Transcription | N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin, and B. Kingsbury | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| An Overview of the SPRACH System for the Transcription of Broadcast News | G. Cook, J. Christie, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson, and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Using Knowledge to Organize Sound: The Prediction-driven Approach to Computational Auditory Scene Analysis and Its Application to Speech/Nonspeech Mixtures | D. Ellis | Speech Communication, Vol. 27, Issue 3-4, pp. 281-298 | 1999 | Speech | |