| Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data? | A. Venkataraman, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780 | September 2005 | Speech | [PDF]
|
| Using MLP Features in SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2141-2144 | September 2005 | Speech | [PDF]
|
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | [PDF]
|
| Automatic Data Selection for MLP-Based Feature Extraction for ASR | C. Pelaez-Moreno, Q. Zhu, B. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 229-232 | September 2005 | Speech | [PDF]
|
| Efficient Pitch-Based Estimation of VTLN Warp Factors | A. Faria and D. Gelbart | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 213-216 | September 2005 | Speech | [PDF]
|
| A Methodology for Comparing Grammar-Based and Robust Approaches to Speech Understanding | P. Bouillon, N. Chatzichrisafis, B.A. Hockey, M. Rayner, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1877-1880 | September 2005 | Speech | |
| Spontaneous Speech: How People Really Talk, and Why Engineers Should Care | E. Shriberg | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1781-1784 | September 2005 | Speech | [PDF]
|
| Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech | |
| Automatic Speech Recognition with Neural Spike Trains | M. Holmberg, D. Gelbart, U. Ramacher, and W. Hemmert | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal | September 2005 | Speech | [PDF]
|
| MLLR Transforms as Features in Speaker Recognition | A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428 | September 2005 | Speech | |
| The Effects of Speech Recognition and Punctuation on Information Extraction Performance | J. Makhoul, A. Baron, I. Bulyko, L. Nguyen, L. Ramshaw, D. Stallard, R. Schwartz, and B. Xiang | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 57-60 | September 2005 | Speech | |
| Meeting Acts: A Labeling System for Group Interaction in Meetings | R. Bates, P. Menning, E. Willingham, and C. Kuyper | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
|
| Using Symbolic Prominence to Help Design Feature Subsets for Topic Classification and Clustering of Natural Human-Human Conversations | C. Boulis and M. Ostendorf | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
|
| Modeling Prosodic Feature Sequences for Speaker Recognition | E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke | Speech Communication, Vol. 46, Issues 3-4, pp. 455-472 | July 2005 | Speech | |
| Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475 | July 2005 | Speech | [PDF]
|
| Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System | X. Anguera, C. Wooters, B. Peskin, and M. Aguilo | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 402-414 | July 2005 | Speech | [PDF]
|
| Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms | M. Ferras Font | M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain | July 2005 | Speech | [PDF]
|
| Comparison of Grammar Based and Statistical Language Models Trained on the Same Data | B.A. Hockey and M. Rayner | Presented at the Workshop on Spoken Language Understanding at the 20th AAAI National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania | July 2005 | Speech | |
| Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193 | July 2005 | Speech | [PDF]
|
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| The Role of Disfluencies on Topic Classification of Human-Human Conversations | C. Boulis, J. G. Kahn, and M. Ostendorf | Proceedings of the Spoken Language Understanding Workshop Program at the 20th National Conference on Artificial Intelligence (AAAI-05), Pittsburgh, Pennsylvania | July 2005 | Speech | [PDF]
|
| Using Conditional Random Fields For Sentence Boundary Detection in Speech | Y. Liu, A. Stolcke, E. Shriberg, and M. Harper | Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 451-458 | June 2005 | Speech | [PDF]
|
| A Voice-Enabled Procedure Browser for the International Space Station | M. Rayner, B.A. Hockey, N. Chatzichrisafis, K. Farrell, and J.M. Renders | Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 29-32 (interactive poster and demo track) | June 2005 | Speech | |
| A Quantitative Analysis of Lexical Differences Between Genders in Telephone Conversations | C. Boulis and M. Ostendorf | Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 435-442 | June 2005 | Speech | [PDF]
|
| The Sequential GMM: A Gaussian Mixture Model Based Speaker Verification System that Captures Sequential Information | S. Stafford | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Speaker Recognition in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Learning Discriminant Narrow-Band Temporal Patterns for Automatic Recognition of Conversational Telephone Speech | B.Y. Chen | Ph.D. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation | P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58 | May 2005 | Speech | |
| Text Classification by Augmenting the Bag-of-Words Representation with Redundancy-Compensated Bigrams | C. Boulis and M. Ostendorf | Proceedings of the SIAM International Conference on Data Mining at the Workshop on Feature Selection in Data Mining (SIAM-FSDM 2005), Newport Beach, California | April 2005 | Speech | [PDF]
|
| Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]
|
| Tonotopic Multi-Layered Perceptron: A Neural Network for Learning | B. Y. Chen, Q. Zhu, and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 945-948 | March 2005 | Speech | [PDF]
|
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech | |
| Speaker Detection Without Models | D. Gillick, S. Stafford, and B. Peskin | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760 | March 2005 | Speech | [PDF]
|
| Structural Metadata Research in the EARS Program | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, and M. Harper | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 957-960 | March 2005 | Speech | [PDF]
|
| Dialog Act Tagging Using Graphical Models | G. Ji and J. Bilmes | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, Vol. 1, pp. 33-36 | March 2005 | Speech | [PDF]
|
| Clap Detection and Discrimination for Rhythm Therapy | N. Lesser and D.P.W. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 37-40 | March 2005 | Speech | [PDF]
|
| Structural Event Detection for Rich Transcription of Speech | Y. Liu | Ph.D. Thesis, Purdue University, West Lafayette, Indiana | December 19, 2004 | Speech | [PDF]
|
| Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System | C. Wooters, J. Fung, B. Peskin, and X. Anguera | Proceedings of Fall 2004 Rich Transcription Workshop (RT-04F), Nov. 2004 | November 2004 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| SmartKom English: From Robust Recognition to Felicitous Interaction | D. Gelbart, J. Bryant, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan | In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer | November 2004 | Speech | [PDF]
|
| From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System | N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Vocabulary and Language Model Adaptation Using Information Retrieval | B. Bigi, Y. Huang, and R. De Mori | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Combining Multiple Clustering Systems | C. Boulis and M. Ostendorf | Proceedings of the 15th European Conference on Machine Learning (ECML/PKDD 2004), Pisa, Italy | September 2004 | Speech | [PDF]
|
| Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies | M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg | Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL 04), Barcelona, Spain | July 2004 | Speech | [PDF]
|
| Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech | Y. Liu, A. Stolcke, E. Shriberg, and M. Harper | Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona | July 2004 | Speech | [PDF]
|