| A Robust Speaker Clustering Algorithm | J. Ajmera and C. Wooters | Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands | December 2003 | Speech | [PDF]
|
| The Relationship Between Dialogue Acts and Hot Spots in Meetings | B. Wrede and E. Shriberg | Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands | December 2003 | Speech | [PDF]
|
| Speech Recognition Technology | H. Franco, F. Beaufays, N. Morgan, and H. Bourlard | Chapter in Handbook of Brain Theory and Neural Networks, 2nd edition, M. Arbib ed. MIT Press | 2004 | Speech | |
| Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research | H. Hermansky and N. Morgan | Journal of Negative Results in Speech and Audio Sciences, Vol. 1, Issue 1 | 2004 | Speech | [PDF]
|
| Speech Recognition and the Auditory Perspective | N. Morgan, H. Bourlard, and H. Hermansky | Chapter in Speech Processing in the Auditory System, S. Greenberg and W. Ainsworth, eds, Springer | 2004 | Speech | |
| Scaling Up: Learning Large-Scale Recognition Methods from Small-Scale Recognition Tasks | N. Morgan, B. Chen, Q. Zhu, and A. Stolcke | ICSI Technical Report tr-03-02. Also Special Workshop in Maui (SWIM) paper 218. | 2004 | Speech | [PDF]
|
| Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech | |
| Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection | Y. Liu, E. Shriberg, A. Stolcke, and M. Harper | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | 2004 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag. | 2004 | Speech | [PDF]
|
| Speech recognition on vector architectures | A. Janin | Ph.D. Thesis, University of California at Berkeley | 2004 | Speech | [PDF]
|
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| The ICSI/SRI/UW RT04 Structural Metadata Extraction System | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, and M. Harper | RT-04 EARS Workshop | January 2004 | Speech | |
| Meeting Recorder Project: Dialog Act Labeling Guide | R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg | ICSI Technical Report TR-04-002 | February 2004 | Speech | [PDF]
|
| Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing | E. Shriberg and A. Stolcke | Proceedings of the International Conference on Speech Prosody, Nara, Japan, March 2004. | March 2004 | Speech | [PDF]
|
| Improving Automatic Sentence Boundary Detection with Confusion Networks | D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg | Proceedings of HLT-NAACL Conference, Boston | April 2004 | Speech | [PDF]
|
| The ICSI Meeting Recorder Dialog Act (MRDA) Corpus | E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts | April 2004 | Speech | [PDF]
|
| Text-Constrained Speaker Recognition on a Text-Independent Task | K. Boakye and B. Peskin | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain | May 2004 | Speech | [PDF]
|
| Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification System | M. Hebert and N. Mirghafori | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| The ICSI Meeting Project: Resources and Research | A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede | Proceedings of the ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada | May 2004 | Speech | [PDF]
|
| Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System | N. Mirghafori and M. Hebert | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| TRAPping Conversational Speech: Extending TRAP/Tandem Approaches to Conversational Telephone Speech Recognition | N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Detection and Compensation of Sensor Malfunction in Time Delay Based Direction of Arrival Estimation | T. Pirinen, J. Yli-Hietanen, P. Pertilä, and A. Visa | Proceedings of IEEE ISCAS, Vancouver | May 2004 | Speech | [PDF]
|
| Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System | A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | NIST ICASSP 2004 Meeting Recognition Workshop, Montreal | May 2004 | Speech | [PDF]
|
| The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language Researchers | J. A. Edwards | Proceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11 | May 2004 | Speech | [PDF]
|
| Modeling NERFs for Speaker Recognition | S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain, pp. 51-56 | May 2004 | Speech | [PDF]
|
| Multi-Speaker Language Modeling | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts, pp. 133-136 | May 2004 | Speech | [PDF]
|
| Adaptive Language Modeling with Varied Sources to Cover New Vocabulary Items | S. Schwarm, I. Bulyko, and M. Ostendorf | IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, pp. 334-342 | May 2004 | Speech | [PDF]
|
| Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol.5, pp. 641-644 | May 2004 | Speech | [PDF]
|
| The 2004 ICSI-SRI-UW Meeting Recognition System | C. Wooters, N. Mirghafori, A. Stolcke, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of the Joint AMI/PASCAL/IM2/IM4 Workshop on Multimodal and Related Machine Learning Algorithms (MLMI '04), Martigny, Switzerland, pp. 196-208 | June 2004 | Speech | [PDF]
|
| Long-Term Temporal Features for Conversational Speech Recognition | B. Chen, Q. Zhu, and N. Morgan | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Tandem Connectionist Feature Extraction for Conversational Speech Recognition | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies | M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg | Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL 04), Barcelona, Spain | July 2004 | Speech | [PDF]
|
| Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech | Y. Liu, A. Stolcke, E. Shriberg, and M. Harper | Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona | July 2004 | Speech | [PDF]
|
| Time Delay Based Failure-Robust Direction of Arrival Estimation | T. Pirinen and J. Yli-Hietanen | Proceedings of IEEE SAM 2004, Sitges, Barcelona, Spain. | July 2004 | Speech | [PDF]
|
| Combining Multiple Clustering Systems | C. Boulis and M. Ostendorf | Proceedings of the 15th European Conference on Machine Learning (ECML/PKDD 2004), Pisa, Italy | September 2004 | Speech | [PDF]
|
| From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System | N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Vocabulary and Language Model Adaptation Using Information Retrieval | B. Bigi, Y. Huang, and R. De Mori | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System | C. Wooters, J. Fung, B. Peskin, and X. Anguera | Proceedings of Fall 2004 Rich Transcription Workshop (RT-04F), Nov. 2004 | November 2004 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| SmartKom English: From Robust Recognition to Felicitous Interaction | D. Gelbart, J. Bryant, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan | In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer | November 2004 | Speech | [PDF]
|
| Structural Event Detection for Rich Transcription of Speech | Y. Liu | Ph.D. Thesis, Purdue University, West Lafayette, Indiana | December 19, 2004 | Speech | [PDF]
|
| Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]
|
| Tonotopic Multi-Layered Perceptron: A Neural Network for Learning | B. Y. Chen, Q. Zhu, and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 945-948 | March 2005 | Speech | [PDF]
|
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech | |
| Speaker Detection Without Models | D. Gillick, S. Stafford, and B. Peskin | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760 | March 2005 | Speech | [PDF]
|
| Structural Metadata Research in the EARS Program | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, and M. Harper | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 957-960 | March 2005 | Speech | [PDF]
|