| Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization | N. Mirghafori and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 1017-1020 | May 2006 | Speech | [PDF]
|
| Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap | O. Cetin and E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 357-360 | May 2006 | Speech | [PDF]
|
| Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, A. Stolcke, E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 581-584 | May 2006 | Speech | [PDF]
|
| A* Based Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 215-219 | November 2005 | Speech | [PDF]
|
| Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193 | July 2005 | Speech | [PDF]
|
| Speech Recognition for Illiterate Access to Information and Technology | M. Plauché, N. Udhyakumar, C. Wooters, J. Pal, and D. Ramachandran | Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06), Berkeley, California, pp. 83-92 | May 2006 | Speech | [PDF]
|
| Tamil Market: A spoken dialog system for rural India | M. Plauché and M. Prabaker | Working Papers in Computer-Human Interfaces | April 2006 | Speech | [PDF]
|
| The challenges of IT research in developing regions | E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauché, and S. Surana | IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23 | April 2006 | Speech | |
| Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site | O. Cetin and E. Shriberg | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224 | May 2006 | Speech | [PDF]
|
| Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech | |
| REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications | M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis | Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788 | May 2006 | Speech | [PDF]
|
| A Multilingual Shared Grammar for Recognition and Generation (in French) | P. Bouillon, M. Rayner, B. Novellas, Y. Nakao, M. Santaholma, M. Starlander, and N. Chatzichrisafis | Proceedings of the 13th Conference on Natural Language Processing (TALN 2006), Leuven, Belgium, pp. 93-102 | April 2006 | Speech | |
| Improving the Usability of MedSLT: Back-Translation and the Help System (in Japanese) | Y. Nakao, M. Rayner, N. Chatzichrisafis, K. Kanzaki, P. Bouillon, B.A. Hockey, and H. Isahara | Proceedings of the 12th Annual Meeting of the Japanese Society for Natural Language Processing (NLP2006), Tokyo, Japan | March 2006 | Speech | |
| Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of 9th International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic, pp. 629-636 | September 2006 | Speech | [PDF]
|
| A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation | P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58 | May 2005 | Speech | |
| Japanese Speech Understanding Using Grammar Specialization | M. Rayner, N. Chatzichrisafis, P. Bouillon, Y. Nakao, H. Isahara, K. Kanzaki, B.A. Hockey, M. Santaholma, and M. Starlander | Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, Canada, pp. 26-27 | October 2005 | Speech | |
| Reranking for Sentence Boundary Detection in Conversational Speech | B. Roark, Y. Liu, M. Harper, R. Stewart, M. Lease, M. Snover, Z. Shafran, B. Dorr, J. Hale, A. Krasnyanskaya, and L. Young | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 545-548 | May 2006 | Speech | |
| MLLR Transforms as Features in Speaker Recognition | A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428 | September 2005 | Speech | |
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | |
| On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017 | September 2006 | Speech | [PDF]
|
| Within-Class Covariance Normalization for SVM-Based Speaker Recognition | A. O. Hatch, S. Kajarekar, and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1471-1474 | September 2006 | Speech | [PDF]
|
| Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings | K. Boakye and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965 | September 2006 | Speech | [PDF]
|
| Friends and Enemies: A Novel Initialization for Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 689-692 | September 2006 | Speech | [PDF]
|
| Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system | X. Anguera, C. Wooters, and J. Pardo | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1674-1677 | September 2006 | Speech | [PDF]
|
| Multi-Stream Speaker Diarization Systems for the Meetings Domain | A. Gallardo-Antolin, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2186-2189 | September 2006 | Speech | [PDF]
|
| Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences | J. Pardo, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197 | September 2006 | Speech | [PDF]
|
| The ICSI+ Multi-Lingual Sentence Segmentation System | M. Zimmermann, D. Hakkani-Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 117-120 | September 2006 | Speech | |
| QASR: Question Answering Using Semantic Roles for Speech Interface | S. Stenchikova, D. Hakkani-Tur, and G. Tur | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188 | September 2006 | Speech | |
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J.M. Pardo, X. Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256 | May 2006 | Speech | [PDF]
|
| Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings | X. Anguera, M. Aguilo, C. Wooters, C. Nadeu, and J. Hernando | Proceedings of IEEE Odyssey: The Speaker and Language Recognition Workshop, San Juan, Puerto Rico, pp. 1-6 | June 2006 | Speech | [PDF]
|
| Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
|
| Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies | Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1526-1540 | September 2006 | Speech | [PDF]
|
| A Study in Machine Learning from Imbalanced Data for Sentence Boundary Detection in Speech | Y. Liu, N.V. Chawla, M.P. Harper, E. Shriberg, and A. Stolcke | Computer Speech and Language, Vol. 20, Issue 4, pp. 468-494 | October 2006 | Speech | [PDF]
|
| Let's DISCOH: Collecting an Annotated Open Corpus with Dialogue Acts and Reward Signals for Natural Language Helpdesks | G. Andreani, G. Di Fabbrizio, M. Gilbert, D. Gillick, D. Hakkani-Tur, and O. Lemon | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 218-221 | December 2006 | Speech | [PDF]
|
| Model Adaptation for Dialog Act Tagging | G. Tur, U. Guz, and D. Hakkani-Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 94-97 | December 2006 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Model Adaptation for Sentence Segmentation from Speech | S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105 | December 2006 | Speech | [PDF]
|
| Comparing Evaluation Metrics for Sentence Boundary Detection | Y. Liu and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Vol. 4, pp. 185-188, Honolulu, Hawaii | April 2007 | Speech | [PDF]
|
| Word-Conditioned Phone N-Grams for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256 | April 2007 | Speech | [PDF]
|
| Statistical Sentence Extraction for Information Distillation | D. Hakkani-Tur and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 1-4 | April 2007 | Speech | [PDF]
|
| Entropy Based Classifier Combination for Sentence Segmentation | M. Magimai-Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192 | April 2007 | Speech | [PDF]
|
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Hwang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling | O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648 | April 2007 | Speech | |
| Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction | P. Motlicek, V. Ullal, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268 | April 2007 | Speech | |
| Speech recognition on vector architectures | A. Janin | Ph.D. Thesis, University of California at Berkeley | 2004 | Speech | [PDF]
|
| The ICSI-SRI Spring 2006 Meeting Recognition System | A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng | In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer | 2006 | Speech | [PDF]
|
| How to Build a Spoken Dialog System with Limited (or No) Resources | M. Plauché, O. Cetin, and N. Udhyakumar | Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India | January 2007 | Speech | |