| Improving the Usability of MedSLT: Back-Translation and the Help System (in Japanese) | Y. Nakao, M. Rayner, N. Chatzichrisafis, K. Kanzaki, P. Bouillon, B.A. Hockey, and H. Isahara | Proceedings of the 12th Annual Meeting of the Japanese Society for Natural Language Processing (NLP2006), Tokyo, Japan | March 2006 | Speech | |
| Improved Overlapped Speech Handling for Speaker Diarization | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944 | August 2011 | Speech | |
| Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme Dynamics | K. Chang, H. Lei, and J. Canny | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88 | August 2011 | Speech | [PDF]
|
| Effective Arabic Dialect Classification Using Diverse Phonotactic Models | M. Akbacak, D. Vergyri, A. Stolcke, N. Scheffer, and A. Mandal | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 737-740 | August 2011 | Speech | [PDF]
|
| Java Visual Speech Components for Rapid Application Development of GUI based Speech Processing Applications | S. Steidl, K. Riedhammer, T. Bocklet, F. Hoenig, and E. Noeth | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 3257-3260 | August 2011 | Speech | |
| Data Selection with Kurtosis and Nasality features for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756 | August 2011 | Speech | [PDF]
|
| Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training | M. H. Sanchez, L. Ferrer, E. Shriberg, and A. Stolcke | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 141-144 | August 2011 | Speech | [PDF]
|
| Comparing Different Flavors of Spectro-Temporal Features for ASR | B. T. Meyer, S. V. Ravuri, M. R. Schaedler, and N. Morgan | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 1269-1272 | August 2011 | Speech | [PDF]
|
| Using Spectro-Temporal Features to Improve AFE Feature Extraction for ASR | S. Ravuri and N. Morgan | Proceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 1181-1184 | September 2010 | Speech | |
| Can Conversational Word Usage Be Used to Predict Speaker Demographics? | D. Gillick | Proceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan | September 2010 | Speech | [PDF]
|
| A Space-Time Theory of Pitch and Timbre Based on Cortical Expansion of the Cochlea Traveling Wave Delay | S. Greenberg, D. Poeppel, and T. Roberts | Proceedings of the 11th International Symposium on Hearing, Grantham, United Kingdom | August 1997 | Speech | [PDF]
|
| Multimodal Speaker Diarization Using Oriented Optical Flow Histograms | M. Knox and G. Friedland | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 290-293 | September 2010 | Speech | [PDF]
|
| A Hybrid Approach to Online Speaker Diarization | C. Vaquero, O. Vinyals, and G. Friedland | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2642-2645 | September 2010 | Speech | [PDF]
|
| System Output Combination for Improved Speaker Diarization | S. Bozonnet, N. Evans, X. Anguera, O. Vinyals, G. Friedland, and C. Fredouille | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2642-2645 | September 2010 | Speech | [PDF]
|
| A Comparative Large Scale Study of MLP Features for Mandarin ASR | F. Valente, M. Magimai Doss, C. Plahl, S. Ravuri, and W. Wang | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2630-2363 | September 2010 | Speech | [PDF]
|
| Discriminative Training for Hierarchical Clustering in Speaker Diarization | O. Vinyals, G. Friedland, and N. Morgan | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2326-2329 | September 2010 | Speech | [PDF]
|
| Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld Episodes | G. Friedland, L. Gottlieb, and A. Janin | Proceedings of the 11th IEEE International Symposium on Multimedia (ISM2009), San Diego, California, pp. 511-516 | December 2009 | Speech | [PDF]
|
| Any Questions? Automatic Question Detection in Meetings | K. Boakye, B. Favre, and D. Hakkani-Tür | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489 | December 2009 | Speech | [PDF]
|
| Robust Speaker Diarization for Short Speech Recordings | D. Imseng and G. Friedland | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437 | December 2009 | Speech | [PDF]
|
| Integrating Prosodic Features in Extractive Meeting Summarization | S. Xie, D. Hakkani-Tür, B. Favre, and Y. Liu | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391 | December 2009 | Speech | [PDF]
|
| Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891 | September 2009 | Speech | [PDF]
|
| Hill-Climbing Feature Selection for Multi-Stream ASR | D. Gelbart, N. Morgan, and A. Tsymbal | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970 | September 2009 | Speech | [PDF]
|
| Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR System | F. Valente, M. Magimai-Doss, C. Plahl, and S. Ravuri | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966 | September 2009 | Speech | [PDF]
|
| Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASR | S. Y. Zhao, S. Ravuri, and N. Morgan | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954 | September 2009 | Speech | [PDF]
|
| Modeling Other Talkers for Improved Dialog Act Recognition in Meetings | K. Laskowski and E. Shriberg | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786 | September 2009 | Speech | [PDF]
|
| Phrase and Word Level Strategies for Detecting Appositions in Speech | B. Favre and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2711-2714 | September 2009 | Speech | [PDF]
|
| Combining Semantic and Syntactic Information Sources for 5-W Question Answering | S. Yaman, D. Hakkani-Tür, and G. Tur | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2707-2710 | September 2009 | Speech | [PDF]
|
| Classification-Based Strategies for Combining Multiple 5-W Question Answering Systems | S. Yaman, D. Hakkani-Tür, G. Tur, R. Grishman, M. Harper, K. R. McKeown, A. Meyers, and K. Sharma | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2703-2706 | September 2009 | Speech | [PDF]
|
| Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326 | September 2009 | Speech | [PDF]
|
| A Human Benchmark for Language Recognition | R. Orr and D. A. Van Leeuwen | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2175-2178 | September 2009 | Speech | |
| Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification | M. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, and S. Kajarekar | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2015-2018 | September 2009 | Speech | |
| Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions? | E. Shriberg, S. Kajarekar, and N. Scheffer | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554 | September 2009 | Speech | [PDF]
|
| Leveraging Sentence Weights in a Concept-Based Optimization Framework for Extractive Meeting Summarization | S. Xie, B. Favre, D. Hakkani-Tür, and Y. Liu | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1503-1506 | September 2009 | Speech | [PDF]
|
| ClusterRank: A Graph Based Method for Meeting Summarization | N. Garg, B. Favre, K. Riedhammer, and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1499-1502 | September 2009 | Speech | [PDF]
|
| Combined Low Level and High Level Features for Out-of-Vocabulary Word Detection | B. Lecouteux, G. Linarès, and B. Favre | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1187-1190 | September 2009 | Speech | [PDF]
|
| A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy | O. Vinyals and G. Friedland | Proceedings of the 10th IEEE International Symposium on Multimedia, Berkeley, California, pp. 61-65 | December 2008 | Speech | [PDF]
|
| Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech | |
| A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation | P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58 | May 2005 | Speech | |
| The ICSI Summarization System at TAC 2008 | D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of Text Analysis Conference (TAC), Gaithersburg, Maryland | November 2008 | Speech | [PDF]
|
| Synchronous Parsing of Syntactic and Semantic Structures | B. Bohnet | Proceedings of Quatrième Conférence Internationale Sur La Théorie Sens-Texte (Fourth International Conference on Meaning-Text Theory, MTT’09), Montreal, Canada | June 2009 | Speech | [PDF]
|
| Comparisons of Recent Speaker Recognition Approaches Based on Word Conditioning | H. Lei and N. Mirghafori | Proceedings of Odyssey 2008, Stellenbosch, South Africa | January 2008 | Speech | [PDF]
|
| Anchored Speech Recognition for Question Answering | S. Yaman, G. Tür, D. Vergyri, D. Hakkani-Tür, M. Harper, and W. Wang | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 265-268 | June 2009 | Speech | [PDF]
|
| Efficient Parsing for Transducer Grammars | J. DeNero, M. Bansal, A. Pauls, and D. Klein | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 227-235. | May 2009 | Speech | [PDF]
|
| The Relation Between Stress Accent and Vocalic Identity in Spontaneous American English Discourse | S. Greenberg, S. Chang, and L. Hitchcock | Proceedings of ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | |
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection | Y. Liu, E. Shriberg, A. Stolcke, and M. Harper | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | 2004 | Speech | [PDF]
|
| From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System | N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Vocabulary and Language Model Adaptation Using Information Retrieval | B. Bigi, Y. Huang, and R. De Mori | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan. and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|