| Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of Speech | J. Kolar, Y. Liu, and E. Shriberg | Speech Communication, Vol. 52, Issue 3, pp. 236-245 | March 2010 | Speech | |
| Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech | |
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|
| The Effects of Speech Recognition and Punctuation on Information Extraction Performance | J. Makhoul, A. Baron, I. Bulyko, L. Nguyen, L. Ramshaw, D. Stallard, R. Schwartz, and B. Xiang | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 57-60 | September 2005 | Speech | |
| Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences | J. Pardo, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197 | September 2006 | Speech | [PDF]
|
| SPERT-II: A Vector Microprocessor System and Its Application to Large Problems in Backpropagation Training | J. Wawrzynek, K. Asanovic, B. Kingsbury, J. Beck, D. Johnson, and N. Morgan | Proceedings of the Advances in Neural Information Processing Systems 8 Conference (NIPS 8), Denver, Colorado, pp. 619-625. Also in IEEE Computer, Vol. 29, No. 3, pp. 79-86, March 1996. | November 1995 | Speech | |
| fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains | J. Zheng and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1573-1576 | August 2007 | Speech | [PDF]
|
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Using Acoustic Condition Clustering to Improve Acoustic Change Detection on Broadcast News | J.F. Lopez and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, Vol. 4, pp. 568-571 | October 2000 | Speech | [PDF]
|
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J.M. Pardo, X. Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Training Neural Networks with SPERT-II | K. Asanovic, J. Beck, D. Johnson, B. Kingsbury, N. Morgan, and J. Wawrzynek | Chapter in Parallel Architectures for Artificial Neural Networks - Paradigms and Implementations, eds. N. Sundararajan and P. Saratchandran, IEEE Computer Society Press, pp. 345-364 | 1998 | Speech | |
| A View of the Parallel Computing Landscape | K. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick | Communications of the ACM, Vol. 52, No. 10, pp. 56-67 | October 2009 | Speech | [PDF]
|
| Speaker Recognition in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings | K. Boakye and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965 | September 2006 | Speech | [PDF]
|
| Text-Constrained Speaker Recognition on a Text-Independent Task | K. Boakye and B. Peskin | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain | May 2004 | Speech | [PDF]
|
| Any Questions? Automatic Question Detection in Meetings | K. Boakye, B. Favre, and D. Hakkani-Tür | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489 | December 2009 | Speech | [PDF]
|
| Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35 | September 2008 | Speech | |
| Improved Overlapped Speech Handling for Speaker Diarization | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944 | August 2011 | Speech | |
| Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme Dynamics | K. Chang, H. Lei, and J. Canny | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88 | August 2011 | Speech | [PDF]
|
| A Low-Cost Mobile Pointing and Drawing Device | K. Jantz, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 121-122 | September 2007 | Speech | |
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949-4952 | April 2008 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| Statistical Acoustic Indications of Coarticulation | K. Kirchhoff and J. Bilmes | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1729-1732 | August 1999 | Speech | [PDF]
|
| Modeling Other Talkers for Improved Dialog Act Recognition in Meetings | K. Laskowski and E. Shriberg | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786 | September 2009 | Speech | [PDF]
|
| Comparing the Contributions of Context and Prosody in Text-Independent Dialog Act Recognition | K. Laskowski and E. Shriberg | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5374-5377 | March 2010 | Speech | [PDF]
|
| Detecting Music in Ambient Audio by Long-Window Autocorrelation | K. Lee and D. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12 | April 2008 | Speech | [PDF]
|
| Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
|
| Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based Clustering | K. Lee, D. Ellis, and A. Loui | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010 | March 2010 | Speech | [PDF]
|
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task | K. Parton, K. R. McKeown, R. Coyne, M. T. Diab, R. Grishman, D. Hakkani-Tür, M. Harper, H. Ji, W. Y. Ma, A. Meyers, S. Stolbach, A. Sun, G. Tur, W. Xu, and S. Yaman | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 423-431 | August 2009 | Speech | [PDF]
|
| A Keyphrase Based Approach to Interactive Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 153-156 | December 2008 | Speech | [PDF]
|
| Long Story Short - Global Unsupervised Models for Keyphrase Based Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Speech Communication, Vol. 52, Issue 10, pp. 801-815. DOI:10.1016/j.specom.2010.06.002 | October 2010 | Speech | |
| Packing the Meeting Summarization Knapsack | K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437 | September 2008 | Speech | [PDF]
|
| Modeling Dynamic Prosodic Variation for Speaker Verification | K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, Vol. 7, p. 3189 | November 1998 | Speech | |
| Consonant Discrimination in Elicited and Spontaneous Speech: A Case for Signal-Adaptive Front Ends in ASR | K. Sönmez, M. Plauché, E. Shriberg, and H. Franco | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings | K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356 | April 2008 | Speech | [PDF]
|
| Speech Intelligibility Derived From Asynchronous Processing of Auditory-Visual Information | K.W. Grant and S. Greenberg | Proceedings of the International Conference on Auditory-Visual Speech Processing Workshop (AVSP 2001), Scheelsminde, Denmark | September 2001 | Speech | [PDF]
|
| Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech | |
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| Far-Field ASR on Inexpensive Microphones | L. Docio, D. Gelbart, and N. Morgan | Proceedings of Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2141-2144 | September 2003 | Speech | [PDF]
|
| Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition | L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 233-236 | April 2007 | Speech | [PDF]
|
| A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification | L. Ferrer, K. Sonmez, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 738-741 | August 2007 | Speech | [PDF]
|
| An Anticorrelation Kernel for Subsystem Training in Multiple Classifier Systems | L. Ferrer, K. Sönmez, and E. Shriberg | Journal of Machine Learning Research, Vol. 10, pp. 2079-2114 | September 2009 | Speech | [PDF]
|
| System Combination Using Auxiliary Information for Speaker Verification | L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4853-4856 | April 2008 | Speech | [PDF]
|
| A Comparison of Approaches for Modeling Prosodic Features in Speaker Recognition | L. Ferrer, N. Scheffer, and E. Shriberg | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4414-4417 | March 2010 | Speech | [PDF]
|
| On the Use of Artificial Conversation Data for Speaker Recognition in Cars | L. Gottlieb and G. Friedland | Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128 | September 2009 | Speech | [PDF]
|
| Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-Location | L. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. Friedland | Proceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, Japan | October 2012 | Speech | [PDF]
|