| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based Clustering | K. Lee, D. Ellis, and A. Loui | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010 | March 2010 | Speech | [PDF]
|
| Detecting Music in Ambient Audio by Long-Window Autocorrelation | K. Lee and D. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12 | April 2008 | Speech | [PDF]
|
| Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
|
| Modeling Other Talkers for Improved Dialog Act Recognition in Meetings | K. Laskowski and E. Shriberg | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786 | September 2009 | Speech | [PDF]
|
| Comparing the Contributions of Context and Prosody in Text-Independent Dialog Act Recognition | K. Laskowski and E. Shriberg | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5374-5377 | March 2010 | Speech | [PDF]
|
| Statistical Acoustic Indications of Coarticulation | K. Kirchoff and J. Bilmes | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1729-1732 | August 1999 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| A Low-Cost Mobile Pointing and Drawing Device | K. Jantz, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 121-122 | September 2007 | Speech | |
| Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme Dynamics | K. Chang, H. Lei, and J. Canny | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88 | August 2011 | Speech | [PDF]
|
| Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech Authors | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35 | September 2008 | Speech | |
| Improved Overlapped Speech Handling for Speaker Diarization | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944 | August 2011 | Speech | |
| Any Questions? Automatic Question Detection in Meetings | K. Boakye, B. Favre, and D. Hakkani-Tür | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489 | December 2009 | Speech | [PDF]
|
| Text-Constrained Speaker Recognition on a Text-Independent Task | K. Boakye and B. Peskin | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain | May 2004 | Speech | [PDF]
|
| Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings | K. Boakye and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965 | September 2006 | Speech | [PDF]
|
| Speaker Recogntion in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| A View of the Parallel Computing Landscape | K. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick | Communications of the ACM, Vol. 52, No. 10, pp. 56-67 | October 2009 | Speech | [PDF]
|
| Training Neural Networks with SPERT-II | K. Asanovic, J. Beck, D. Johnson, B. Kingsbury, N. Morgan, and J. Wawrzynek | Chapter in Parallel Architectures for Artificial Networks - Paradigms and Implementations, eds. N. Sundararajan and P. Saratchandran, IEEE Computer Society Press, pp. 345-364 | 1998 | Speech | |
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J.M. Pardo, X Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| Using Acoustic Condition Clustering to Improve Acoustic Change Detection on Broadcast News | J.F. Lopez and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China, Vol. 4, pp. 568-571 | October 2000 | Speech | [PDF]
|
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains | J. Zheng and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1573-1576 | August 2007 | Speech | [PDF]
|
| SPERT-II: A Vector Microprocessor System and Its Application to Large Problems in Backpropagation Training | J. Wawrzynek, K. Asanovic, B. Kingsbury, J. Beck, D. Johnson, and N. Morgan | Proceedings of the Advances in Neural Information Processing Systems 8 Conference (NIPS 8), Denver, Colorado, pp. 619-625. Also in IEEE Computer, Vol. 29, No. 3, pp 79-86, March 1996. | November 1995 | Speech | |
| Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences | J. Pardo, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197 | September 2006 | Speech | [PDF]
|
| The Effects of Speech Recognition and Punctuation on Information Extraction Performance | J. Makhoul, A. Baron, I. Bulyko, L. Nguyen, L. Ramshaw, D. Stallard, R. Schwartz, and B. Xiang | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 57-60 | September 2005 | Speech | |
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|
| Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech | |
| Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1621-1624 | August 2007 | Speech | [PDF]
|
| Genre Effects on Automatic Sentee Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversationsnc | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704 | April 2009 | Speech | [PDF]
|
| Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of Speech | J. Kolar, Y. Liu, and E. Shriberg | Speech Communication, Vol. 52, Issue 3, pp. 236-245 | March 2010 | Speech | |
| Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of 9th International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic, pp. 629-636 | September 2006 | Speech | [PDF]
|
| On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017 | September 2006 | Speech | [PDF]
|
| Integrating RASTA-PLP into Speech Recognition | J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch, and G. Tong | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, pp. I-421-424 | 1994 | Speech | |
| How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]
|
| Prosodic Features and Feature Selection for Multi-lingual Sentence Segmentation | J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2585-2588 | August 2007 | Speech | [PDF]
|
| Chapter 17: The Transcription of Discourse | J. Edwards | The Handbook of Discourse Analysis, D. Shriffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348 | 2001 | Speech | |
| Efficient Parsing for Transducer Grammars | J. DeNero, M. Bansal, A. Pauls, and D. Klein | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 227-235. | May 2009 | Speech | [PDF]
|
| Fast Consensus Decoding over Translation Forests | J. DeNero, D. Chiang, and K. Knight | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Asynchronous Binarization for Synchronous Grammars | J. DeNero, A. Pauls, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Sampling Alignment Structure Under a Bayesian Translation Model | J. DeNero, A. Bouchard-Côté, and D. Klein | Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323 | October 2008 | Speech | [PDF]
|
| Opportunities and Challenges of Parallelizing Speech Recognition | J. Chong, G. Friedland, A. Janin, and N. Morgan | Proceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, California | June 2010 | Speech | [PDF]
|
| The 2012 ICSI/Berkeley Video Location Estimation System | J. Choi, V. Ekambaram, G. Friedland, and K. Ramchandran | Presented at the MediaEval 2012 Workshop, Pisa, Italy | October 2012 | Speech | [PDF]
|
| The 2011 ICSI Video Location Estimation System | J. Choi, H. Lei, and G. Friedland | Proceedings of the MediaEval 2011 Workshop, Pisa, Italy | September 2011 | Speech | [PDF]
|
| Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]
|
| The 2010 ICSI Video Location Estimation System | J. Choi, A. Janin, and G. Friedland | Proceedings of the MediaEval 2010 Workshop, Pisa Italy | October 2010 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|