| Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues | D. Baron, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado, pp. 949-952 | September 2002 | Speech | [PDF]
|
| Automatic Phonetic Transcription of Spontaneous Speech American English | S. Chang, L. Shastri, and S. Greenberg | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Automatic Laughter Detection Using Neural Networks | M. Knox and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2973-2976 | August 2007 | Speech | [PDF]
|
| Automatic Labeling of Semantic Roles | D. Gildea and D. Jurafsky | The 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong, pp. 512-520 | October 2000 | Speech | [PDF]
|
| Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech | S. Cuendet, D. Hakkani-Tur, and E. Shriberg | Proceedings of Fourth International Conference on Machine Learning and Multimodal Interaction, Brno, Czech Republic, pp. 144-155 | June 2007 | Speech | [PDF]
|
| Automatic Disfluency Identification in Conversational Speech Using Multiple Knowledge Sources | Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]
|
| Automatic Data Selection for MLP-Based Feature Extraction for ASR | C. Pelaez-Moreno, Q. Zhu, B. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 229-232 | September 2005 | Speech | [PDF]
|
| Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256 | May 2006 | Speech | [PDF]
|
| Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech | |
| Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech | |
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
|
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Audio Information Access from Meeting Rooms | S. Renals and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Asynchronous Binarization for Synchronous Grammars | J. DeNero, A. Pauls, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot Interaction | A. Batliner, S. Steidl, and E. Nöth | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27 | May 2011 | Speech | [PDF]
|
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Appscio: A Software Environment for Semantic Multimedia Analysis | G. Friedland, E. Hensley, J. Schumacher, and R. Jain | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459 | August 2008 | Speech | [PDF]
|
| Applications of Keyword-Constraining in Speaker Recognition | H. Lei | MS Thesis, University of California-Berkeley | July 2007 | Speech | [PDF]
|
| Any Questions? Automatic Question Detection in Meetings | K. Boakye, B. Favre, and D. Hakkani-Tür | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489 | December 2009 | Speech | [PDF]
|
| Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
|
| Anchored Speech Recognition for Question Answering | S. Yaman, G. Tür, D. Vergyri, D. Hakkani-Tür, M. Harper, and W. Wang | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 265-268 | June 2009 | Speech | [PDF]
|
| Analytics for Experts | G. Friedland | Featured paper in ACM SIGMM Records, Vol. 1, Issue 1 | March 2009 | Speech | [PDF]
|
| An Overview of the SPRACH System for the Transcription of Broadcast News | G. Cook, J. Christie, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson, and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| An Introduction to the Diagnostic Evaluation of the Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg, S. Chang, and J. Hollenback | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition | N. Morgan and H. Bourlard | IEEE Signal Processing Magazine, pp. 25-42 | May 1995 | Speech | [PDF]
|
| An Improved Approximation Algorithm for Vertex Cover with Hard Capacities | R. Gandhi, E. Halperin, S. Khuller, G. Kortsarz, and A. Srinivasan | Proceedings of the 30th International Colloquium on Automata, Languages and Programming (ICALP 2003), Eindhoven, The Netherlands, pp. 164-175 | June 2003 | Speech | [PDF]
|
| An Elitist Approach to Articulatory-Acoustic Feature Classification | S. Chang, S. Greenberg, and M. Wester | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling | O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648 | April 2007 | Speech | |
| An Anticorrelation Kernel for Subsystem Training in Multiple Classifier Systems | L. Ferrer, K. Sönmez, and E. Shriberg | Journal of Machine Learning Research, Vol. 10, pp. 2079-2114 | September 2009 | Speech | [PDF]
|
| An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings | S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür | Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59 | July 2007 | Speech | |
| An Adaptive Initialization Method for Speaker Diarization Based on Prosodic Features | D. Imseng and G. Friedland | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4946-4949 | March 2010 | Speech | [PDF]
|
| Adaptive Language Modeling with Varied Sources to Cover New Vocabulary Items | S. Schwarm, I. Bulyko, and M. Ostendorf | IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, pp. 334-342 | May 2004 | Speech | [PDF]
|
| Acoustic Super Models for Large Scale Video Event Detection | R. Mertens, H. Lei, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the ACM International Workshop on Events in Multimedia (EiMM11), Scottsdale, Arizona | November 2011 | Speech | [PDF]
|
| Acoustic Sub-word Models in the Berkeley Restaurant Project | C. Wooters and N. Morgan | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 1551-1554 | 1992 | Speech | |
| Acoustic Front-End Optimization for Bird Species Recognition | M. Graciarena, M. Delplanche, E. Shriberg, A. Stolcke, and L. Ferrer | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 293-296 | March 2010 | Speech | [PDF]
|
| Acoustic Beamforming for Speaker Diarization of Meetings | X. Anguera, C. Wooters, and J. Hernando | IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 2011-2022 | September 2007 | Speech | |
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| A* Based Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 215-219 | November 2005 | Speech | [PDF]
|
| A Voice-Enabled Procedure Browser for the International Space Station | M. Rayner, B.A. Hockey, N. Chatzichrisafis, K. Farrell, and J.M. Renders | Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 29-32 (interactive poster and demo track) | June 2005 | Speech | |
| A View of the Parallel Computing Landscape | K. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick | Communications of the ACM, Vol. 52, No. 10, pp. 56-67 | October 2009 | Speech | [PDF]
|
| A Training Algorithm for Statistical Sequence Recognition with Applications to Transition-Based Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | IEEE Signal Processing Letters, pp. 203-205 | July 1996 | Speech | |
| A Text-constrained Prosodic System for Speaker Verification | E. Shriberg and L. Ferrer | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1226-1229 | August 2007 | Speech | [PDF]
|
| A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition | S. Chang | Ph.D. Thesis, University of California at Berkeley. Also ICSI Technical Report TR-02-007 | September 2002 | Speech | [PDF]
|
| A Study of Two Dimensional Linear Descriminants For ASR | S. Kajarekar, B. Yegnanarayana, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |