| Best Papers from the 10th IEEE International Symposium on Multimedia | G. Friedland and S.-C. Shen, eds. | International Journal on Semantic Computing (IJSC), World Scientific, Vol. 3, Issue 2 | June 2009 | Speech | |
| Selected Papers from the Third IEEE International Conference on Semantic Computing (ICSC2009) | G. Friedland and S. C. Shen, eds. | International Journal on Semantic Computing, Vol. 3, Issue 4 | December 2009 | Speech | |
| Selected Papers from the 11th IEEE International Symposium on Multimedia (ISM2009) | G. Friedland and M.-L. Shyu, eds. | International Journal on Semantic Computing, Vol. 4, No. 2 | November 2010 | Speech | |
| Using A Million Connections for Continuous Speech Recognition | N. Morgan | Invited paper for the International Conference on Neural Information Processing (ICONIP' 94), Seoul, South Korea, pp. 1439-1444 | October 1994 | Speech | |
| IXIR: A Statistical Information Distillation System | M. Levit, D. Hakkani-Tür, G. Tür, and D. Gillick | Journal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542 | October 2009 | Speech | [PDF]
|
| Cascaded Model Adaptation for Dialog Act Segmentation and Tagging | U. Guz, G. Tur, D. Hakkani-Tür, and S. Cuendet | Journal of Computer Speech and Language, Vol. 24, Issue 2, pp. 289-306 | April 2010 | Speech | |
| An Anticorrelation Kernel for Subsystem Training in Multiple Classifier Systems | L. Ferrer, K. Sönmez, and E. Shriberg | Journal of Machine Learning Research, Vol. 10, pp. 2079-2114 | September 2009 | Speech | [PDF]
|
| Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research | H. Hermansky and N. Morgan | Journal of Negative Results in Speech and Audio Sciences, Vol. 1, Issue 1 | 2004 | Speech | [PDF]
|
| A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives | S. Yaman and C.-H. Lee | Journal of Signal Processing Systems, MLSP special issue | November 2008 | Speech | [PDF]
|
| Syllable Intelligibility for Temporally-Filtered LPC Cepstral Trajectories | T. Arai, M. Pavel, H. Hermansky, and C. Avendano | Journal of the Acoustical Society of America, Vol. 105, No. 5, pp. 2783-2791 | May 1999 | Speech | [PDF]
|
| Robust Speaker Diarization for Meetings: ICSI TR06 Meetings Evaluation System | X. Anguera, C. Wooters, and J. Pardo | Lecture Notes in Computer Science, Volume 4299, 2006, pp. 346-358, ISSN 0302-9743 | 2006 | Speech | [PDF]
|
| Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles | E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet | Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics | 2009 | Speech | [PDF]
|
| Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms | M. Ferras Font | M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain | July 2005 | Speech | [PDF]
|
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Word-Level Confidence Estimation for Automatic Speech Recognition | A. Hatch | M.S. Thesis, University of California at Berkeley | August 2001 | Speech | [PDF]
|
| The Sequential GMM: A Gaussian Mixture Model Based Speaker Verification System that Captures Sequential Information | S. Stafford | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Speaker Recogntion in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag. | 2004 | Speech | [PDF]
|
| Applications of Keyword-Constraining in Speaker Recognition | H. Lei | MS Thesis, University of California-Berkeley | July 2007 | Speech | [PDF]
|
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Statistical Inference in Multilayer Perceptrons and Hidden Markov Models with Applications in Continuous Speech Recognition | H. Bourlard, N. Morgan, and C. Wellekens | Neuro Computing, Algorithms, Architectures and Applications, NATO ASI Series, Vol. F68, pp. 217-226 | 1990 | Speech | |
| Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System | A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | NIST ICASSP 2004 Meeting Recognition Workshop, Montreal | May 2004 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D Dissertation, University of California at Berkeley | August 1999 | Speech | [PDF]
|
| Perceptually-Inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments | B. Kingsbury | Ph.D Dissertation, University of California at Berkeley | December 1998 | Speech | [PDF]
|
| A Multi-Band Approach to Automatic Speech Recognition | N. Mirghafori | Ph.D Dissertation, University of California at Berkeley, December 1998. Also ICSI Technical Report, TR-99-004, January 1999 | December 1998 | Speech | [PDF]
|
| Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition | M. Shire | Ph.D Dissertation, University of California at Berkeley, Fall 2000 | 2000 | Speech | [PDF]
|
| Speech Recognition with Dynamic Bayesian Networks | G. Zweig | Ph.D Dissertation, University of California at Berkeley, Spring 1998 | 1998 | Speech | [PDF]
|
| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D Dissertation, University of Oldenberg, Germany | 2002 | Speech | |
| Structural Event Detection for Rich Transcription of Speech | Y. Liu | Ph.D Thesis, Purdue University, West Lafayette, Indiana | December 19 2004 | Speech | [PDF]
|
| Global Posterior Probability Estimates as Decision Confidence Measures in an Automatic Speech Recognition System | W. Warren | Ph.D. Dissertation, University of California at Berkeley | December 2000 | Speech | |
| Dynamic Pronunciation Models for Autmoatic Speech Recognition | E. Fosler-Lussier | Ph.D. Thesis, UC Berkeley, Fall 1999, ICSI Technical Report TR-99-015 | September 1999 | Speech | [PDF]
|
| Learning Discriminant Narrow-Band Temporal Patterns for Automatic Recognition of Conversational Telephone Speech | B.Y. Chen | Ph.D. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Speech recognition on vector architectures | A. Janin | Ph.D. Thesis, University of California at Berkeley | 2004 | Speech | [PDF]
|
| Natural Statistical Models for Automatic Speech Recognition | J. Bilmes | Ph.D. Thesis, University of California at Berkeley, Fall 1999. Also ICSI Technical Report TR-99-016 | October 1999 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu | Ph.D. Thesis, University of California at Berkeley, Spring 1998. Also ICSI Technical Report TR-98-014 | 1998 | Speech | [PDF]
|
| A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition | S. Chang | Ph.D. Thesis, University of California at Berkeley. Also ICSI Technical Report TR-02-007 | September 2002 | Speech | [PDF]
|
| Efficient Parsing of Syntactic and Semantic Dependency Structures | B. Bohnet | Presented at the 13th Conference on Computational Natural Language Learning (CoNLL-2009), Boulder, Colorado | June 2009 | Speech | [PDF]
|
| Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online Privacy | G. Friedland | Presented at the Language Technologies Institute Colloquium, Carnegie Mellon University, Pittsburgh, Pennsylvania | April 13 2012 | Speech | [PDF]
|
| The 2012 ICSI/Berkeley Video Location Estimation System | J. Choi, V. Ekambaram, G. Friedland, and K. Ramchandran | Presented at the MediaEval 2012 Workshop, Pisa, Italy | October 2012 | Speech | [PDF]
|
| From AUDREY to Siri: Is Speech Recognition A Solved Problem? | R. Pieraccini | Presented at the Mobile Voice Conference, San Francisco, California | March 2012 | Speech | [PDF]
|
| Detecting Categories in News Video Using Acoustic, Speech, and Image Features | S. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. Malik | Presented at the NIST TREC Video Retrieval Workshop, Gaithersburg, Maryland | November 2006 | Speech | [PDF]
|
| How to Build a Spoken Dialog System with Limited (or No) Resources | M. Plauché, O. Cetin, and N. Uhdaykumar | Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India | January 2007 | Speech | |
| Comparison of Grammar Based and Statistical Language Models Trained on the Same Data | B.A. Hockey and M. Rayner | Presented at the Workshop on Spoken Language Understanding at the 20th AIII National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania | July 2005 | Speech | |
| Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
|
| Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech | |
| Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis | N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli | Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696. | October 2008 | Speech | [PDF]
|
| A Multi-DSP Ring Array for Connectionist Simulations | J. Beck, N. Morgan, A. Allman, and J. Beer | Proceedings of 23rd Asilomar Conference on Signals, Systems & Computers | 1989 | Speech | |