| Structural Event Detection for Rich Transcription of Speech | Y. Liu | Ph.D Thesis, Purdue University, West Lafayette, Indiana | December 19 2004 | Speech | [PDF]
|
| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D Dissertation, University of Oldenberg, Germany | 2002 | Speech | |
| Speech Recognition with Dynamic Bayesian Networks | G. Zweig | Ph.D Dissertation, University of California at Berkeley, Spring 1998 | 1998 | Speech | [PDF]
|
| Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition | M. Shire | Ph.D Dissertation, University of California at Berkeley, Fall 2000 | 2000 | Speech | [PDF]
|
| A Multi-Band Approach to Automatic Speech Recognition | N. Mirghafori | Ph.D Dissertation, University of California at Berkeley, December 1998. Also ICSI Technical Report, TR-99-004, January 1999 | December 1998 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D Dissertation, University of California at Berkeley | August 1999 | Speech | [PDF]
|
| Perceptually-Inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments | B. Kingsbury | Ph.D Dissertation, University of California at Berkeley | December 1998 | Speech | [PDF]
|
| Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System | A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | NIST ICASSP 2004 Meeting Recognition Workshop, Montreal | May 2004 | Speech | [PDF]
|
| Statistical Inference in Multilayer Perceptrons and Hidden Markov Models with Applications in Continuous Speech Recognition | H. Bourlard, N. Morgan, and C. Wellekens | Neuro Computing, Algorithms, Architectures and Applications, NATO ASI Series, Vol. F68, pp. 217-226 | 1990 | Speech | |
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Applications of Keyword-Constraining in Speaker Recognition | H. Lei | MS Thesis, University of California-Berkeley | July 2007 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag. | 2004 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Word-Level Confidence Estimation for Automatic Speech Recognition | A. Hatch | M.S. Thesis, University of California at Berkeley | August 2001 | Speech | [PDF]
|
| The Sequential GMM: A Gaussian Mixture Model Based Speaker Verification System that Captures Sequential Information | S. Stafford | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Speaker Recogntion in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms | M. Ferras Font | M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain | July 2005 | Speech | [PDF]
|
| Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles | E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet | Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics | 2009 | Speech | [PDF]
|
| Robust Speaker Diarization for Meetings: ICSI TR06 Meetings Evaluation System | X. Anguera, C. Wooters, and J. Pardo | Lecture Notes in Computer Science, Volume 4299, 2006, pp. 346-358, ISSN 0302-9743 | 2006 | Speech | [PDF]
|
| Syllable Intelligibility for Temporally-Filtered LPC Cepstral Trajectories | T. Arai, M. Pavel, H. Hermansky, and C. Avendano | Journal of the Acoustical Society of America, Vol. 105, No. 5, pp. 2783-2791 | May 1999 | Speech | [PDF]
|
| A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives | S. Yaman and C.-H. Lee | Journal of Signal Processing Systems, MLSP special issue | November 2008 | Speech | [PDF]
|
| Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research | H. Hermansky and N. Morgan | Journal of Negative Results in Speech and Audio Sciences, Vol. 1, Issue 1 | 2004 | Speech | [PDF]
|
| An Anticorrelation Kernel for Subsystem Training in Multiple Classifier Systems | L. Ferrer, K. Sönmez, and E. Shriberg | Journal of Machine Learning Research, Vol. 10, pp. 2079-2114 | September 2009 | Speech | [PDF]
|
| Cascaded Model Adaptation for Dialog Act Segmentation and Tagging | U. Guz, G. Tur, D. Hakkani-Tür, and S. Cuendet | Journal of Computer Speech and Language, Vol. 24, Issue 2, pp. 289-306 | April 2010 | Speech | |
| IXIR: A Statistical Information Distillation System | M. Levit, D. Hakkani-Tür, G. Tür, and D. Gillick | Journal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542 | October 2009 | Speech | [PDF]
|
| Using A Million Connections for Continuous Speech Recognition | N. Morgan | Invited paper for the International Conference on Neural Information Processing (ICONIP' 94), Seoul, South Korea, pp. 1439-1444 | October 1994 | Speech | |
| Selected Papers from the 11th IEEE International Symposium on Multimedia (ISM2009) | G. Friedland and M.-L. Shyu, eds. | International Journal on Semantic Computing, Vol. 4, No. 2 | November 2010 | Speech | |
| Selected Papers from the Third IEEE International Conference on Semantic Computing (ICSC2009) | G. Friedland and S. C. Shen, eds. | International Journal on Semantic Computing, Vol. 3, Issue 4 | December 2009 | Speech | |
| Best Papers from the 10th IEEE International Symposium on Multimedia | G. Friedland and S.-C. Shen, eds. | International Journal on Semantic Computing (IJSC), World Scientific, Vol. 3, Issue 2 | June 2009 | Speech | |
| Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC) | G. Friedland and C. Martell, eds. | International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3 | September 2008 | Speech | |
| Object Cut and Paste in Images and Videos | G. Friedland, K. Jantz, T. Lenz, F. Wiesel, and R. Rojas | International Journal of Semantic Computing, World Scientific, Vol. 1, Issue 2, pp. 221-247, USA | July 2007 | Speech | |
| Semantic Computing and Privacy: A Case Study Using Inferred Geo-Tagging | G. Friedland and J. Choi | International Journal of Semantic Computing, Vol. 5, No. 1, pp. 79-93. Also Best Poster in the Electrical and Computer Science and Engineering Track at the Korean Student Technical and Leadership Conference, Chicago, Illinois, March 2012. DOI: 10.1142/S1793351X11001171 | March 2011 | Speech | [PDF]
|
| Features Based on Auditory Physiology and Perception | R. M. Stern and N. Morgan | In Techniques for Noise Robustness in Automatic Speech Recognition, T. Virtanen, B. Raj, and R. Singh, Wiley Publishing | 2012 | Speech | |
| Syllable Models for Mandarin Speech Recognition: Exploiting Character Language Models | X. Liu, J. L. Hieronymus, M. J. F. Gales, and P. C. Woodland | In submission | 2012 | Speech | |
| Speaker Diarization | G. Friedland | In Speech and Audio Signal Processing, 2nd edition, B. Gold, N. Morgan, D. Ellis, eds., Wiley | 2011 | Speech | |
| SmartKom English: From Robust Recognition to Felicitous Interaction | D. Gelbart, J. Bryants, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan | In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer | November 2004 | Speech | [PDF]
|
| Speaker Recognition and Diarization | G. Friedland and D. van Leeuwen | In Semantic Computing, P. Sheu, H. Yu, C. V. Ramamamoorthy, A. K. Joshi, and L. A. Zadeh, eds., pp. 115-130, IEEE Press/Wiley | 2010 | Speech | |
| The ICSI-SRI Spring 2006 Meeting Recognition System | A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng | In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer | 2006 | Speech | [PDF]
|
| The Grammar of Hitting and Breaking | C. J. Fillmore | In Readings in English Transformational Grammar, R. Jacobs and P. Rosenbaum, eds., pp. 120-133, Georgetown University Press. | June 1970 | Speech | [PDF]
|
| Speaker Diarization | G. Friedland and F. Valente | In Multimodal Signal Processing: Human Interactions in Meetings, S. Reynals, H. Bourlard, J. Carletta, and A. Popescu-Belis, eds., Cambridge University Press | June 2012 | Speech | |
| Computationally Efficient Clustering of Audio-Visual Meeting Data | H. Hung, G. Friedland, and C. Yeo | In Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etho, J. Luo, and L. Shao, eds., pp. 25-59 | 2010 | Speech | |
| Term-Weighting for Summarization of Multi-Party Spoken Dialogues | G. Murray and S. Renals | In Machine Learning for Multimodal Interaction IV (Lecture Notes in Computer Science, Vol. 4892), pp. 155-166, Springer | 2007 | Speech | |
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679 | June 2007 | Speech | |
| Generative and Discriminative Methods Using Morphological Information for Sentence Segmentation of Turkish | U. Guz, B. Favre, D. Hakkani-Tur, and G. Tur | IEEE Transactions on Speech, Audio and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 895-903 | July 2009 | Speech | [PDF]
|
| The Challenge of Spoken Language Systems: Research Directions for the Nineties | R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, J. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson, K. McKeown, N. Morgan, D. Novick, M. Ostendorf, S. Oviatt, P. Price, H. Silverman, J. Spitz, A. Waibel, C. Weinstein, S. Zahorian, and V. Zue | IEEE Transactions on Speech and Audio Processing, Vol. 3, No. 1, pp. 1-21 | January 1995 | Speech | |
| Automatic Speech Recognition with an Adaptation Model Motivated by Auditory Processing | M. Holmberg, D. Gelbart, and W. Hemmert | IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 1, pp. 44-49 | January 2006 | Speech | [PDF]
|
| Adaptive Language Modeling with Varied Sources to Cover New Vocabulary Items | S. Schwarm, I. Bulyko, and M. Ostendorf | IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, pp. 334-342 | May 2004 | Speech | [PDF]
|