| Tamil Market: A spoken dialog system for rural India | M. Plauché and M. Prabaker | Working Papers in Computer-Human Interfaces | April 2006 | Speech | [PDF]
|
| Speech and Audio Signal Processing | B. Gold and N. Morgan | Wiley Press, New York | 1999 | Speech | |
| Speech and Audio Signal Processing: Processing and Perception of Speech and Music, 2nd Edition | B. Gold, N. Morgan, and D. Ellis | Wiley | November 2011 | Speech | |
| Finding Difficult Speakers in Automatic Speaker Recognition | L. Stoll | UC Berkeley PhD thesis, Berkeley, California | December 2011 | Speech | [PDF]
|
| Phonetic- and Speaker-Discriminant Features for Speaker Recognition | L. Stoll | UC Berkeley Masters Thesis | December 2006 | Speech | [PDF]
|
| On the Use of Spectro-Temporal Features in Noise-Additive Speech | S. Ravuri | UC Berkeley Master's thesis, Spring 2011 | 2011 | Speech | [PDF]
|
| Ensemble Feature Selection for Multi-stream Automatic Speech Recognition | D. Gelbart | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Kernel Optimization for Support Vector Machines: Application to Speaker Verification | A. Hatch | UC Berkeley dissertation | December 2006 | Speech | [PDF]
|
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Structured Approaches to Data Selection for Speaker Recognition | H. Lei | UC Berkeley dissertation | December 2010 | Speech | [PDF]
|
| Narrative Theme Navigation for Sitcoms Supported by Fan-Generated Scripts | G. Friedland, A. Janin, and L. Gottlieb | To appear in Multimedia Tools and Applications, Springer | 2012 | Speech | [PDF]
|
| Connectionist Speech Recognition: A Hybrid Approach | H. Bourlard and N. Morgan | The Kluwer International Series in Engineering and Computer Science; v. 247, Boston: Kluwer Academic Publishers | 1993 | Speech | |
| Chapter 17: The Transcription of Discourse | J. Edwards | The Handbook of Discourse Analysis, D. Shriffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348 | 2001 | Speech | |
| Automatic Labeling of Semantic Roles | D. Gildea and D. Jurafsky | The 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong, pp. 512-520 | October 2000 | Speech | [PDF]
|
| The Modulation Spectrogram: In Pursuit of an Invariant Representation of Speech | S. Greenberg and B. Kingsbury | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 3, pp. 1647-1650 | April 1997 | Speech | [PDF]
|
| Integrating Syllable Boundary Information Into Speech Recognition | S.L. Wu, M. Shire, S. Greenberg, and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 987-990 | April 1997 | Speech | [PDF]
|
| The Weft: A Representation for Periodic Sounds | D. Ellis | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1307-1310 | April 1997 | Speech | [PDF]
|
| Recognizing Reverberant Speech With RASTA-PLP | B. Kingsbury and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1259-1262 | April 1997 | Speech | [PDF]
|
| Switchboard-DAMSL Labeling Project Coder's Manual | D. Jurafsky, E. Shriberg, and D. Biasca | Technical Report 97-02, University of Colorado, Institute of Cognitive Science, Boulder, Colorado | 1997 | Speech | [PDF]
|
| What's New in Government-Sponsored Speech Recognition Research | N. Morgan | Speech Technology Magazine, Vol. 7, No. 5 | September 2002 | Speech | |
| Prosody-Based Automatic Segmentation of Speech into Sentences and Topics | E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür | Speech Communications, T. Robinson and S. Rendals, eds., Vol. 32, Issue 1-2, pp. 127-154 | September 2000 | Speech | |
| Relevance of Time-Frequency Features for Phonetic and SpeakerChannel Classification | H.H. Yan, S. Sharma, S. van Vuuren, and H. Hermansky | Speech Communication,Vol. 1, No. 31, pp. 35-50 | May 2000 | Speech | [PDF]
|
| Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of Speech | J. Kolar, Y. Liu, and E. Shriberg | Speech Communication, Vol. 52, Issue 3, pp. 236-245 | March 2010 | Speech | |
| Long Story Short - Global Unsupervised Models for Keyphrase Based Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Speech Communication, Vol. 52, Issue 10, pp. 801-815. DOI:10.1016/j.specom.2010.06.002 | October 2010 | Speech | |
| Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition | M. Holmberg, D. Gelbart, and W. Hemmert | Speech Communication, Vol. 49, Issue 12, pp. 917-932 | December 2007 | Speech | |
| Modeling Prosodic Feature Sequences for Speaker Recognition | E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke | Speech Communication, Vol. 46, Issues 3-4, pp. 455-472 | July 2005 | Speech | |
| Using Knowledge to Organize Sound: The Prediction-driven Approach to Computational Auditory Scene Analysis and Its Application to Speech/Nonspeech Mixtures | D. Ellis | Speech Communication, Vol. 27, Issue 3-4, pp. 281-298 | 1999 | Speech | |
| Robust Speech Recognition Using the Modulation Spectrogram | B. Kingsbury, N. Morgan, and S. Greenberg | Speech Communication, Vol. 25, pp. 117-132 | 1998 | Speech | |
| Neural nets and hidden Markov models: Review and Generalizations | H. Bourlard, N. Morgan, and S. Renals | Speech Communication, Vol. 11, No.2-3, pp. 237-246 | 1992 | Speech | |
| Towards Increasing Speech Recognition Error Rates | H. Bourlard, H., Hermansky, and N. Morgan | Speech Communication, pp. 205-231 | May 1996 | Speech | |
| Effects of Speaking Rate and Word Frequency on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Speech Communication Vol. 29, No. 2-4, pp. 137-158 | November 1999 | Speech | [PDF]
|
| Educational Multimedia | G. Friedland, L. Knipping, and W. Huerst (guest editors) | Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008 | July 2008 | Speech | [PDF]
|
| Multimedia Technologies for E-learning | G. Friedland and L. Knipping (editors) | Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., United Kingdom | March 2007 | Speech | |
| Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech | |
| Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech | |
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech | |
| Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
|
| The ICSI/SRI/UW RT04 Structural Metadata Extraction System | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, and M. Harper | RT-04 EARS Workshop | January 2004 | Speech | |
| RASTA Extensions: Robustness to Additive and Convolutional Noise | N. Morgan and H. Hermansky | Proceedings of the Workshop on Speech Processing in Adverse Conditions, pp. 115-118 | 1992 | Speech | |
| A Graph-Based Semi-Supervised Learning for Question Semantic Labeling | A. Celikyilmaz and D. Hakkani-Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 27-35 | June 2010 | Speech | [PDF]
|
| LDA Based Similarity Modeling for Question Answering | A. Celikyilmaz, D. Hakkani-Tur, and G. Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 1-9 | June 2010 | Speech | [PDF]
|
| A Parallel Meeting Diarist | G. Friedland, J. Chong, and A. Janin | Proceedings of the Workshop on Searching Spontaneous Conversational Speech (SSCS) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 57-60 | October 2010 | Speech | [PDF]
|
| A Scalable Global Model for Summarization | D. Gillick and B. Favre | Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 10-18 | June 2009 | Speech | [PDF]
|
| SpeechCorder, The Portable Meeting Recorder | A. Janin and N. Morgan | Proceedings of the Workshop on Hands-Free Speech Communication, Kyoto, Japan | April 2001 | Speech | [PDF]
|
| Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore Decoder | J. Barker, M. Cooke, and D. Ellis | Proceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, Denmark | September 2001 | Speech | |
| The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language Researchers | J. A. Edwards | Proceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11 | May 2004 | Speech | [PDF]
|
| Towards Subband-Based Speech Recognition | H. Bourlard, S. Dupont, H. Hermansky, and N. Morgan | Proceedings of the VIII European Signal Processing Conference (EUSIPCO '96), Trieste, Italy, pp. 1579-1582 | 1996 | Speech | |
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|