| Compensation for the effect of the communication channel in Perceptual Linear Predictive (PLP) analysis of speech | H. Hermansky, A. Bayya, N. Morgan, P. Kohn | Proceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 1367-1370 | 1991 | Speech | |
| Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices | A. Lüning, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375 | December 2007 | Speech | |
| The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, A. Janin, M. Magimai-Doss, C. Wooters, and J. Zheng | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 450-463 | May 2007 | Speech | [PDF]
|
| The ICSI RT07s Speaker Diarization System | C. Wooters and M. Huijbregts | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 509-519 | May 2007 | Speech | [PDF]
|
| Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193 | July 2005 | Speech | [PDF]
|
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System | X. Anguera, C. Wooters, B. Peskin, and M. Aguilo | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 402-414 | July 2005 | Speech | [PDF]
|
| Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475 | July 2005 | Speech | [PDF]
|
| Opportunities and Challenges of Parallelizing Speech Recognition | J. Chong, G. Friedland, A. Janin, and N. Morgan | Proceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, California | June 2010 | Speech | [PDF]
|
| Text Classification by Augmenting the Bag-of-Words Representation with Redundancy-Compensated Bigrams | C. Boulis and M. Ostendof | Proceedings of the SIAM International Conference on Data Mining at the Workshop on Feature Selection in Data Mining (SIAM-FSDM 2005), Newport Beach, California | April 2005 | Speech | [PDF]
|
| An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings | S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür | Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59 | July 2007 | Speech | |
| Text-Constrained Speaker Recognition on a Text-Independent Task | K. Boakye and B. Peskin | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain | May 2004 | Speech | [PDF]
|
| Modeling NERFs for Speaker Recognition | S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain, pp. 51-56 | May 2004 | Speech | [PDF]
|
| Hunting for Wolves in Speaker Recognition | L. Stoll and G. Doddington | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2010), Brno, Czech Republic, pp. 159-164 | June 2010 | Speech | [PDF]
|
| Multiple-State Context-Dependent Phonetic Modeling with MLPs | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| RelAtive SpecTrAl (RASTA) Processing in Speech Analysis | H. Hermansky and N. Morgan | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| A Can of RASTA Worms | H. Hermansky and N. Morgan | Proceedings of the Speech Research Symposium XIII, Johns Hopkins University, Balitmore, Maryland, pp. 343-350 | 1993 | Speech | |
| The Berkeley Restaurant Project | C. Wooters, D. Jurafsky, G. Tajchman, and N. Morgan | Proceedings of the Speech Research Symposium XIII, Johns Hopkins University, Baltimore, Maryland, pp. 119-128 | 1993 | Speech | |
| The Role of Disfluencies on Topic Classification of Human-Human Conversations | C. Boulis, J. G. Kahn, and M. Ostendorf | Proceedings of the Spoken Language Understanding Workshop Program at the 20th National Conference on Artificial Intelligence (AAAI-05), Pittsburgh, Pennsylvania | July 2005 | Speech | [PDF]
|
| Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition | H. Bourlard and N. Morgan | Proceedings of the the International Joint Conference on Neural Networks (IJCNN '91), Singapore | 1991 | Speech | |
| A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System: the Wernicke Project | A. Robinson, L. Almeida, J. Boite, H. Bourlard, F. Fallside, H. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J. Neto, S. Renals, M. Saerens, and C. Wooters | Proceedings of the Third European Conference on Speech Communication and Technology (Eurospeech '93), Berlin, Germany, pp. 1941-1944 | 1993 | Speech | |
| Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition | H. Lei | Proceedings of the Third IAPR/IEEE International Conference on Biometrics (ICB 2009), Alghero, Italy | June 2009 | Speech | [PDF]
|
| On the Use of Artificial Conversation Data for Speaker Recognition in Cars | L. Gottlieb and G. Friedland | Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128 | September 2009 | Speech | [PDF]
|
| Modeling Dynamics in Connectionist Speech Recognition - the Time Index Model | Y. Konig and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1523-1526 | September 1994 | Speech | [PDF]
|
| Stochastic Perceptual Auditory-Event-Based Models for Speech Recognition | N. Morgan, H. Bourlard, S. Greenberg, and H. Hermansky | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1943-1946 | 1994 | Speech | [PDF]
|
| Multiple-Pronunciation Lexical Modeling in a Speaker Independent Speech Understanding System | C. Wooters and A. Stolcke | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1963-1966 | September 1994 | Speech | [PDF]
|
| The Berkeley Restaurant Project | D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 2139-2142 | September 1994 | Speech | [PDF]
|
| Narrative-Theme Navigation for Sitcoms Supported by Fan-Generated Scripts | G. Friedland, L. Gottlieb, and A. Janin | Proceedings of the Third International Workshop on Automated Information Extraction in Media Production (AIEMPro '10) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 3-8 | October 2010 | Speech | [PDF]
|
| Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site | O. Cetin and E. Shriberg | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224 | May 2006 | Speech | [PDF]
|
| Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256 | May 2006 | Speech | [PDF]
|
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J.M. Pardo, X Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model Applications | H. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. Fox | Proceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, California | May 2011 | Speech | [PDF]
|
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|
| Towards Subband-Based Speech Recognition | H. Bourlard, S. Dupont, H. Hermansky, and N. Morgan | Proceedings of the VIII European Signal Processing Conference (EUSIPCO '96), Trieste, Italy, pp. 1579-1582 | 1996 | Speech | |
| The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language Researchers | J. A. Edwards | Proceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11 | May 2004 | Speech | [PDF]
|
| Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore Decoder | J. Barker, M. Cooke, and D. Ellis | Proceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, Denmark | September 2001 | Speech | |
| SpeechCorder, The Portable Meeting Recorder | A. Janin and N. Morgan | Proceedings of the Workshop on Hands-Free Speech Communication, Kyoto, Japan | April 2001 | Speech | [PDF]
|
| A Scalable Global Model for Summarization | D. Gillick and B. Favre | Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 10-18 | June 2009 | Speech | [PDF]
|
| A Parallel Meeting Diarist | G. Friedland, J. Chong, and A. Janin | Proceedings of the Workshop on Searching Spontaneous Conversational Speech (SSCS) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 57-60 | October 2010 | Speech | [PDF]
|
| LDA Based Similarity Modeling for Question Answering | A. Celikyilmaz, D. Hakkani-Tur, and G. Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 1-9 | June 2010 | Speech | [PDF]
|
| A Graph-Based Semi-Supervised Learning for Question Semantic Labeling | A. Celikyilmaz and D. Hakkani-Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 27-35 | June 2010 | Speech | [PDF]
|
| RASTA Extensions: Robustness to Additive and Convolutional Noise | N. Morgan and H. Hermansky | Proceedings of the Workshop on Speech Processing in Adverse Conditions, pp. 115-118 | 1992 | Speech | |
| The ICSI/SRI/UW RT04 Structural Metadata Extraction System | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, and M. Harper | RT-04 EARS Workshop | January 2004 | Speech | |
| Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
|
| Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech | |
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech | |
| Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech | |
| Multimedia Technologies for E-learning | G. Friedland and L. Knipping (editors) | Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., United Kingdom | March 2007 | Speech | |