| CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model Applications | H. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. Fox | Proceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, California | May 2011 | Speech | [PDF]
|
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J. M. Pardo, X. Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256 | May 2006 | Speech | [PDF]
|
| Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site | O. Cetin and E. Shriberg | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224 | May 2006 | Speech | [PDF]
|
| Narrative-Theme Navigation for Sitcoms Supported by Fan-Generated Scripts | G. Friedland, L. Gottlieb, and A. Janin | Proceedings of the Third International Workshop on Automated Information Extraction in Media Production (AIEMPro '10) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 3-8 | October 2010 | Speech | [PDF]
|
| The Berkeley Restaurant Project | D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 2139-2142 | September 1994 | Speech | [PDF]
|
| Multiple-Pronunciation Lexical Modeling in a Speaker Independent Speech Understanding System | C. Wooters and A. Stolcke | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1963-1966 | September 1994 | Speech | [PDF]
|
| Stochastic Perceptual Auditory-Event-Based Models for Speech Recognition | N. Morgan, H. Bourlard, S. Greenberg, and H. Hermansky | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1943-1946 | 1994 | Speech | [PDF]
|
| Modeling Dynamics in Connectionist Speech Recognition - the Time Index Model | Y. Konig and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1523-1526 | September 1994 | Speech | [PDF]
|
| On the Use of Artificial Conversation Data for Speaker Recognition in Cars | L. Gottlieb and G. Friedland | Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128 | September 2009 | Speech | [PDF]
|
| Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition | H. Lei | Proceedings of the Third IAPR/IEEE International Conference on Biometrics (ICB 2009), Alghero, Italy | June 2009 | Speech | [PDF]
|
| A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System: the Wernicke Project | A. Robinson, L. Almeida, J. Boite, H. Bourlard, F. Fallside, H. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J. Neto, S. Renals, M. Saerens, and C. Wooters | Proceedings of the Third European Conference on Speech Communication and Technology (Eurospeech '93), Berlin, Germany, pp. 1941-1944 | 1993 | Speech | |
| Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition | H. Bourlard and N. Morgan | Proceedings of the International Joint Conference on Neural Networks (IJCNN '91), Singapore | 1991 | Speech | |
| The Role of Disfluencies on Topic Classification of Human-Human Conversations | C. Boulis, J. G. Kahn, and M. Ostendorf | Proceedings of the Spoken Language Understanding Workshop Program at the 20th National Conference on Artificial Intelligence (AAAI-05), Pittsburgh, Pennsylvania | July 2005 | Speech | [PDF]
|
| The Berkeley Restaurant Project | C. Wooters, D. Jurafsky, G. Tajchman, and N. Morgan | Proceedings of the Speech Research Symposium XIII, Johns Hopkins University, Baltimore, Maryland, pp. 119-128 | 1993 | Speech | |
| A Can of RASTA Worms | H. Hermansky and N. Morgan | Proceedings of the Speech Research Symposium XIII, Johns Hopkins University, Baltimore, Maryland, pp. 343-350 | 1993 | Speech | |
| Multiple-State Context-Dependent Phonetic Modeling with MLPs | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| RelAtive SpecTrAl (RASTA) Processing in Speech Analysis | H. Hermansky and N. Morgan | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| Hunting for Wolves in Speaker Recognition | L. Stoll and G. Doddington | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2010), Brno, Czech Republic, pp. 159-164 | June 2010 | Speech | [PDF]
|
| Modeling NERFs for Speaker Recognition | S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain, pp. 51-56 | May 2004 | Speech | [PDF]
|
| Text-Constrained Speaker Recognition on a Text-Independent Task | K. Boakye and B. Peskin | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain | May 2004 | Speech | [PDF]
|
| An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings | S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür | Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59 | July 2007 | Speech | |
| Text Classification by Augmenting the Bag-of-Words Representation with Redundancy-Compensated Bigrams | C. Boulis and M. Ostendorf | Proceedings of the SIAM International Conference on Data Mining at the Workshop on Feature Selection in Data Mining (SIAM-FSDM 2005), Newport Beach, California | April 2005 | Speech | [PDF]
|
| Opportunities and Challenges of Parallelizing Speech Recognition | J. Chong, G. Friedland, A. Janin, and N. Morgan | Proceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, California | June 2010 | Speech | [PDF]
|
| Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475 | July 2005 | Speech | [PDF]
|
| Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System | X. Anguera, C. Wooters, B. Peskin, and M. Aguilo | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 402-414 | July 2005 | Speech | [PDF]
|
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193 | July 2005 | Speech | [PDF]
|
| The ICSI RT07s Speaker Diarization System | C. Wooters and M. Huijbregts | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 509-519 | May 2007 | Speech | [PDF]
|
| The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, A. Janin, M. Magimai-Doss, C. Wooters, and J. Zheng | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 450-463 | May 2007 | Speech | [PDF]
|
| Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices | A. Lüning, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375 | December 2007 | Speech | |
| Compensation for the effect of the communication channel in Perceptual Linear Predictive (PLP) analysis of speech | H. Hermansky, A. Bayya, N. Morgan, and P. Kohn | Proceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 1367-1370 | 1991 | Speech | |
| Phonetic Context in Hybrid HMM/MLP Continuous Speech Recognition | H. Bourlard, M. Cohen, P. Kohn, N. Morgan, and C. Wooters | Proceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 109-112 | 1991 | Speech | |
| Personalized, Interactive Tag Recommendation for Flickr | N. Garg and I. Weber | Proceedings of the Second ACM International Conference on Recommender Systems (RecSys 2008), Lausanne, Switzerland, pp. 67-74 | October 2008 | Speech | [PDF]
|
| Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms | A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. Shriberg | Proceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262 | June 2010 | Speech | [PDF]
|
| An Introduction to the Diagnostic Evaluation of the Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg, S. Chang, and J. Hollenback | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| Prosodic Stress Revisited: Reassessing the Role of Fundamental Frequency | R. Silipo and S. Greenberg | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| The 2011 ICSI Video Location Estimation System | J. Choi, H. Lei, and G. Friedland | Proceedings of the MediaEval 2011 Workshop, Pisa, Italy | September 2011 | Speech | [PDF]
|
| The 2010 ICSI Video Location Estimation System | J. Choi, A. Janin, and G. Friedland | Proceedings of the MediaEval 2010 Workshop, Pisa, Italy | October 2010 | Speech | [PDF]
|
| ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields | B. Favre and B. Bohnet | Proceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100 | August 2009 | Speech | [PDF]
|
| Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony | S. Greenberg and T. Arai | Proceedings of the Joint Meeting of the 137th Acoustical Society of America and the 16th International Congress on Acoustics (ICA/ASA), Seattle, Washington, pp. 2677-2678 | June 1998 | Speech | [PDF]
|
| Japanese Speech Understanding Using Grammar Specialization | M. Rayner, N. Chatzichrisafis, P. Bouillon, Y. Nakao, H. Isahara, K. Kanzaki, B. A. Hockey, M. Santaholma, and M. Starlander | Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, Canada, pp. 26-27 | October 2005 | Speech | |
| Fast Consensus Decoding over Translation Forests | J. DeNero, D. Chiang, and K. Knight | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Asynchronous Binarization for Synchronous Grammars | J. DeNero, A. Pauls, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Better Word Alignments with Supervised ITG Models | A. Haghighi, J. Blitzer, J. DeNero, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task | K. Parton, K. R. McKeown, R. Coyne, M. T. Diab, R. Grishman, D. Hakkani-Tür, M. Harper, H. Ji, W. Y. Ma, A. Meyers, S. Stolbach, A. Sun, G. Tur, W. Xu, and S. Yaman | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 423-431 | August 2009 | Speech | [PDF]
|
| The 2004 ICSI-SRI-UW Meeting Recognition System | C. Wooters, N. Mirghafori, A. Stolcke, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of the Joint AMI/PASCAL/IM2/IM4 Workshop on Multimodal and Related Machine Learning Algorithms (MLMI '04), Martigny, Switzerland, pp. 196-208 | June 2004 | Speech | [PDF]
|
| Linguistic Dissection of Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg and S. Chang | Proceedings of the ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, France | 2000 | Speech | [PDF]
|
| Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | [PDF]
|
| Speaker Recognition Via Nonlinear Discriminant Features | L. Stoll, J. Frankel, and N. Mirghafori | Proceedings of the International Speech Communication Association Tutorial and Research Workshop on Non-Linear Speech Processing (NOLISP 2007), Paris, France, pp. 27-30 | May 2007 | Speech | [PDF]
|