| Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition | M. Shire | Ph.D Dissertation, University of California at Berkeley, Fall 2000 | 2000 | Speech | [PDF]
|
| Discourse Segmentation of Multi-party Conversation | M. Galley, K. McKeown, E. Fosler-Lussier, and H. Jing | Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL-03), Sapporo, Japan | July 2003 | Speech | [PDF]
|
| Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing | E. Shriberg and A. Stolcke | Proceedings of the International Conference on Speech Prosody, Nara, Japan, March 2004. | March 2004 | Speech | [PDF]
|
| Digit Recognition with Stochastic Perceptual Models | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Dialog Act Tagging Using Graphical Models | G. Ji and J. Bilmes | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, Vol. 1, pp. 33-36 | March 2005 | Speech | [PDF]
|
| Dialocalizaton: Acoustic Speaker Diarization and Visual Localization as Joint Optimization Problem | G. Friedland, C. Yeo, and H. Hung | ACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 6, No. 4, Article 27 | November 2010 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|
| Detection and Compensation of Sensor Malfunction in Time Delay Based Direction of Arrival Estimation | T. Pirinen, J. Yli-Hietanen, P. Pertilä, and A. Visa | Proceedings of IEEE ISCAS, Vancouver | May 2004 | Speech | [PDF]
|
| Detecting Music in Ambient Audio by Long-Window Autocorrelation | K. Lee and D. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12 | April 2008 | Speech | [PDF]
|
| Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based Clustering | K. Lee, D. Ellis, and A. Loui | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010 | March 2010 | Speech | [PDF]
|
| Detecting Deception Using Critical Segments | F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2281-2284 | August 2007 | Speech | [PDF]
|
| Detecting Categories in News Video Using Acoustic, Speech, and Image Features | S. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. Malik | Presented at the NIST TREC Video Retrieval Workshop, Gaithersburg, Maryland | November 2006 | Speech | [PDF]
|
| Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification System | M. Hebert and N. Mirghafori | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning | 2011 | Speech | [PDF]
|
| Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 7-13 | January 2012 | Speech | [PDF]
|
| Decoding Speech in the Presence of Other Sound Sources | J. Barker, M. Cooke, and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| Data-Driven Speaker and Subword Unit Clustering in Speech Processing | M. Hersch | EPFL Diploma Thesis, ICSI | March 2003 | Speech | [PDF]
|
| Data-driven RASTA Filters in Reverberation | M. Shire and B. Chen | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630 | June 2000 | Speech | [PDF]
|
| Data-Driven Modulation Filter Design Under Adverse Acoustic Conditions and Using Phonetic and Syllabic Units | M.L. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. III-1123-1126 | September 1999 | Speech | [PDF]
|
| Data-Driven Extensions to HMM Statistical Dependencies | J. Bilmes | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72 | November 1998 | Speech | [PDF]
|
| Data-Driven Design of RASTA-like Filters | S. van Vuuren and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Data Selection with Kurtosis and Nasality features for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756 | August 2011 | Speech | [PDF]
|
| Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online Privacy | G. Friedland | Presented at the Language Technologies Institute Colloquium, Carnegie Mellon University, Pittsburgh, Pennsylvania | April 13 2012 | Speech | [PDF]
|
| Current Research in Acoustically Robust Speech Recognition | N. Morgan | Proceedings of American Voice Input/Output Society (AVIOS), pp. 207-214 | September 1994 | Speech | |
| CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model Applications | H. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. Fox | Proceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, California | May 2011 | Speech | [PDF]
|
| Cross-Lingual Sentence Extraction for Information Distillation | A. Singla and D. Hakkani-Tur | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710 | September 2008 | Speech | [PDF]
|
| Cross-Genre Feature Comparisons for Spoken Sentence Segmentation | S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, and B. Favre | Proceedings of International Conference on Semantic Computing, IEEE Computer Society, pp. 265-274, Irvine, California. Also published in International Journal of Semantic Computing, Volume 1, Issue 3, World Scientific, USA, pp. 335-346 | September 2007 | Speech | [PDF]
|
| Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons | A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 321-324 | May 2006 | Speech | [PDF]
|
| Cover Song Detection: From High Scores to General Classification | S. Ravuri and D. Ellis | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 65-68 | March 2010 | Speech | [PDF]
|
| Corrected Tandem Features for Acoustic Model Training | A. Faria and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740 | April 2008 | Speech | [PDF]
|
| Corpus Variation and Parser Performance | D. Gildea | Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing (EMNLP 2001), Pittsburgh, Pennsylvania | June 2001 | Speech | [PDF]
|
| Continuous Speech Recognition Using PLP Analysis with Multilayer Perceptrons | N. Morgan, H. Hermansky, H. Bourlard, P. Kohn, and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, Toronto, Canada, pp. 49-52 | 1991 | Speech | |
| Continuous Speech Recognition Using Multilayer Perceptrons with Hidden Markov Models | H. Bourlard and N. Morgan | Proceedings of the IEEE International Conference of Acoustics, Speech & Signal Processing (ICASSP 1990), Albuquerque, New Mexico | 1990 | Speech | |
| Continuous Speech Recognition by Connectionist Statistical Methods | H. Bourlard and N. Morgan | IEEE Transactions on Neural Networks, Vol. 4, No. 6, pp. 893-909 | November 1993 | Speech | |
| Contextual Word and Syllable Pronunciation Models | E. Fosler-Lussier | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Context-Dependent Multiple Distribution Phonetic Modeling | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Advances in Neural Information Processing Systems, Vol. V, pp. 649-657 | 1993 | Speech | |
| Context-Dependent Connectionist Probability Estimation in a Hybrid HMM-Neural Net Speech Recognition System | H. Franco, M. Cohen, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China | 1992 | Speech | |
| Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training | M. H. Sanchez, L. Ferrer, E. Shriberg, and A. Stolcke | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 141-144 | August 2011 | Speech | [PDF]
|
| Consonant Discrimination in Elicited and Spontaneous Speech: A Case for Signal-Adaptive Front Ends in ASR | K. Sönmez, M. Plauché, E. Shriberg, and H. Franco | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Consensus Training for Consensus Decoding in Machine Translation | A. Pauls, J. DeNero, and D. Klein | Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, pp. 1418-1427 | August 2009 | Speech | [PDF]
|
| Connectionist-Based Acoustic Word Models | C. Wooters and N. Morgan | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, Copenhagen, Denmark, pp. 157-163 | 1992 | Speech | |
| Connectionist Techniques for Speech Recognition | H. Bourlard and N. Morgan | Article in the Survey on the State of the Art in Human Language Technology, ed. R. Cole, Cambridge University Press, pp. 356-361 | 1998 | Speech | |
| Connectionist Speech Recognition: A Hybrid Approach | H. Bourlard and N. Morgan | The Kluwer International Series in Engineering and Computer Science; v. 247, Boston: Kluwer Academic Publishers | 1993 | Speech | |
| Connectionist Probability Estimators in HMM Speech Recognition | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | IEEE Transactions on Speech and Audio Processing, pp. II-161-174, | January 1993 | Speech | |
| Connectionist Probability Estimation in the Decipher Speech Recognition System | S. Renals, N. Morgan, M. Cohen H. Bourlard, and H. Franco | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP 1992), pp. I-601-604 | 1992 | Speech | [PDF]
|
| Connectionist Optimisation of Tied Mixture Hidden Markov Models | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | Advances in Neural Information Processing Systems, Vol. IV, pp. 167-174 | 1991 | Speech | |
| Connectionist Gender Adaptation in a Hybrid Neural Network / Hidden Markov Model Speech Recognition System | V. Abrash, M. Cohen, H. Franco, N. Morgan, and Y. Konig | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 911-914 | 1992 | Speech | |