| Connectionist Approaches to the Use of Markov Models for Continuous Speech Recognition | H. Bourlard and N. Morgan | Advances in Neural Information Processing Systems, Vol. III, pp. 213-219 | 1991 | Speech | |
| Computers and Commerce: A Study of Technology and Management at Eckert-Mauchly Computer Company, Engineering Research Associates, and Remington Rand, 1946-1957 (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, No. 2, IEEE Computer Society, California, pp. 74-77 | June 2007 | Speech | |
| Computationally Efficient Clustering of Audio-Visual Meeting Data | H. Hung, G. Friedland, and C. Yeo | In Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etoh, J. Luo, and L. Shao, eds., pp. 25-59 | 2010 | Speech | |
| Computational Auditory Scene Analysis Exploiting Speech-Recognition Knowledge | D. Ellis | Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4 | October 1997 | Speech | [PDF] |
| Compensation for the effect of the communication channel in Perceptual Linear Predictive (PLP) analysis of speech | H. Hermansky, A. Bayya, N. Morgan, P. Kohn | Proceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 1367-1370 | 1991 | Speech | |
| Comparisons of Recent Speaker Recognition Approaches Based on Word Conditioning | H. Lei and N. Mirghafori | Proceedings of Odyssey 2008, Stellenbosch, South Africa | January 2008 | Speech | [PDF] |
| Comparison of Grammar Based and Statistical Language Models Trained on the Same Data | B.A. Hockey and M. Rayner | Presented at the Workshop on Spoken Language Understanding at the 20th AAAI National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania | July 2005 | Speech | |
| Comparing the Contributions of Context and Prosody in Text-Independent Dialog Act Recognition | K. Laskowski and E. Shriberg | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5374-5377 | March 2010 | Speech | [PDF] |
| Comparing Multilayer Perceptron to Deep Belief Network Tandem Features for Robust ASR | O. Vinyals and S. Ravuri | Proceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), Prague, Czech Republic | May 2011 | Speech | [PDF] |
| Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection | Y. Liu, E. Shriberg, A. Stolcke, and M. Harper | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 3313-3316 | September 2005 | Speech | |
| Comparing Evaluation Metrics for Sentence Boundary Detection | Y. Liu and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Vol. 4, pp. 185-188, Honolulu, Hawaii | April 2007 | Speech | [PDF] |
| Comparing Different Flavors of Spectro-Temporal Features for ASR | B. T. Meyer, S. V. Ravuri, M. R. Schaedler, and N. Morgan | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 1269-1272 | August 2011 | Speech | [PDF] |
| Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech | Y. Liu, A. Stolcke, E. Shriberg, and M. Harper | Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona | July 2004 | Speech | [PDF] |
| Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age | C. Müller and F. Burkhardt | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2277-2280 | August 2007 | Speech | |
| Combining Semantic and Syntactic Information Sources for 5-W Question Answering | S. Yaman, D. Hakkani-Tür, and G. Tur | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2707-2710 | September 2009 | Speech | [PDF] |
| Combining Multiple Estimators of Speaking Rate | N. Morgan and E. Fosler-Lussier | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 729-732 | May 1998 | Speech | [PDF] |
| Combining Multiple Clustering Systems | C. Boulis and M. Ostendorf | Proceedings of the 15th European Conference on Machine Learning (ECML/PKDD 2004), Pisa, Italy | September 2004 | Speech | [PDF] |
| Combining Feature Sets with Support Vector Machines: Application to Speaker Recognition | A. O. Hatch, A. Stolcke, and B. Peskin | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 75-79 | November 2005 | Speech | [PDF] |
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers | N. Mirghafori and N. Morgan | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 743-746 | 1998 | Speech | [PDF] |
| Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore Decoder | J. Barker, M. Cooke, and D. Ellis | Proceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, Denmark | September 2001 | Speech | |
| Combined Speech and Speaker Recognition With Speaker-adapted Connectionist Models | D. Genoud, D. Ellis, and N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF] |
| Combined Low Level and High Level Features for Out-of-Vocabulary Word Detection | B. Lecouteux, G. Linarès, and B. Favre | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1187-1190 | September 2009 | Speech | [PDF] |
| Co-training Using Prosodic and Lexical Information for Sentence Segmentation | U. Guz, S. Cuendet, D. Hakkani-Tür, and G. Tur | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2597-2600 | August 2007 | Speech | [PDF] |
| ClusterRank: A Graph Based Method for Meeting Summarization | N. Garg, B. Favre, K. Riedhammer, and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1499-1502 | September 2009 | Speech | [PDF] |
| Classification-Based Strategies for Combining Multiple 5-W Question Answering Systems | S. Yaman, D. Hakkani-Tür, G. Tur, R. Grishman, M. Harper, K. R. McKeown, A. Meyers, and K. Sharma | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2703-2706 | September 2009 | Speech | [PDF] |
| Clap Detection and Discrimination for Rhythm Therapy | N. Lesser and D.P.W. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 37-40 | March 2005 | Speech | [PDF] |
| Chapter 17: The Transcription of Discourse | J. Edwards | The Handbook of Discourse Analysis, D. Schiffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348 | 2001 | Speech | |
| CDNN: A Context Dependent Neural Network for Continuous Speech Recognition | H. Bourlard, N. Morgan, C. Wooters, and S. Renals | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. II-349-352 | 1992 | Speech | |
| Cascaded Model Adaptation for Dialog Act Segmentation and Tagging | U. Guz, G. Tur, D. Hakkani-Tür, and S. Cuendet | Journal of Computer Speech and Language, Vol. 24, Issue 2, pp. 289-306 | April 2010 | Speech | |
| Can We Escape the Trough of Disillusionment?--A Perspective on E-learning Technology Research from the ACM Workshop on Educational Multimedia and Multimedia Education | G. Friedland, L. Knipping, W. Huerst, and M. Muhlhauser | ACM E-Learn Journal | February 2009 | Speech | |
| Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | [PDF] |
| Can Conversational Word Usage Be Used to Predict Speaker Demographics? | D. Gillick | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan | September 2010 | Speech | [PDF] |
| Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF] |
| Building Multiple Pronunciation Models for Novel Words using Exploratory Computational Phonology | G. Tajchman, E. Fosler, and D. Jurafsky | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF] |
| Building a Highly Accurate Mandarin Speech Recognizer | M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495 | December 2007 | Speech | [PDF] |
| Bird Species Recognition Combining Acoustic and Sequence Modeling | M. Graciarena, M. Delplanche, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344 | May 2011 | Speech | [PDF] |
| Big Dumb Neural Nets: A Working Brute Force Approach to Speech Recognition | N. Morgan | Proceedings of the International Conference on Neural Networks, Vol. VII, pp. 4462-4465 | 1994 | Speech | |
| Better Word Alignments with Supervised ITG Models | A. Haghighi, J. Blitzer, J. DeNero, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF] |
| Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC) | G. Friedland and C. Martell, eds. | International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3 | September 2008 | Speech | |
| Best Papers from the 10th IEEE International Symposium on Multimedia | G. Friedland and S.-C. Shen, eds. | International Journal on Semantic Computing (IJSC), World Scientific, Vol. 3, Issue 2 | June 2009 | Speech | |
| Backoff Model Training Using Partially Observed Data: Application to Dialog Act Tagging | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York City, New York, pp. 280-287 | June 2006 | Speech | [PDF] |
| Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech | |
| Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings | S. Bhagat, H. Carvey, and E. Shriberg | Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003), Barcelona, Spain | August 2003 | Speech | [PDF] |
| Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings | X. Anguera, C. Wooters, J. Pardo, and J. Hernando | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 241-244 | April 2007 | Speech | [PDF] |
| Automatic Transcription of Prosodic Stress for Spontaneous English Discourse | R. Silipo and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 2351-2354 | August 1999 | Speech | [PDF] |
| Automatic Tagging and Geo-Tagging in Video Collections and Communities | M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. Jones | Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy | April 2011 | Speech | [PDF] |
| Automatic Speech Recognition with Neural Spike Trains | M. Holmberg, D. Gelbart, U. Ramacher, and W. Hemmert | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal | September 2005 | Speech | [PDF] |
| Automatic Speech Recognition with an Adaptation Model Motivated by Auditory Processing | M. Holmberg, D. Gelbart, and W. Hemmert | IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 1, pp. 44-49 | January 2006 | Speech | [PDF] |
| Automatic Speech Recognition | H. Hermansky and N. Morgan | Encyclopedia of Cognitive Science, Nature Publishing Group, London | 2003 | Speech | |