| Meeting Recorder Project: Dialog Act Labeling Guide | R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg | ICSI Technical Report TR-04-002 | February 2004 | Speech | [PDF]
|
| Meeting Recorder | A. Janin | Proceedings of the Applied Voice Input/Output Society, San Jose, California | April 2001 | Speech | [PDF]
|
| Meeting Acts: A Labeling System for Group Interaction in Meetings | R. Bates, P. Menning, E. Willingham, and C. Kuyper | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
|
| Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional Modeling | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472 | May 1998 | Speech | [PDF]
|
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| Making the Most from Multiple Microphones in Meeting Recognition | A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995 | May 2011 | Speech | [PDF]
|
| Longer Features: They Do a Speech Detector Good | TJ Tsai and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | |
| Long-Term Temporal Features for Conversational Speech Recognition | B. Chen, Q. Zhu, and N. Morgan | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Long Story Short - Global Unsupervised Models for Keyphrase Based Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Speech Communication, Vol. 52, Issue 10, pp. 801-815. DOI:10.1016/j.specom.2010.06.002 | October 2010 | Speech | |
| Live Speaker Identification in Conversations | G. Friedland and O. Vinyals | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1017-1018 | October 2008 | Speech | [PDF]
|
| Linguistic Dissection of Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg and S. Chang | Proceedings of the ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, France | 2000 | Speech | [PDF]
|
| Leveraging Speaker Diarization for Meeting Recognition from Distant Microphones | A. Stolcke, G. Friedland, and D. Imseng | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4390-4393 | March 2010 | Speech | [PDF]
|
| Leveraging Sentence Weights in a Concept-Based Optimization Framework for Extractive Meeting Summarization | S. Xie, B. Favre, D. Hakkani-Tür, and Y. Liu | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1503-1506 | September 2009 | Speech | [PDF]
|
| Let's DISCOH: Collecting an Annotated Open Corpus with Dialogue Acts and Reward Signals for Natural Language Helpdesks | G. Andeani, D. Di Fabbrizio, M. Gilbert, D. Gillick, D. Hakkani-Tur, and O. Lemon | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 218-221 | December 2006 | Speech | [PDF]
|
| Learning Phonological Rule Probabilities from Speech Corpora with Exploratory Computational Phonology | G. Tajchman, D. Jurafsky, and E. Fosler | Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL 1995), Boston, Massachusetts, pp. 1-5 | June 1995 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers | B. Chen, S. Chang, and S. Sivadas | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Learning Discriminant Narrow-Band Temporal Patterns for Automatic Recognition of Conversational Telephone Speech | B.Y. Chen | Ph.D. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| LDA Based Similarity Modeling for Question Answering | A. Celikyilmaz, D. Hakkani-Tur, and G. Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 1-9 | June 2010 | Speech | [PDF]
|
| Language-Independent Constrained Cepstral Features for Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299 | May 2011 | Speech | [PDF]
|
| Language Model Combination and Adaptation Using Weighted Finite State Transducers | X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas | March 2010 | Speech | |
| Kernel Optimization for Support Vector Machines: Application to Speaker Verification | A. Hatch | UC Berkeley dissertation | December 2006 | Speech | [PDF]
|
| Joke-o-Mat: Browsing Sitcoms Punchline by Punchline | G. Friedland, L. Gottlieb, and A. Janin | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 1115-1116 | October 2009 | Speech | [PDF]
|
| Joke-O-Mat HD: Browsing Sitcoms with Human Derived Transcripts | A. Janin, L. Gottlieb, and G. Friedland | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1591-1594 | October 2010 | Speech | [PDF]
|
| Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, A. Stolcke, E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 581-584 | May 2006 | Speech | [PDF]
|
| Joint Distributional Modeling with Cross-Correlation Based Features | J. Bilmes | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU-97), Santa Barbara, California, pp.148-155 | 1997 | Speech | [PDF]
|
| Java Visual Speech Components for Rapid Application Development of GUI based Speech Processing Applications | S. Steidl, K. Riedhammer, T. Bocklet, F. Hoenig, and E. Noeth | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 3257-3260 | August 2011 | Speech | |
| Japanese Speech Understanding Using Grammar Specialization | M. Rayner, N. Chatzichrisafis, P. Bouillon, Y. Nakao, H. Isahara, K. Kanzaki, B. A. Hockey, M. Santaholma, and M. Starlander | Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, Canada, pp. 26-27 | October 2005 | Speech | |
| IXIR: A Statistical Information Distillation System | M. Levit, D. Hakkani-Tür, G. Tür, and D. Gillick | Journal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542 | October 2009 | Speech | [PDF]
|
| Investigations Into Tandem Acoustic Modeling for the Aurora Taks | D.P.W. Ellis and M. Reyes | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | |
| Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]
|
| Introduction to the Special Issue on Processing Morphologically Rich Languages | R. Sarikaya, K. Kirchhoff, T. Schultz, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 861-862 | July 2009 | Speech | [PDF]
|
| Introduction to Multimedia Computing | G. Friedland and R. Jain | Cambridge University Press | 2011 | Speech | |
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679 | June 2007 | Speech | |
| Integrating Syllable Boundary Information Into Speech Recognition | S.L. Wu, M. Shire, S. Greenberg, and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 987-990 | April 1997 | Speech | [PDF]
|
| Integrating RASTA-PLP into Speech Recognition | J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch, and G. Tong | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, pp. I-421-424 | 1994 | Speech | |
| Integrating Prosodic Features in Extractive Meeting Summarization | S. Xie, D. Hakkani-Tür, B. Favre, and Y. Liu | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391 | December 2009 | Speech | [PDF]
|
| Integrating Experimental Models of Syntax, Phonology, and Accent/Dialect in a Speech Recognizer | D. Jurafsky, C.Wooters, G. Tajchman, J. Segal, A. Stolcke, and N. Morgan | Proceedings of the 12th National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington | 1994 | Speech | [PDF]
|
| Insights Into Spoken Language Gleaned from Phonetic Transcriptions of the Switchboard Corpus | S. Greenberg, J. Hollenback, and D. Ellis | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu | Ph.D. Thesis, University of California at Berkeley, Spring 1998. Also ICSI Technical Report TR-98-014 | 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Incorporating Contextual Phonetics Into Automatic Speech Recognition | E. Fosler-Lussier, S. Greenberg, and N. Morgan | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 611-614 | August 1999 | Speech | [PDF]
|
| Improving Word Sense Disambiguation in Lexical Chaining | M. Galley and K. McKeown | Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI 03), Acapulco, Mexico, pp. 1486-1488 | August 2003 | Speech | [PDF]
|
| Improving Word Accuracy with Gabor Feature Extraction | M. Kleinschmidt and D. Gelbart | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Improving the Usability of MedSLT: Back-Translation and the Help System (in Japanese) | Y. Nakao, M. Rayner, N. Chatzichrisafis, K. Kanzaki, P. Bouillon, B.A. Hockey, and H. Isahara | Proceedings of the 12th Annual Meeting of the Japanese Society for Natural Language Processing (NLP2006), Tokyo, Japan | March 2006 | Speech | |
| Improving Statistical Speech Recognition | S. Renals, N. Morgan, M. Cohen, H. Franco, H. Bourlard | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China, pp. II-302-307 | 1992 | Speech | |
| Improving Speech Translation with Automatic Boundary Prediction | E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2449-2452 | August 2007 | Speech | [PDF]
|
| Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms | A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. Shriberg | Proceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262 | June 2010 | Speech | [PDF]
|
| Improving Automatic Speech Recognition by Learning from Human Errors | B. T. Meyer | Proceedings of the 162nd Meeting of the Acoustical Society of America, San Diego, California | October 2011 | Speech | |