| Consensus Training for Consensus Decoding in Machine Translation | A. Pauls, J. DeNero, and D. Klein | Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, pp. 1418-1427 | August 2009 | Speech | [PDF]
|
| Within-Class Covariance Normalization for SVM-Based Speaker Recognition | A. O. Hatch, S. Kajarekar, and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1471-1474 | September 2006 | Speech | [PDF]
|
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Combining Feature Sets with Support Vector Machines: Application to Speaker Recognition | A. O. Hatch, A. Stolcke, and B. Peskin | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 75-79 | November 2005 | Speech | [PDF]
|
| Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition | A. O. Hatch and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 585-588 | May 2006 | Speech | [PDF]
|
| Efficient Data Selection for Machine Translation | A. Mandal, D. Vergyri, W. Wang, J. Zheng, A. Stolcke, G. Tür, D. Hakkani-Tür, and N. Fazil Ayan | Proceedings of IEEE/ACL Workshop on Spoken Language Technologies (SLT), Goa, India, pp. 261-264 | December 2008 | Speech | [PDF]
|
| Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices | A. Lüning, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375 | December 2007 | Speech | |
| Joke-O-Mat HD: Browsing Sitcoms with Human Derived Transcripts | A. Janin, L. Gottlieb, and G. Friedland | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1591-1594 | October 2010 | Speech | [PDF]
|
| The ICSI Meeting Project: Resources and Research | A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede | Proceedings of the ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada | May 2004 | Speech | [PDF]
|
| Multi-stream Speech Recognition: Ready for Prime Time? | A. Janin, D. Ellis, and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. II-591-594 | September 1999 | Speech | [PDF]
|
| The ICSI Meeting Corpus | A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| The ICSI-SRI Spring 2006 Meeting Recognition System | A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng | In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer | 2006 | Speech | [PDF]
|
| SpeechCorder, The Portable Meeting Recorder | A. Janin and N. Morgan | Proceedings of the Workshop on Hands-Free Speech Communication, Kyoto, Japan | April 2001 | Speech | [PDF]
|
| Meeting Recorder | A. Janin | Proceedings of the Applied Voice Input/Output Society, San Jose, California | April 2001 | Speech | [PDF]
|
| Speech recognition on vector architectures | A. Janin | Ph.D. Thesis, University of California at Berkeley | 2004 | Speech | [PDF]
|
| Word-Level Confidence Estimation for Automatic Speech Recognition | A. Hatch | M.S. Thesis, University of California at Berkeley | August 2001 | Speech | [PDF]
|
| Kernel Optimization for Support Vector Machines: Application to Speaker Verification | A. Hatch | UC Berkeley dissertation | December 2006 | Speech | [PDF]
|
| Better Word Alignments with Supervised ITG Models | A. Haghighi, J. Blitzer, J. DeNero, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Multi-Stream Speaker Diarization Systems for the Meetings Domain | A. Gallardo-Antolin, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006—ICSLP), Philadelphia, Pennsylvania, pp. 2186-2189 | September 2006 | Speech | [PDF]
|
| Corrected Tandem Features for Acoustic Model Training | A. Faria and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740 | April 2008 | Speech | [PDF]
|
| When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem Features | A. Faria and N. Morgan | Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577 | March 2008 | Speech | [PDF]
|
| Efficient Pitch-Based Estimation of VTLN Warp Factors | A. Faria and D. Gelbart | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 213-216 | September 2005 | Speech | [PDF]
|
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| LDA Based Similarity Modeling for Question Answering | A. Celikyilmaz, D. Hakkani-Tur, and G. Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 1-9 | June 2010 | Speech | [PDF]
|
| A Graph-Based Semi-Supervised Learning for Question Semantic Labeling | A. Celikyilmaz and D. Hakkani-Tur | Proceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 27-35 | June 2010 | Speech | [PDF]
|
| A Hybrid Hierarchical Model for Multi-Document Summarization | A. Celikyilmaz and D. Hakkani-Tür | Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, pp. 1149-1154 | July 2010 | Speech | [PDF]
|
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Forms of English Function Words - Effects of Disfluencies, Turn Position, Age and Sex, and Predictability | A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand, and D. Gildea | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 395-398 | August 1999 | Speech | [PDF]
|
| Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot Interaction | A. Batliner, S. Steidl, and E. Nöth | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27 | May 2011 | Speech | [PDF]
|
| The Automatic Recognition of Emotions in Speech | A. Batliner, B. Schuller, D. Seppi, S. Steidl, L. Devillers, L. Vidrascu, T. Vogt, V. Aharonson, and N. Amir | Article in P. Petta, Paolo, C. Pelachaud, R. Cowie, eds., Emotion-Oriented Systems: The Humaine Handbook Cognitive Technologies, pp. 71-99, Springer | 2011 | Speech | |
| A New Speaker Change Detection Method for Two-Speaker Segmentation | A. Adami, S. Kajarekar, and H. Hermansky | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Qualcomm-ICSI-OGI Features for ASR | A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|