| Improving Automatic Sentence Boundary Detection with Confusion Networks | D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg | Proceedings of HLT-NAACL Conference, Boston | April 2004 | Speech | [PDF]
|
| Improving ASR Performance for Reverberant Speech | B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 87-90 | 1997 | Speech | [PDF]
|
| Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings | K. Boakye and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965 | September 2006 | Speech | [PDF]
|
| Improved Recognition by Combining Different Features and Different Systems | D.P.W. Ellis | Proceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, California | May 2000 | Speech | [PDF]
|
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Improved Overlapped Speech Handling for Speaker Diarization | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944 | August 2011 | Speech | |
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | [PDF]
|
| Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme Dynamics | K. Chang, H. Lei, and J. Canny | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88 | August 2011 | Speech | [PDF]
|
| Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891 | September 2009 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies | M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg | Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL 04), Barcelona, Spain | July 2004 | Speech | [PDF]
|
| ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields | B. Favre and B. Bohnet | Proceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100 | August 2009 | Speech | [PDF]
|
| ICSI's 2005 Speaker Recognition System | N. Mirghafori, A. O. Hatch, S. Stafford, K. Boakye, D. Gillick, and B. Peskin | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 23-28 | November 2005 | Speech | [PDF]
|
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings | X. Anguera, M. Aguilo, C. Wooters, C. Nadeu, and J. Hernando | Proceedings of IEEE Odyssey: The Speaker and Language Recognition Workshop, San Juan de Puerto Rico, pp. 1-6 | June 2006 | Speech | [PDF]
|
| Hybrid Neural Network / Hidden Markov Model Continuous Speech Recognition | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 915-918 | 1992 | Speech | |
| Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions | H. Bourlard and N. Morgan | Adaptive Processing of Sequences and Data Structures, C.L. Giles and M. Gori (Eds.), pp. 389-417, Lecture Notes in Artificial Intelligence (1387), Springer | 1998 | Speech | |
| Hybrid Connectionist Models for Continuous Speech Recognition | H. Bourlard and N. Morgan | Chapter in Automatic Speech and Speaker Recognition - Advanced Topics, Lee, Paliwal and Soong, eds., pp. 259-283, Kluwer Academic Press | 1996 | Speech | |
| Hunting for Wolves in Speaker Recognition | L. Stoll and G. Doddington | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2010), Brno, Czech Republic, pp. 159-164 | June 2010 | Speech | [PDF]
|
| How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johnson | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| How to Build a Spoken Dialog System with Limited (or No) Resources | M. Plauché, O. Cetin, and N. Udhaykumar | Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India | January 2007 | Speech | |
| How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]
|
| Hooking Up Spectro-Temporal Filters with Auditory-Inspired Representations for Robust Automatic Speech Recognition | B. Meyer, C. Spille, B. Kollmeier, and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| Hill-Climbing Feature Selection for Multi-Stream ASR | D. Gelbart, N. Morgan, and A. Tsymbal | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970 | September 2009 | Speech | [PDF]
|
| Hill-Climbing Ensemble Feature Selection with a Larger Ensemble | D. Gelbart | ICSI Technical Report TR-09-001 | February 2009 | Speech | [PDF]
|
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR System | F. Valente, M. Magimai-Doss, C. Plahl, and S. Ravuri | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966 | September 2009 | Speech | [PDF]
|
| Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | IEEE Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
|
| Global Posterior Probability Estimates as Decision Confidence Measures in an Automatic Speech Recognition System | W. Warren | Ph.D. Dissertation, University of California at Berkeley | December 2000 | Speech | |
| Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System | W. Warren | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Getting the Last Laugh: Automatic Laughter Segmentation in Meetings | M. Knox, N. Morgan, and N. Mirghafori | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800 | September 2008 | Speech | [PDF]
|
| Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures | I. Bulyko, M. Ostendorf, and A. Stolcke | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9 | May 2003 | Speech | [PDF]
|
| Genre Effects on Automatic Sentence Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversations | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704 | April 2009 | Speech | [PDF]
|
| Generative and Discriminative Methods Using Morphological Information for Sentence Segmentation of Turkish | U. Guz, B. Favre, D. Hakkani-Tur, and G. Tur | IEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 895-903 | July 2009 | Speech | [PDF]
|
| Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition | A. O. Hatch and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 585-588 | May 2006 | Speech | [PDF]
|
| Generalization and Parameter Estimation in Feedforward Nets: Some Experiments | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-017. Also published in Advances in Neural Information Processing Systems, Vol. II, pp. 630-637, 1990. | 1989 | Speech | |
| GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition | Y. Konig and N. Morgan | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China, pp. II-332-337 | 1992 | Speech | |
| Gappy Phrasal Alignment by Agreement | M. Bansal, C. Quirk, and R. C. Moore | Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, pp. 1308-1317 | June 2011 | Speech | [PDF]
|
| Fusing Short Term and Long Term Features for Improved Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4077-4080 | April 2009 | Speech | [PDF]
|
| Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475 | July 2005 | Speech | [PDF]
|
| From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System | N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of the International Conference on Spoken Language Processing, Jeju, Korea | October 2004 | Speech | [PDF]
|
| From Here to Utility - Melding Phonetic Insight with Speech Technology | S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| From AUDREY to Siri: Is Speech Recognition A Solved Problem? | R. Pieraccini | Presented at the Mobile Voice Conference, San Francisco, California | March 2012 | Speech | [PDF]
|
| Friends and Enemies: A Novel Initialization for Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 689-692 | September 2006 | Speech | [PDF]
|
| Forms of English Function Words - Effects of Disfluencies, Turn Position, Age and Sex, and Predictability | A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand, and D. Gildea | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 395-398 | August 1999 | Speech | [PDF]
|
| fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains | J. Zheng and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1573-1576 | August 2007 | Speech | [PDF]
|
| Finding Difficult Speakers in Automatic Speaker Recognition | L. Stoll | UC Berkeley PhD thesis, Berkeley, California | December 2011 | Speech | [PDF]
|
| Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections | M. Huijbregts, C. Wooters, and R. Ordelman | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2925-2928 | August 2007 | Speech | |