| Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System | W. Warren | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Global Posterior Probability Estimates as Decision Confidence Measures in an Automatic Speech Recognition System | W. Warren | Ph.D. Dissertation, University of California at Berkeley | December 2000 | Speech | |
| Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
|
| Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR System | F. Valente, M. Magimai-Doss, C. Plahl, and S. Ravuri | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966 | September 2009 | Speech | [PDF]
|
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| Hill-Climbing Ensemble Feature Selection with a Larger Ensemble | D. Gelbart | ICSI Technical Report TR-09-001 | February 2009 | Speech | [PDF]
|
| Hill-Climbing Feature Selection for Multi-Stream ASR | D. Gelbart, N. Morgan, and A. Tsymbal | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970 | September 2009 | Speech | [PDF]
|
| Hooking Up Spectro-Temporal Filters with Auditory-Inspired Representations for Robust Automatic Speech Recognition | B. Meyer, C. Spille, B. Kollmeier, and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]
|
| How to Build a Spoken Dialog System with Limited (or No) Resources | M. Plauché, O. Cetin, and N. Uhdaykumar | Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India | January 2007 | Speech | |
| How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johns | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Hunting for Wolves in Speaker Recognition | L. Stoll and G. Doddington | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2010), Brno, Czech Republic, pp. 159-164 | June 2010 | Speech | [PDF]
|
| Hybrid Connnectionist Models for Continuous Speech Recognition | H. Bourlard and N. Morgan | Chapter in Automatic Speech and Speaker Recognition - Advanced Topics, Lee, Paliwal and Soong, eds., pp. 259-283, Kluwer Academic Press | 1996 | Speech | |
| Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions | H. Bourlard and N. Morgan | Adaptive Processing of Sequences and Data Structures, C.L. Giles and M. Gori (Eds.), pp. 389-417, Lecture Notes in Artificial Intelligence (1387), Springer | 1998 | Speech | |
| Hybrid Neural Network / Hidden Markov Model Continuous Speech Recognition | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 915-918 | 1992 | Speech | |
| Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings | X. Anguera, M. Aguilo, C. Wooters, C. Nadeu, and J. Hernando | Proceedings of IEEE Odyssey: The Speaker and Language Recognition Workshop, San Juan de Puerto Rico, pp. 1-6 | June 2006 | Speech | [PDF]
|
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| ICSI's 2005 Speaker Recognition System | N. Mirghafori, A. O. Hatch, S. Stafford, K. Boakye, D. Gillick, and B. Peskin | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 23-28 | November 2005 | Speech | [PDF]
|
| ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields | B. Favre and B. Bohnet | Proceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100 | August 2009 | Speech | [PDF]
|
| Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies | M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg | Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL 04), Barcelona, Spain | July 2004 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891 | September 2009 | Speech | [PDF]
|
| Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme Dynamics | K. Chang, H. Lei, and J. Canny | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88 | August 2011 | Speech | [PDF]
|
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | [PDF]
|
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | |
| Improved Overlapped Speech Handling for Speaker Diarization | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944 | August 2011 | Speech | |
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Improved Recognition by Combining Different Features and Different Systems | D.P.W. Ellis | Proceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, California | May 2000 | Speech | [PDF]
|
| Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings | K. Boakye and A. Stolcke | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965 | September 2006 | Speech | [PDF]
|
| Improving ASR Performance for Reverberant Speech | B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 87-90 | 1997 | Speech | [PDF]
|
| Improving Automatic Sentence Boundary Detection with Confusion Networks | D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg | Proceedings of HLT-NAACL Conference, Boston | April 2004 | Speech | [PDF]
|
| Improving Automatic Speech Recognition by Learning from Human Errors | B. T. Meyer | Proceedings of the 162nd Meeting of the Acoustical Society of America, San Diego, California | October 2011 | Speech | |
| Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms | A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. Shriberg | Proceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262 | June 2010 | Speech | [PDF]
|
| Improving Speech Translation with Automatic Boundary Prediction | E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2449-2452 | August 2007 | Speech | [PDF]
|
| Improving Statistical Speech Recognition | S. Renals, N. Morgan, M. Cohen, H. Franco, H. Bourlard | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China, pp. II-302-307 | 1992 | Speech | |
| Improving the Usability of MedSLT: Back-Translation and the Help System (in Japanese) | Y. Nakao, M. Rayner, N. Chatzichrisafis, K. Kanzaki, P. Bouillon, B.A. Hockey, and H. Isahara | Proceedings of the 12th Annual Meeting of the Japanese Society for Natural Language Processing (NLP2006), Tokyo, Japan | March 2006 | Speech | |
| Improving Word Accuracy with Gabor Feature Extraction | M. Kleinschmidt and D. Gelbart | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Improving Word Sense Disambiguation in Lexical Chaining | M. Galley and K. McKeown | Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI 03), Acapulco, Mexico, pp. 1486-1488 | August 2003 | Speech | [PDF]
|
| Incorporating Contextual Phonetics Into Automatic Speech Recognition | E. Fosler-Lussier, S. Greenberg, and N. Morgan | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 611-614 | August 1999 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu | Ph.D. Thesis, University of California at Berkeley, Spring 1998. Also ICSI Technical Report TR-98-014 | 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| Insights Into Spoken Language Gleaned from Phonetic Transcriptions of the Switchboard Corpus | S. Greenberg, J. Hollenback, and D. Ellis | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Integrating Experimental Models of Syntax, Phonology, and Accent/Dialect in a Speech Recognizer | D. Jurafsky, C.Wooters, G. Tajchman, J. Segal, A. Stolcke, and N. Morgan | Proceedings of the 12th National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington | 1994 | Speech | [PDF]
|
| Integrating Prosodic Features in Extractive Meeting Summarization | S. Xie, D. Hakkani-Tür, B. Favre, and Y. Liu | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391 | December 2009 | Speech | [PDF]
|
| Integrating RASTA-PLP into Speech Recognition | J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch, and G. Tong | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, pp. I-421-424 | 1994 | Speech | |
| Integrating Syllable Boundary Information Into Speech Recognition | S.L. Wu, M. Shire, S. Greenberg, and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 987-990 | April 1997 | Speech | [PDF]
|
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679 | June 2007 | Speech | |
| Introduction to Multimedia Computing | G. Friedland and R. Jain | Cambridge University Press | 2011 | Speech | |