| Features Based on Auditory Physiology and Perception | R. M. Stern and N. Morgan | In Techniques for Noise Robustness in Automatic Speech Recognition, T. Virtanen, B. Raj, and R. Singh, Wiley Publishing | 2012 | Speech | |
| Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification | M. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, and S. Kajarekar | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2015-2018 | September 2009 | Speech | |
| Feature Transformations and Combinations for Improving ASR Performance | P. Somervuo, B. Chen, and Q. Zhu | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora Database | S. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120 | June 2000 | Speech | [PDF]
|
| Fast Speakers in Large Vocabulary Continuous Speech Recognition: Analysis & Antidotes | N. Mirghafori, E. Fosler, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Fast Speaker Diarization Using a High-Level Scripting Language | E. Gonina, G. Friedland, H. Cook, and K. Keutzer | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), Big Island, Hawaii | December 2011 | Speech | [PDF]
|
| Fast Consensus Decoding over Translation Forests | J. DeNero, D. Chiang, and K. Knight | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Far-Field ASR on Inexpensive Microphones | L. Docio, D. Gelbart, and N. Morgan | Proceedings of Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2141-2144 | September 2003 | Speech | [PDF]
|
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Factoring Networks by a Statistical Method | N. Morgan and H. Bourlard | Neural Computation, Vol. 4 No. 6, pp. 835-838 | 1992 | Speech | [PDF]
|
| Factored Language Models and Generalized Parallel Backoff | J. Bilmes and K. Kirchhoff | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1 | May 2003 | Speech | [PDF]
|
| Exploiting User Feedback for Language Model Adaptation in Meeting Recognition | D. Vergyri, A. Stolcke, and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740 | April 2009 | Speech | [PDF]
|
| Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks | D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 330-333 | August 2007 | Speech | [PDF]
|
| Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification | F. Yang, G. Tur, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944 | April 2008 | Speech | [PDF]
|
| Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech | |
| Experiments with Temporal Resolution for Continuous Speech Recognition with Multi-Layer Perceptrons | N. Morgan, C. Wooters, H. Hermansky, H. Bourlard | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 405-410 | 1991 | Speech | |
| Experiments with Linear and Nonlinear Feature Transformations in HMM Based Phone Recognition | P. Somervuo | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Evaluation of Semantic Role Labeling and Dependency Parsing of Automatic Speech Recognition Output | B. Favre, B. Bohnet, D. Hakkani-Tür | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5342-5345 | March 2010 | Speech | [PDF]
|
| Evaluating Long-term Spectral Subtraction for Reverberant ASR | D. Gelbart and N. Morgan | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy | December 2001 | Speech | [PDF]
|
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems | L. Hennebert, C. Ris, H. Bourlard, S Renals, and N. Morgan | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, pp. 1951-1954 | September 1997 | Speech | |
| Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 2197-2200 | April 2008 | Speech | [PDF]
|
| Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single Microphone | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860 | May 2011 | Speech | |
| Entropy Based Classifier Combination for Sentence Segmentation | M. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192 | April 2007 | Speech | [PDF]
|
| Ensemble Feature Selection for Multi-stream Automatic Speech Recognition | D. Gelbart | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies | Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1526-1540 | September 2006 | Speech | [PDF]
|
| Efficient Sentence Segmentation Using Syntactic Features | B. Favre, D. Hakkani-Tur, S. Petrov, and D. Klein | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 77-80 | December 2008 | Speech | [PDF]
|
| Efficient Pitch-Based Estimation of VTLN Warp Factors | A. Faria and D. Gelbart | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 213-216 | September 2005 | Speech | [PDF]
|
| Efficient Parsing of Syntactic and Semantic Dependency Structures | B. Bohnet | Presented at the 13th Conference on Computational Natural Language Learning (CoNLL-2009), Boulder, Colorado | June 2009 | Speech | [PDF]
|
| Efficient Parsing for Transducer Grammars | J. DeNero, M. Bansal, A. Pauls, and D. Klein | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 227-235. | May 2009 | Speech | [PDF]
|
| Efficient Data Selection for Machine Translation | A. Mandal, D. Vergyri, W. Wang, J. Zheng, A. Stolcke, G. Tür, D. Hakkani-Tür, and N. Fazil Ayan | Proceedings of IEEE/ACL Workshop on Spoken Language Technologies (SLT), Goa, India, pp. 261-264 | December 2008 | Speech | [PDF]
|
| Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification | E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612 | September 2008 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Predictability on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands | May 1998 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Frequency on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Speech Communication Vol. 29, No. 2-4, pp. 137-158 | November 1999 | Speech | [PDF]
|
| Effective Arabic Dialect Classification Using Diverse Phonotactic Models | M. Akbacak, D. Vergyri, A. Stolcke, N. Scheffer, and A. Mandal | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 737-740 | August 2011 | Speech | [PDF]
|
| EEG Signal Compression Based on Classified Signature and Envelope Vector Sets | H. Gurkan, U. Guz, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain, pp. 420-423 | August 2007 | Speech | |
| Educational Multimedia Systems: The Past, the Present, and a Glimpse into the Future | G. Friedland, W. Huerst, and L. Knipping | Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 1-4 | September 2007 | Speech | |
| Educational Multimedia | G. Friedland, L. Knipping, and W. Huerst (guest editors) | Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008 | July 2008 | Speech | [PDF]
|
| Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | |
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D Dissertation, University of California at Berkeley | August 1999 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Autmoatic Speech Recognition | E. Fosler-Lussier | Ph.D. Thesis, UC Berkeley, Fall 1999, ICSI Technical Report TR-99-015 | September 1999 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification | G. Tur, E. Shriberg, A. Stolcke, and S. Kajarekar | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech--Eurospeech 2008), Antwerp, Belgium, pp. 2049-2052 | August 2007 | Speech | [PDF]
|
| Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition | D. Gelbart and N. Morgan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Don't Multiply Lightly: Quantifying Problems with the Acoustic Model Assumptions in Speech Recognition | D. Gillick, L. Gillick, and S. Wegmann | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Big Island, Hawaii | December 2011 | Speech | [PDF]
|
| Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions? | E. Shriberg, S. Kajarekar, and N. Scheffer | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554 | September 2009 | Speech | [PDF]
|
| Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data? | A. Venkataraman, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780 | September 2005 | Speech | [PDF]
|
| Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM Framework | D. Gillick and S. Wegmann, L. Gillick | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Discriminative Training for Hierarchical Clustering in Speaker Diarization | O. Vinyals, G. Friedland, and N. Morgan | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2326-2329 | September 2010 | Speech | [PDF]
|
| Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error Criterion | O. Vinyals, L. Deng, D. Yu, and A. Acero | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448 | April 2009 | Speech | [PDF]
|