| Using Knowledge to Organize Sound: The Prediction-driven Approach to Computational Auditory Scene Analysis and Its Application to Speech/Nonspeech Mixtures | D. Ellis | Speech Communication, Vol. 27, Issue 3-4, pp. 281-298 | 1999 | Speech | |
| The Weft: A Representation for Periodic Sounds | D. Ellis | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1307-1310 | April 1997 | Speech | [PDF]
|
| Computational Auditory Scene Analysis Exploiting Speech-Recognition Knowledge | D. Ellis | Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4 | October 1997 | Speech | [PDF]
|
| Prediction-driven Computational Auditory Scene Analysis for Dense Sound Mixtures | D. Ellis | Proceedings of the ESCA Workshop on the "Auditory Basis of Speech Perception," Keele University, Staffordshire, UK | 1996 | Speech | [PDF]
|
| Midlevel Representations for Computational Auditory Scene Analysis: The Weft Element | D. Ellis and D. Rosenthal | Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272 | 1998 | Speech | |
| Using Mutual Information to Design Feature Combinations | D. Ellis and J. Bilmes | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech Recognition | D. Ellis and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016 | March 1999 | Speech | [PDF]
|
| Tandem Acoustic Modeling in Large-Vocabulary Recognition | D. Ellis, R. Singh, and S. Sivadas | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Reducing the Effect of Room Acoustics on Human-Computer Interaction | D. Gelbart | Proceedings of the Applied Voice Input/Output Society (AVIOS 2002), San Jose, California | May 2002 | Speech | [PDF]
|
| Ensemble Feature Selection for Multi-stream Automatic Speech Recognition | D. Gelbart | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Hill-Climbing Ensemble Feature Selection with a Larger Ensemble | D. Gelbart | ICSI Technical Report TR-09-001 | February 2009 | Speech | [PDF]
|
| Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition | D. Gelbart and N. Morgan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Evaluating Long-term Spectral Subtraction for Reverberant ASR | D. Gelbart and N. Morgan | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy | December 2001 | Speech | [PDF]
|
| SmartKom English: From Robust Recognition to Felicitous Interaction | D. Gelbart, J. Bryants, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan | In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer | November 2004 | Speech | [PDF]
|
| Hill-Climbing Feature Selection for Multi-Stream ASR | D. Gelbart, N. Morgan, and A. Tsymbal | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970 | September 2009 | Speech | [PDF]
|
| Combined Speech and Speaker Recognition With Speaker-adapted Connectionist Models | D. Genoud, D. Ellis, and N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Corpus Variation and Parser Performance | D. Gildea | Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing (EMNLP 2001), Pittsburgh, Pennsylvania | June 2001 | Speech | [PDF]
|
| Automatic Labeling of Semantic Roles | D. Gildea and D. Jurafsky | The 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong, pp. 512-520 | October 2000 | Speech | [PDF]
|
| Topic-Based Language Models Using EM | D. Gildea and T. Hofmann | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. V-2167-2170 | September 1999 | Speech | [PDF]
|
| Can Conversational Word Usage Be Used to Predict Speaker Demographics? | D. Gillick | Proceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan | September 2010 | Speech | [PDF]
|
| A Scalable Global Model for Summarization | D. Gillick and B. Favre | Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 10-18 | June 2009 | Speech | [PDF]
|
| Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM Framework | D. Gillick and S. Wegmann, L. Gillick | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| The ICSI Summarization System at TAC 2008 | D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of Text Analysis Conference (TAC), Gaithersburg, Maryland | November 2008 | Speech | [PDF]
|
| Unsupervised Learning of Edit Parameters for Matching Name Variants | D. Gillick, D. Hakkani-Tur, and M. Levit. | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 467-470 | September 2008 | Speech | [PDF]
|
| A Global Optimization Framework for Meeting Summarization | D. Gillick, K. Riedhammer, B. Favre, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4769-4772 | April 2009 | Speech | [PDF]
|
| Don't Multiply Lightly: Quantifying Problems with the Acoustic Model Assumptions in Speech Recognition | D. Gillick, L. Gillick, and S. Wegmann | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Big Island, Hawaii | December 2011 | Speech | [PDF]
|
| Speaker Detection Without Models | D. Gillick, S. Stafford, and B. Peskin | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760 | March 2005 | Speech | [PDF]
|
| Semi-Autonomous Car Control Using Brain Computer Interfaces | D. Goehring, D. Latotzky, M. Wang, and R. Rojas | Proceedings of the 12th International Conference of Intelligent Autonomous Systems (IAS), Juju Island, Korea | June 2012 | Speech | |
| Towards Automatic Argument Diagramming of Multiparty Meetings | D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4753-4756 | April 2009 | Speech | [PDF]
|
| Statistical Sentence Extraction for Information Distillation | D. Hakkani-Tur and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 1-4 | April 2007 | Speech | [PDF]
|
| Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks | D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 330-333 | August 2007 | Speech | [PDF]
|
| Improving Automatic Sentence Boundary Detection with Confusion Networks | D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg | Proceedings of HLT-NAACL Conference, Boston | April 2004 | Speech | [PDF]
|
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Robust Speaker Diarization for Short Speech Recordings | D. Imseng and G. Friedland | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437 | December 2009 | Speech | [PDF]
|
| Tuning-Robust Initialization Methods for Speaker Diarization | D. Imseng and G. Friedland | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 8, pp. 2028-2037 | November 2010 | Speech | [PDF]
|
| An Adaptive Initialization Method for Speaker Diarization Based on Prosodic Features | D. Imseng and G. Friedland | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4946-4949 | March 2010 | Speech | [PDF]
|
| Reduction of English Function Words in Switchboard | D. Jurafsky, A. Bell, E. Fosler-Lussier, C. Girand, and W. Raymond | Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP 98), Sydney, Australia, Vol. 7, p. 3111 | December 1998 | Speech | [PDF]
|
| The Berkeley Restaurant Project | D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 2139-2142 | September 1994 | Speech | [PDF]
|
| Using A Stochastic Context-Free Grammar as a Language Model for Speech Recognition | D. Jurafsky, C. Wooters, J. Segal, A. Stolcke, E. Fosler, G. Tajchman, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Integrating Experimental Models of Syntax, Phonology, and Accent/Dialect in a Speech Recognizer | D. Jurafsky, C.Wooters, G. Tajchman, J. Segal, A. Stolcke, and N. Morgan | Proceedings of the 12th National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington | 1994 | Speech | [PDF]
|
| Switchboard-DAMSL Labeling Project Coder's Manual | D. Jurafsky, E. Shriberg, and D. Biasca | Technical Report 97-02, University of Colorado, Institute of Cognitive Science, Boulder, Colorado | 1997 | Speech | [PDF]
|
| The SuperSID Project: Exploiting High-Level Information for High-Accuracy Speaker Recognition | D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| Exploiting User Feedback for Language Model Adaptation in Meeting Recognition | D. Vergyri, A. Stolcke, and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740 | April 2009 | Speech | [PDF]
|
| Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]
|
| Improved Recognition by Combining Different Features and Different Systems | D.P.W. Ellis | Proceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, California | May 2000 | Speech | [PDF]
|
| Investigations Into Tandem Acoustic Modeling for the Aurora Taks | D.P.W. Ellis and M. Reyes | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | |
| The challenges of IT research in developing regions | E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauché, and S. Surana | IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23 | April 2006 | Speech | |
| On Reversing the Generation Process in Optimality Theory | E. Fosler | Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL-96), Santa Cruz, California | 1996 | Speech | [PDF]
|