| Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification | F. Yang, G. Tur, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944 | April 2008 | Speech | [PDF]
|
| Using Prosodic and Lexical Information for Speaker Identification | F. Weber, L. Manganaro, B. Peskin, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR System | F. Valente, M. Magimai-Doss, C. Plahl, and S. Ravuri | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966 | September 2009 | Speech | [PDF]
|
| A Comparative Large Scale Study of MLP Features for Mandarin ASR | F. Valente, M. Magimai Doss, C. Plahl, S. Ravuri, and W. Wang | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2630-2363 | September 2010 | Speech | [PDF]
|
| Detecting Deception Using Critical Segments | F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2281-2284 | August 2007 | Speech | [PDF]
|
| Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions? | E. Shriberg, S. Kajarekar, and N. Scheffer | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554 | September 2009 | Speech | [PDF]
|
| The ICSI Meeting Recorder Dialog Act (MRDA) Corpus | E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts | April 2004 | Speech | [PDF]
|
| Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification | E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612 | September 2008 | Speech | [PDF]
|
| Modeling Prosodic Feature Sequences for Speaker Recognition | E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke | Speech Communication, Vol. 46, Issues 3-4, pp. 455-472 | July 2005 | Speech | |
| Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles | E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet | Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics | 2009 | Speech | [PDF]
|
| Prosody-Based Automatic Segmentation of Speech into Sentences and Topics | E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür | Speech Communications, T. Robinson and S. Rendals, eds., Vol. 32, Issue 1-2, pp. 127-154 | September 2000 | Speech | |
| Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | [PDF]
|
| Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| A Text-constrained Prosodic System for Speaker Verification | E. Shriberg and L. Ferrer | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1226-1229 | August 2007 | Speech | [PDF]
|
| Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing | E. Shriberg and A. Stolcke | Proceedings of the International Conference on Speech Prosody, Nara, Japan, March 2004. | March 2004 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag. | 2004 | Speech | [PDF]
|
| The Case for Automatic Higher-Level Features in Forensic Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1509-1512 | September 2008 | Speech | [PDF]
|
| Language-Independent Constrained Cepstral Features for Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299 | May 2011 | Speech | [PDF]
|
| Spontaneous Speech: How People Really Talk, and Why Engineers Should Care | E. Shriberg | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1781-1784 | September 2005 | Speech | [PDF]
|
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| Improving Speech Translation with Automatic Boundary Prediction | E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2449-2452 | August 2007 | Speech | [PDF]
|
| Precise Indoor Localization Using Smart Phones | E. Martin, O. Vinyals, G. Friedland, and R. Bajcsy | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 787-790 | October 2010 | Speech | [PDF]
|
| Fast Speaker Diarization Using a High-Level Scripting Language | E. Gonina, G. Friedland, H. Cook, and K. Keutzer | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), Big Island, Hawaii | December 2011 | Speech | [PDF]
|
| Incorporating Contextual Phonetics Into Automatic Speech Recognition | E. Fosler-Lussier, S. Greenberg, and N. Morgan | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 611-614 | August 1999 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Frequency on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Speech Communication Vol. 29, No. 2-4, pp. 137-158 | November 1999 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Predictability on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands | May 1998 | Speech | [PDF]
|
| Not Just What, But Also When: Guided Automatic Pronunciation Modeling for Broadcast News | E. Fosler-Lussier and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Contextual Word and Syllable Pronunciation Models | E. Fosler-Lussier | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Multi-Level Decision Trees for Static and Dynamic Pronunciation Models | E. Fosler-Lussier | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. I-463-466 | September 1999 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Automatic Speech Recognition | E. Fosler-Lussier | Ph.D Dissertation, University of California at Berkeley | August 1999 | Speech | [PDF]
|
| Dynamic Pronunciation Models for Autmoatic Speech Recognition | E. Fosler-Lussier | Ph.D. Thesis, UC Berkeley, Fall 1999, ICSI Technical Report TR-99-015 | September 1999 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| On Reversing the Generation Process in Optimality Theory | E. Fosler | Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL-96), Santa Cruz, California | 1996 | Speech | [PDF]
|
| The challenges of IT research in developing regions | E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauché, and S. Surana | IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23 | April 2006 | Speech | |
| Investigations Into Tandem Acoustic Modeling for the Aurora Taks | D.P.W. Ellis and M. Reyes | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | |
| Improved Recognition by Combining Different Features and Different Systems | D.P.W. Ellis | Proceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, California | May 2000 | Speech | [PDF]
|
| Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]
|
| Exploiting User Feedback for Language Model Adaptation in Meeting Recognition | D. Vergyri, A. Stolcke, and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740 | April 2009 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| The SuperSID Project: Exploiting High-Level Information for High-Accuracy Speaker Recognition | D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Switchboard-DAMSL Labeling Project Coder's Manual | D. Jurafsky, E. Shriberg, and D. Biasca | Technical Report 97-02, University of Colorado, Institute of Cognitive Science, Boulder, Colorado | 1997 | Speech | [PDF]
|
| Integrating Experimental Models of Syntax, Phonology, and Accent/Dialect in a Speech Recognizer | D. Jurafsky, C.Wooters, G. Tajchman, J. Segal, A. Stolcke, and N. Morgan | Proceedings of the 12th National Conference on Artificial Intelligence (AAAI-94), Seattle, Washington | 1994 | Speech | [PDF]
|
| Using A Stochastic Context-Free Grammar as a Language Model for Speech Recognition | D. Jurafsky, C. Wooters, J. Segal, A. Stolcke, E. Fosler, G. Tajchman, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| The Berkeley Restaurant Project | D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 2139-2142 | September 1994 | Speech | [PDF]
|
| Reduction of English Function Words in Switchboard | D. Jurafsky, A. Bell, E. Fosler-Lussier, C. Girand, and W. Raymond | Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP 98), Sydney, Australia, Vol. 7, p. 3111 | December 1998 | Speech | [PDF]
|
| Robust Speaker Diarization for Short Speech Recordings | D. Imseng and G. Friedland | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437 | December 2009 | Speech | [PDF]
|
| Tuning-Robust Initialization Methods for Speaker Diarization | D. Imseng and G. Friedland | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 8, pp. 2028-2037 | November 2010 | Speech | [PDF]
|
| An Adaptive Initialization Method for Speaker Diarization Based on Prosodic Features | D. Imseng and G. Friedland | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4946-4949 | March 2010 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|