| Syllable Intelligibility for Temporally-Filtered LPC Cepstral Trajectories | T. Arai, M. Pavel, H. Hermansky, and C. Avendano | Journal of the Acoustical Society of America, Vol. 105, No. 5, pp. 2783-2791 | May 1999 | Speech | [PDF]
|
| Speech Intelligibility in the Presence of Cross-Channel Spectral Asynchrony | T. Arai and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936 | May 1998 | Speech | [PDF]
|
| The Temporal Properties of Spoken Japanese Are Similar to Those of English | T. Arai and S. Greenberg | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 2, pp. 1011-1014 | September 1997 | Speech | [PDF]
|
| Integrating Syllable Boundary Information Into Speech Recognition | S.L. Wu, M. Shire, S. Greenberg, and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 987-990 | April 1997 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Performance Improvements Through Combining Phone- and Syllable-Length Information in Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, pp. 854-857 | November 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu | Ph.D. Thesis, University of California at Berkeley, Spring 1998. Also ICSI Technical Report TR-98-014 | 1998 | Speech | [PDF]
|
| Anchored Speech Recognition for Question Answering | S. Yaman, G. Tür, D. Vergyri, D. Hakkani-Tür, M. Harper, and W. Wang | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 265-268 | June 2009 | Speech | [PDF]
|
| Classification-Based Strategies for Combining Multiple 5-W Question Answering Systems | S. Yaman, D. Hakkani-Tür, G. Tur, R. Grishman, M. Harper, K. R. McKeown, A. Meyers, and K. Sharma | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2703-2706 | September 2009 | Speech | [PDF]
|
| Combining Semantic and Syntactic Information Sources for 5-W Question Answering | S. Yaman, D. Hakkani-Tür, and G. Tur | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2707-2710 | September 2009 | Speech | [PDF]
|
| A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives | S. Yaman and C.-H. Lee | Journal of Signal Processing Systems, MLSP special issue | November 2008 | Speech | [PDF]
|
| Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASR | S. Y. Zhao, S. Ravuri, and N. Morgan | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954 | September 2009 | Speech | [PDF]
|
| Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]
|
| Integrating Prosodic Features in Extractive Meeting Summarization | S. Xie, D. Hakkani-Tür, B. Favre, and Y. Liu | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391 | December 2009 | Speech | [PDF]
|
| Leveraging Sentence Weights in a Concept-Based Optimization Framework for Extractive Meeting Summarization | S. Xie, B. Favre, D. Hakkani-Tür, and Y. Liu | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1503-1506 | September 2009 | Speech | [PDF]
|
| Using Corpus and Knowledge-Based Similarity Measure in Maximum Marginal Relevance for Meeting Summarization | S. Xie and Y. Liu | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4985-4988 | March 2008 | Speech | [PDF]
|
| Why Has (Reasonably Accurate) Automatic Speech Recognition Been So Hard to Achieve? | S. Wegmann and L. Gillick | ArXiv.org under CoRR abs/1003.0206 | February 2010 | Speech | [PDF]
|
| Data-Driven Design of RASTA-like Filters | S. van Vuuren and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Name-Aware Speech Recognition for Interactive Question Answering | S. Stoyanchev, G. Tur, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116 | April 2008 | Speech | [PDF]
|
| QASR: Question Answering Using Semantic Roles for Speech Interface | S. Stenchikova, D. Hakkani-Tur, and G. Tur | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188 | September 2006 | Speech | |
| Java Visual Speech Components for Rapid Application Development of GUI based Speech Processing Applications | S. Steidl, K. Riedhammer, T. Bocklet, F. Hoenig, and E. Noeth | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 3257-3260 | August 2011 | Speech | |
| The Sequential GMM: A Gaussian Mixture Model Based Speaker Verification System that Captures Sequential Information | S. Stafford | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora Database | S. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120 | June 2000 | Speech | [PDF]
|
| Adaptive Language Modeling with Varied Sources to Cover New Vocabulary Items | S. Schwarm, I. Bulyko, and M. Ostendorf | IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, pp. 334-342 | May 2004 | Speech | [PDF]
|
| The SRI NIST 2008 Speaker Recognition Evaluation System | S. S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, and T. Bocklet | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4205-4208 | April 2009 | Speech | [PDF]
|
| Improving Statistical Speech Recognition | S. Renals, N. Morgan, M. Cohen, H. Franco, and H. Bourlard | Proceedings of the International Joint Conference on Neural Networks (IJCNN '92), Beijing, China, pp. II-302-307 | 1992 | Speech | |
| Connectionist Probability Estimation in the Decipher Speech Recognition System | S. Renals, N. Morgan, M. Cohen, H. Bourlard, and H. Franco | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP 1992), pp. I-601-604 | 1992 | Speech | [PDF]
|
| Connectionist Optimisation of Tied Mixture Hidden Markov Models | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | Advances in Neural Information Processing Systems, Vol. IV, pp. 167-174 | 1991 | Speech | |
| Connectionist Probability Estimators in HMM Speech Recognition | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | IEEE Transactions on Speech and Audio Processing, pp. II-161-174 | January 1993 | Speech | |
| Probability Estimation by Feed-forward Networks in Continuous Speech Recognition | S. Renals, N. Morgan, and H. Bourlard | ICSI Technical Report TR-91-030. Also published in Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 309-318 | 1991 | Speech | |
| Audio Information Access from Meeting Rooms | S. Renals and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Using Spectro-Temporal Features to Improve AFE Feature Extraction for ASR | S. Ravuri and N. Morgan | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 1181-1184 | September 2010 | Speech | |
| Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | |
| Cover Song Detection: From High Scores to General Classification | S. Ravuri and D. Ellis | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 65-68 | March 2010 | Speech | [PDF]
|
| On the Use of Spectro-Temporal Features in Noise-Additive Speech | S. Ravuri | UC Berkeley Master's thesis, Spring 2011 | 2011 | Speech | [PDF]
|
| Detecting Categories in News Video Using Acoustic, Speech, and Image Features | S. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. Malik | Presented at the NIST TREC Video Retrieval Workshop, Gaithersburg, Maryland | November 2006 | Speech | [PDF]
|
| Modeling NERFs for Speaker Recognition | S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain, pp. 51-56 | May 2004 | Speech | [PDF]
|
| Speaker Recognition Using Prosodic and Lexical Features | S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke, H. Bratt, and R. R. Gadde | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands, pp. 19-24 | November 2003 | Speech | [PDF]
|
| A Study of Two Dimensional Linear Discriminants for ASR | S. Kajarekar, B. Yegnanarayana, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Speech Intelligibility Derived From Exceedingly Sparse Spectral Information | S. Greenberg, T. Arai, and R. Silipo | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 74-77 | November 1998 | Speech | [PDF]
|
| The Relation Between Stress Accent and Vocalic Identity in Spontaneous American English Discourse | S. Greenberg, S. Chang, and L. Hitchcock | Proceedings of ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | |
| An Introduction to the Diagnostic Evaluation of the Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg, S. Chang, and J. Hollenback | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| Insights Into Spoken Language Gleaned from Phonetic Transcriptions of the Switchboard Corpus | S. Greenberg, J. Hollenback, and D. Ellis | Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| The Relation of Stress Accent to Pronunciation Variation in Spontaneous American English Discourse | S. Greenberg, H.M. Carvey, and L. Hitchcock | Proceedings of the International Conference on Speech Prosody 2002, Aix-en-Provence, France | April 2002 | Speech | |
| A Space-Time Theory of Pitch and Timbre Based on Cortical Expansion of the Cochlea Traveling Wave Delay | S. Greenberg, D. Poeppel, and T. Roberts | Proceedings of the 11th International Symposium on Hearing, Grantham, United Kingdom | August 1997 | Speech | [PDF]
|
| The Relation Between Speech Intelligibility and the Complex Modulation Spectrum | S. Greenberg and T. Arai | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony | S. Greenberg and T. Arai | Proceedings of the Joint Meeting of the 137th Acoustical Society of America and the 16th International Congress on Acoustics (ICA/ASA), Seattle, Washington, pp. 2677-2678 | June 1998 | Speech | [PDF]
|
| Linguistic Dissection of Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg and S. Chang | Proceedings of the ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, France | 2000 | Speech | [PDF]
|
| The Uninvited Guest: Information's Role in Guiding the Production of Spontaneous Speech | S. Greenberg and E. Fosler-Lussier | Proceedings of the Crest Workshop on Models of Speech Production: Motor Planning and Articulatory Modelling, Kloster Seeon, Germany | May 2000 | Speech | [PDF]
|