| Language-Independent Constrained Cepstral Features for Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299 | May 2011 | Speech | [PDF]
|
| The SRI NIST 2010 Speaker Recognition Evaluation System | N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295 | May 2011 | Speech | [PDF]
|
| Making the Most from Multiple Microphones in Meeting Recognition | A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995 | May 2011 | Speech | [PDF]
|
| The IBM 2009 GALE Arabic Speech Transcription System | B. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675 | May 2011 | Speech | [PDF]
|
| Bird Species Recognition Combining Acoustic and Sequence Modeling | M. Graciarena, M. Delplanche, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344 | May 2011 | Speech | [PDF]
|
| Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot Interaction | A. Batliner, S. Steidl, and E. Nöth | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27 | May 2011 | Speech | [PDF]
|
| Exploiting User Feedback for Language Model Adaptation in Meeting Recognition | D. Vergyri, A. Stolcke, and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740 | April 2009 | Speech | [PDF]
|
| Genre Effects on Automatic Sentence Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversations | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704 | April 2009 | Speech | [PDF]
|
| Syntactically Informed Models for Comma Prediction | B. Favre, D. Hakkani-Tür, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4697-4700 | April 2009 | Speech | [PDF]
|
| Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection | T. Bocklet and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4525-4528 | April 2009 | Speech | [PDF]
|
| Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error Criterion | O. Vinyals, L. Deng, D. Yu, and A. Acero | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448 | April 2009 | Speech | [PDF]
|
| The SRI NIST 2008 Speaker Recognition Evaluation System | S. S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, and T. Bocklet | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4205-4208 | April 2009 | Speech | [PDF]
|
| Fusing Short Term and Long Term Features for Improved Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4077-4080 | April 2009 | Speech | [PDF]
|
| Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features | G. Friedland, H. Hung, and C. Yeo | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072 | April 2009 | Speech | [PDF]
|
| Comparing Evaluation Metrics for Sentence Boundary Detection | Y. Liu and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 185-188 | April 2007 | Speech | [PDF]
|
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling | O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648 | April 2007 | Speech | |
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| Entropy Based Classifier Combination for Sentence Segmentation | M. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192 | April 2007 | Speech | [PDF]
|
| Statistical Sentence Extraction for Information Distillation | D. Hakkani-Tur and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 1-4 | April 2007 | Speech | [PDF]
|
| Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction | P. Motlicek, V. Ullal, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268 | April 2007 | Speech | |
| Word-Conditioned Phone N-Grams for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256 | April 2007 | Speech | [PDF]
|
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Using Prosodic and Lexical Information for Speaker Identification | F. Weber, L. Manganaro, B. Peskin, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Tandem Acoustic Modeling in Large-Vocabulary Recognition | D. Ellis, R. Singh, and S. Sivadas | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| A Study of Two Dimensional Linear Discriminants For ASR | S. Kajarekar, B. Yegnanarayana, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]
|
| Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System | W. Warren | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Tandem Connectionist Feature Stream Extraction for Conventional HMM Systems | H. Hermansky, D. Ellis, and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1635-1638 | June 2000 | Speech | [PDF]
|
| Data-driven RASTA Filters in Reverberation | M. Shire and B. Chen | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630 | June 2000 | Speech | [PDF]
|
| Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora Database | S. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120 | June 2000 | Speech | [PDF]
|
| Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech Recognition | D. Ellis and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016 | March 1999 | Speech | [PDF]
|
| Using Boosting to Improve a Hybrid HMM/Neural Network Speech Recognizer | H. Schwenk | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012 | March 1999 | Speech | [PDF]
|
| Temporal Patterns (TRAPS) in ASR of Noisy Speech | H. Hermansky and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Combining Multiple Estimators of Speaking Rate | N. Morgan and E. Fosler-Lussier | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 729-732 | May 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Transmissions and Transitions: A Study of Two Common Assumptions in Multi-Band ASR | N. Mirghafori and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 713-716 | May 1998 | Speech | [PDF]
|
| Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional Modeling | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472 | May 1998 | Speech | [PDF]
|
| Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
|
| Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol. 5, pp. 641-644 | May 2004 | Speech | [PDF]
|
| Stochastic Perceptual Models of Speech | N. Morgan, H. Bourlard, S. Greenberg, H. Hermansky, and S.L. Wu | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Using A Stochastic Context-Free Grammar as a Language Model for Speech Recognition | D. Jurafsky, C. Wooters, J. Segal, A. Stolcke, E. Fosler, G. Tajchman, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Supervised and Unsupervised Clustering of the Speaker Space for Connectionist Speech Recognition | Y. Konig and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. I-545-548 | 1993 | Speech | |
| Connectionist Probability Estimation in the Decipher Speech Recognition System | S. Renals, N. Morgan, M. Cohen, H. Bourlard, and H. Franco | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP 1992), pp. I-601-604 | 1992 | Speech | [PDF]
|
| User Verification: Matching the Uploaders of Videos Across Accounts | H. Lei, J. Choi, A. Janin, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407 | May 2011 | Speech | [PDF]
|
| Continuous Speech Recognition Using Multilayer Perceptrons with Hidden Markov Models | H. Bourlard and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP 1990), Albuquerque, New Mexico | 1990 | Speech | |