| Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Using Boosting to Improve a Hybrid HMM/Neural Network Speech Recognizer | H. Schwenk | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012 | March 1999 | Speech | [PDF]
|
| Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech Recognition | D. Ellis and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016 | March 1999 | Speech | [PDF]
|
| Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence Values | K. Kirchhoff and J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696 | March 1999 | Speech | [PDF]
|
| Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF]
|
| Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora Database | S. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120 | June 2000 | Speech | [PDF]
|
| Data-driven RASTA Filters in Reverberation | M. Shire and B. Chen | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630 | June 2000 | Speech | [PDF]
|
| Tandem Connectionist Feature Stream Extraction for Conventional HMM Systems | H. Hermansky, D. Ellis, and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1635-1638 | June 2000 | Speech | [PDF]
|
| Tandem Acoustic Modeling in Large-Vocabulary Recognition | D. Ellis, R. Singh, and S. Sivadas | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| A Study of Two Dimensional Linear Descriminants For ASR | S. Kajarekar, B. Yegnanarayana, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]
|
| Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System | W. Warren | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Using Prosodic and Lexical Information for Speaker Identification | F. Weber, L. Manganaro, B. Peskin, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Word-Conditioned Phone N-Grams for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256 | April 2007 | Speech | [PDF]
|
| Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction | P. Motlicek, V. Ullal, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268 | April 2007 | Speech | |
| Statistical Sentence Extraction for Information Distillation | D. Hakkani-Tur and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 1-4 | April 2007 | Speech | [PDF]
|
| Entropy Based Classifier Combination for Sentence Segmentation | M. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192 | April 2007 | Speech | [PDF]
|
| Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition | J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636 | April 2007 | Speech | |
| An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling | O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648 | April 2007 | Speech | |
| Manual Transcription of Conversational Speech at the Articulatory Feature Level | K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956 | April 2007 | Speech | |
| Comparing Evaluation Metrics for Sentence Boundary Detection | Y. Liu and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Vol. 4, pp. 185-188, Honolulu, Hawaii | April 2007 | Speech | [PDF]
|
| Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features | G. Friedland, H. Hung, and C. Yeo | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072 | April 2009 | Speech | [PDF]
|
| Fusing Short Term and Long Term Features for Improved Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4077-4080 | April 2009 | Speech | [PDF]
|
| The SRI NIST 2008 Speaker Recognition Evaluation System | S. S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, and T. Bocklet | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4205-4208 | April 2009 | Speech | [PDF]
|
| Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error Criterion | O. Vinyals, L. Deng, D. Yu, and A. Acero | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448 | April 2009 | Speech | [PDF]
|
| Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection | T. Bocklet and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4525-4528 | April 2009 | Speech | [PDF]
|
| Syntactically Informed Models for Comma Prediction | B. Favre, D. Hakkani-Tür, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4697-4700 | April 2009 | Speech | [PDF]
|
| Genre Effects on Automatic Sentee Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversationsnc | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704 | April 2009 | Speech | [PDF]
|
| Exploiting User Feedback for Language Model Adaptation in Meeting Recognition | D. Vergyri, A. Stolcke, and G. Tur | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740 | April 2009 | Speech | [PDF]
|
| Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot Interaction | A. Batliner, S. Steidl, and E. Nöth | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27 | May 2011 | Speech | [PDF]
|
| Bird Species Recognition Combining Acoustic and Sequence Modeling | M. Graciarena, M. Delplanche, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344 | May 2011 | Speech | [PDF]
|
| The IBM 2009 GALE Arabic Speech Transcription System | B. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675 | May 2011 | Speech | [PDF]
|
| Making the Most from Multiple Microphones in Meeting Recognition | A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995 | May 2011 | Speech | [PDF]
|
| The SRI NIST 2010 Speaker Recognition Evaluation System | N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295 | May 2011 | Speech | [PDF]
|
| Language-Independent Constrained Cepstral Features for Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299 | May 2011 | Speech | [PDF]
|
| Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Spectro-Temporal Gabor Features for Speaker Recognition | H. Lei, B. T. Meyer, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM Framework | D. Gillick and S. Wegmann, L. Gillick | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johns | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | |
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Language Model Combination and Adaptation Using Weighted Finite State Transducers | X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas | March 2010 | Speech | |
| Speech Intelligibility in the Presence of Cross-Channel Spectral Asynchrony | T. Arai and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936 | May 1998 | Speech | [PDF]
|
| Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval | R. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451 | December 2011 | Speech | [PDF]
|
| Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach | N. Mirghafori, N. Morgan, and H. Bourlard | Proceedings of the IEEE Neural Networks for Signal Processing Workshop (NNSP 94), Ermioni, Greece | September 1994 | Speech | [PDF]
|
| Computational Auditory Scene Analysis Exploiting Speech-Recognition Knowledge | D. Ellis | Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4 | October 1997 | Speech | [PDF]
|