| GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition | Y. Konig and N. Morgan | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China, pp. II-332-337 | 1992 | Speech | |
| Improving Statistical Speech Recognition | S. Renals, N. Morgan, M. Cohen, H. Franco, and H. Bourlard | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China, pp. II-302-307 | 1992 | Speech | |
| Context-Dependent Connectionist Probability Estimation in a Hybrid HMM-Neural Net Speech Recognition System | H. Franco, M. Cohen, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the International Joint Conference on Neural Networks, (IJCNN '92), Beijing, China | 1992 | Speech | |
| Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition | H. Bourlard and N. Morgan | Proceedings of the International Joint Conference on Neural Networks (IJCNN '91), Singapore | 1991 | Speech | |
| Automatic Transcription of Prosodic Stress for Spontaneous English Discourse | R. Silipo and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 2351-2354 | August 1999 | Speech | [PDF]
|
| Statistical Acoustic Indications of Coarticulation | K. Kirchhoff and J. Bilmes | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1729-1732 | August 1999 | Speech | [PDF]
|
| Syllable Detection and Segmentation Using Temporal Flow Neural Networks | L. Shastri, S. Chang, and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 1721-1724 | August 1999 | Speech | [PDF]
|
| Incorporating Contextual Phonetics Into Automatic Speech Recognition | E. Fosler-Lussier, S. Greenberg, and N. Morgan | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 611-614 | August 1999 | Speech | [PDF]
|
| Forms of English Function Words - Effects of Disfluencies, Turn Position, Age and Sex, and Predictability | A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand, and D. Gildea | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 1, pp. 395-398 | August 1999 | Speech | [PDF]
|
| Hybrid Neural Network / Hidden Markov Model Continuous Speech Recognition | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 915-918 | 1992 | Speech | |
| Connectionist Gender Adaptation in a Hybrid Neural Network / Hidden Markov Model Speech Recognition System | V. Abrash, M. Cohen, H. Franco, N. Morgan, and Y. Konig | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 911-914 | 1992 | Speech | |
| Towards Handling the Acoustic Environment in Spoken Language Processing | H. Hermansky and N. Morgan | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 85-88 | 1992 | Speech | |
| Acoustic Sub-word Models in the Berkeley Restaurant Project | C. Wooters and N. Morgan | Proceedings of the International Conference on Spoken Language Processing (ICSLP'92), pp. 1551-1554 | 1992 | Speech | |
| Speech Data Modeling at WS96: The Questionable Parameter Group | N. Morgan | Proceedings of the International Conference on Spoken Language Processing (ICSLP 96), Addendum, Philadelphia, Pennsylvania, pp. 30-31 | 1996 | Speech | |
| Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing | E. Shriberg and A. Stolcke | Proceedings of the International Conference on Speech Prosody, Nara, Japan | March 2004 | Speech | [PDF]
|
| The Relation of Stress Accent to Pronunciation Variation in Spontaneous American English Discourse | S. Greenberg, H.M. Carvey, and L. Hitchcock | Proceedings of the International Conference on Speech Prosody 2002, Aix-en-Provence, France | April 2002 | Speech | |
| Big Dumb Neural Nets: A Working Brute Force Approach to Speech Recognition | N. Morgan | Proceedings of the International Conference on Neural Networks, Vol. VII, pp. 4462-4465 | 1994 | Speech | |
| Speech Intelligibility Derived From Asynchronous Processing of Auditory-Visual Information | K.W. Grant and S. Greenberg | Proceedings of the International Conference on Auditory-Visual Speech Processing Workshop (AVSP 2001), Scheelsminde, Denmark | September 2001 | Speech | [PDF]
|
| Towards Robustness to Fast Speech in ASR | N. Mirghafori, E. Fosler, and N. Morgan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Atlanta, Georgia | 1996 | Speech | [PDF]
|
| REMAP - Experiments with Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Atlanta, Georgia | May 1996 | Speech | [PDF]
|
| Audio Information Access from Meeting Rooms | S. Renals and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Multi-Channel Source Separation by Factorial HMMs | M.J. Reyes-Gomez, B. Raj, and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| A New Speaker Change Detection Method for Two-Speaker Segmentation | A. Adami, S. Kajarekar, and H. Hermansky | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Experiments with Temporal Resolution for Continuous Speech Recognition with Multi-Layer Perceptrons | N. Morgan, C. Wooters, H. Hermansky, and H. Bourlard | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 405-410 | 1991 | Speech | |
| Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach | N. Mirghafori, N. Morgan, and H. Bourlard | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, Greece, pp. 289-298 | 1994 | Speech | |
| Connectionist-Based Acoustic Word Models | C. Wooters and N. Morgan | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, Copenhagen, Denmark, pp. 157-163 | 1992 | Speech | |
| Temporal Signal Processing for ASR | N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 9-16 | 1999 | Speech | |
| Joint Distributional Modeling with Cross-Correlation Based Features | J. Bilmes | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-97), Santa Barbara, California, pp. 148-155 | 1997 | Speech | [PDF]
|
| Contextual Word and Syllable Pronunciation Models | E. Fosler-Lussier | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Combined Speech and Speaker Recognition With Speaker-adapted Connectionist Models | D. Genoud, D. Ellis, and N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-99), Keystone, Colorado | December 1999 | Speech | [PDF]
|
| Speaker Recognition Using Prosodic and Lexical Features | S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke, H. Bratt, and R. R. Gadde | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands, pp. 19-24 | November 2003 | Speech | [PDF]
|
| Pitch-Based Emphasis Detection for Characterization of Meeting Recordings | L. Kennedy and D. Ellis | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands | November 2003 | Speech | [PDF]
|
| Computational Auditory Scene Analysis Exploiting Speech-Recognition Knowledge | D. Ellis | Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4 | October 1997 | Speech | [PDF]
|
| Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach | N. Mirghafori, N. Morgan, and H. Bourlard | Proceedings of the IEEE Neural Networks for Signal Processing Workshop (NNSP 94), Ermioni, Greece | September 1994 | Speech | [PDF]
|
| On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval | R. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451 | December 2011 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]
|
| Speech Intelligibility in the Presence of Cross-Channel Spectral Asynchrony | T. Arai and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936 | May 1998 | Speech | [PDF]
|
| Language Model Combination and Adaptation Using Weighted Finite State Transducers | X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas | March 2010 | Speech | |
| Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Spectro-Temporal Gabor Features for Speaker Recognition | H. Lei, B. T. Meyer, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM Framework | D. Gillick, S. Wegmann, and L. Gillick | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johns | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | |
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Language-Independent Constrained Cepstral Features for Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299 | May 2011 | Speech | [PDF]
|
| The SRI NIST 2010 Speaker Recognition Evaluation System | N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295 | May 2011 | Speech | [PDF]
|
| Making the Most from Multiple Microphones in Meeting Recognition | A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995 | May 2011 | Speech | [PDF]
|
| The IBM 2009 GALE Arabic Speech Transcription System | B. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675 | May 2011 | Speech | [PDF]
|
| Bird Species Recognition Combining Acoustic and Sequence Modeling | M. Graciarena, M. Delplanche, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344 | May 2011 | Speech | [PDF]
|