| Neural Networks for Statistical Recognition of Continuous Speech | N. Morgan and H. Bourlard | Proceedings of IEEE, Vol. 83, Issue 5, pp. 742-770 | May 1995 | Speech | |
| RASTA Extensions: Robustness to Additive and Convolutional Noise | N. Morgan and H. Hermansky | Proceedings of the Workshop on Speech Processing in Adverse Conditions, pp. 115-118 | 1992 | Speech | |
| MLP-Based Feature Extraction for Speech Transcription | N. Morgan, A. Faria, S. Ravuri, and S. Zhao | Handbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press | 2010 | Speech | |
| Scaling Up: Learning Large-Scale Recognition Methods from Small-Scale Recognition Tasks | N. Morgan, B. Chen, Q. Zhu, and A. Stolcke | ICSI Technical Report tr-03-02. Also Special Workshop in Maui (SWIM) paper 218. | 2004 | Speech | [PDF]
|
| TRAPping Conversational Speech: Extending TRAP/Tandem Approaches to Conversational Telephone Speech Recognition | N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Experiments with Temporal Resolution for Continuous Speech Recognition with Multi-Layer Perceptrons | N. Morgan, C. Wooters, H. Hermansky, and H. Bourlard | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 405-410 | 1991 | Speech | |
| The Meeting Project at ICSI | N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke | Proceedings of the Human Language Technologies Conference, San Diego, California | March 2001 | Speech | [PDF]
|
| Meetings About Meetings: Research at ICSI on Speech in Multiparty Conversations | N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Reducing Errors by Increasing the Error Rate: MLP Acoustic Modeling for Broadcast News Transcription | N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin, and B. Kingsbury | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Speech Recognition Using On-line Estimation of Speaking Rate | N. Morgan, E. Fosler, and N. Mirghafori | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 4, pp. 2079-2082 | September 1997 | Speech | [PDF]
|
| Scaling a Hybrid HMM/MLP System for Large Vocabulary CSR | N. Morgan, G. Tajchman, N. Mirghafori, Y. Konig, and C. Wooters | ARPA Spoken Language Technology Workshop, Morgan Kaufmann, pp. 123-124 | 1994 | Speech | |
| Speech Recognition and the Auditory Perspective | N. Morgan, H. Bourlard, and H. Hermansky | Chapter in Speech Processing in the Auditory System, S. Greenberg and W. Ainsworth, eds., Springer | 2004 | Speech | |
| Stochastic Perceptual Auditory-Event-Based Models for Speech Recognition | N. Morgan, H. Bourlard, S. Greenberg, and H. Hermansky | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1943-1946 | 1994 | Speech | [PDF]
|
| Stochastic Perceptual Models of Speech | N. Morgan, H. Bourlard, S. Greenberg, H. Hermansky, and S.L. Wu | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Continuous Speech Recognition Using PLP Analysis with Multilayer Perceptrons | N. Morgan, H. Hermansky, H. Bourlard, P. Kohn, and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, Toronto, Canada, pp. 49-52 | 1991 | Speech | |
| Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech | |
| Digit Recognition with Stochastic Perceptual Models | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| SPAM: Experiments with Digit Recognition | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the 15th Annual Speech Research Symposium, Baltimore, Maryland | June 1995 | Speech | [PDF]
|
| Transition-Based Statistical Training for ASR | N. Morgan, Y. Konig, S.L. Wu, and H. Bourlard | IEEE Snowbird Workshop '95 | 1995 | Speech | [PDF]
|
| The SRI NIST 2010 Speaker Recognition Evaluation System | N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295 | May 2011 | Speech | [PDF]
|
| Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site | O. Cetin and E. Shriberg | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224 | May 2006 | Speech | [PDF]
|
| Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap | O. Cetin and E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 357-360 | May 2006 | Speech | [PDF]
|
| Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech | |
| An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling | O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648 | April 2007 | Speech | |
| A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition | O. Cheng, J. Dines, and M. Magimai Doss | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 345-348 | April 2007 | Speech | [PDF]
|
| Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings | O. Vinyals and G. Friedland | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, pp. 426-431 | August 2008 | Speech | [PDF]
|
| A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy | O. Vinyals and G. Friedland | Proceedings of the 10th IEEE International Symposium on Multimedia, Berkeley, California, pp. 61-65 | December 2008 | Speech | [PDF]
|
| Modulation Spectrogram Features for Speaker Diarization | O. Vinyals and G. Friedland | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 630-633 | September 2008 | Speech | |
| Comparing Multilayer Perceptron to Deep Belief Network Tandem Features for Robust ASR | O. Vinyals and S. Ravuri | Proceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), Prague, Czech Republic | May 2011 | Speech | [PDF]
|
| Multimodal Indoor Localization: An Audio-Wireless-Based Approach | O. Vinyals, E. Martin, and G. Friedland | Proceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125 | September 2010 | Speech | [PDF]
|
| Discriminative Training for Hierarchical Clustering in Speaker Diarization | O. Vinyals, G. Friedland, and N. Morgan | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2326-2329 | September 2010 | Speech | [PDF]
|
| Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error Criterion | O. Vinyals, L. Deng, D. Yu, and A. Acero | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448 | April 2009 | Speech | [PDF]
|
| A Multilingual Shared Grammar for Recognition and Generation (in French) | P. Bouillon, M. Rayner, B. Novellas, Y. Nakao, M. Santaholma, M. Starlander, and N. Chatzichrisafis | Proceedings of the 13th Conference on Natural Language Processing (TALN 2006), Leuven, Belgium, pp. 93-102 | April 2006 | Speech | |
| A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation | P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58 | May 2005 | Speech | |
| A Methodology for Comparing Grammar-Based and Robust Approaches to Speech Understanding | P. Bouillon, N. Chatzichrisafis, B.A. Hockey, M. Rayner, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1877-1880 | September 2005 | Speech | |
| Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
|
| Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction | P. Motlicek, V. Ullal, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268 | April 2007 | Speech | |
| Experiments with Linear and Nonlinear Feature Transformations in HMM Based Phone Recognition | P. Somervuo | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Speech Modeling Using Variational Bayesian Mixture of Gaussians | P. Somervuo | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Feature Transformations and Combinations for Improving ASR Performance | P. Somervuo, B. Chen, and Q. Zhu | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johnson | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York | November 2004 | Speech | [PDF]
|
| Using MLP Features in SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2141-2144 | September 2005 | Speech | [PDF]
|
| Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | [PDF]
|
| Tandem Connectionist Feature Extraction for Conversational Speech Recognition | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea | October 2004 | Speech | [PDF]
|
| Meeting Acts: A Labeling System for Group Interaction in Meetings | R. Bates, P. Menning, E. Willingham, and C. Kuyper | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
|
| The Challenge of Spoken Language Systems: Research Directions for the Nineties | R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, J. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson, K. McKeown, N. Morgan, D. Novick, M. Ostendorf, S. Oviatt, P. Price, H. Silverman, J. Spitz, A. Waibel, C. Weinstein, S. Zahorian, and V. Zue | IEEE Transactions on Speech and Audio Processing, Vol. 3, No. 1, pp. 1-21 | January 1995 | Speech | |
| Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech | |