| RASTA Processing of Speech | H. Hermansky and N. Morgan | IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, Vol. 2, No. 4, pp. 578-589 | October 1994 | Speech | |
| Connectionist Probability Estimators in HMM Speech Recognition | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | IEEE Transactions on Speech and Audio Processing, pp. II-161-174, | January 1993 | Speech | |
| Continuous Speech Recognition by Connectionist Statistical Methods | H. Bourlard and N. Morgan | IEEE Transactions on Neural Networks, Vol. 4, No. 6, pp. 893-909 | November 1993 | Speech | |
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|
| Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
|
| The ICSI RT-09 Speaker Diarization System | G. Friedland, A. Janin, D. Imseng, X. Anguera, L. Gottlieb, M. Huijbregts, M. Knox, and O. Vinyals | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 371-381 | February 2012 | Speech | [PDF]
|
| Speaker Diarization: A Review of Recent Research | X. Anguera, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. Vinyals | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 356-370 | February 2012 | Speech | [PDF]
|
| Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 7-13 | January 2012 | Speech | [PDF]
|
| Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]
|
| Tuning-Robust Initialization Methods for Speaker Diarization | D. Imseng and G. Friedland | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 8, pp. 2028-2037 | November 2010 | Speech | [PDF]
|
| The CALO Meeting Assistant System | G. Tur, A. Stolcke, L. Voss, S. Peters, D. Hakkani-Tür, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1601-1611 | August 2010 | Speech | [PDF]
|
| Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
|
| Prosodic and Other Long-Term Features for Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 5, pp. 985-993 | July 2009 | Speech | [PDF]
|
| Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning | 2011 | Speech | [PDF]
|
| Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single Microphone | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860 | May 2011 | Speech | |
| Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech | U. Guz, S. Cuendet, G. Tur, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329 | February 2010 | Speech | [PDF]
|
| Acoustic Beamforming for Speaker Diarization of Meetings | X. Anguera, C. Wooters, and J. Hernando | IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 2011-2022 | September 2007 | Speech | |
| Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
|
| Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies | Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1526-1540 | September 2006 | Speech | [PDF]
|
| Introduction to the Special Issue on Processing Morphologically Rich Languages | R. Sarikaya, K. Kirchhoff, T. Schultz, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 861-862 | July 2009 | Speech | [PDF]
|
| Special Section on New Frontiers in Rich Transcription | G. Friedland, J. Fiscus, T. Hain, and S. Furui (eds) | IEEE Transactions in Audio, Speech, and Language Processing, Vol. 20, No. 2 | February 2012 | Speech | |
| Why Is ASR Harder For Fast Speech And What Can We Do About It? | N. Mirghafori, E. Fosler, and N. Morgan | IEEE Snowbird Workshop '95 | 1995 | Speech | [PDF]
|
| Transition-Based Statistical Training for ASR | N. Morgan, Y. Konig, S.L. Wu, and H. Bourlard | IEEE Snowbird Workshop '95 | 1995 | Speech | [PDF]
|
| Updated MINDS Report on Speech Recognition and Understanding, Part 2 | J. Baker, L. Deng, S. Khudanpur, C.-H. Lee, J. Glass, N. Morgan, and D. O'Shgughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 4, pp. 78-85 | July 2009 | Speech | [PDF]
|
| Research Developments and Directions in Speech Recognition and Understanding, Part 1 | J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80 | May 2009 | Speech | |
| Speech Segmentation and Spoken Document Processing | M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters | IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69 | May 2008 | Speech | [PDF]
|
| Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech | |
| An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition | N. Morgan and H. Bourlard | IEEE Signal Processing Magazine, pp. 25-42 | May 1995 | Speech | [PDF]
|
| A Training Algorithm for Statistical Sequence Recognition with Applications to Transition-Based Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | IEEE Signal Processing Letters, pp. 203-205 | July 1996 | Speech | |
| The challenges of IT research in developing regions | E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauché, and S. Surana | IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23 | April 2006 | Speech | |
| Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
|
| Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future | G. Friedland | IEEE International Conference on Semantic Computing, Irvine, California | September 2007 | Speech | |
| Computers and Commerce: A Study of Technology and Management at Eckert-Mauchly Computer Company, Engineering Research Associates, and Remington Rand, 1946-1957 (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, No. 2, IEEE Computer Society, California, pp. 74-77 | June 2007 | Speech | |
| The Digital Hand, Vol 2 - How Computers Changed the Work of the American Financial, Telecommunications, Media, and Entertainment Industries (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, Issue 3, IEEE Computer Society, California, pp. 72-75 | July 2007 | Speech | [PDF]
|
| Probability Estimation by Feed-forward Networks in Continuous Speech Recognition | S. Renals, N. Morgan, and H. Bourlard | ICSI Technical Report TR-91-030. Also published in Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 309-318 | 1991 | Speech | |
| Hill-Climbing Ensemble Feature Selection with a Larger Ensemble | D. Gelbart | ICSI Technical Report TR-09-001 | February 2009 | Speech | [PDF]
|
| Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-033 | 1989 | Speech | |
| Generalization and Parameter Estimation in Feedforward Nets: Some Experiments | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-017. Also published in Advances in Neural Information Processing Systems, Vol. II, pp. 630-637, 1990. | 1989 | Speech | |
| Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]
|
| Meeting Recorder Project: Dialog Act Labeling Guide | R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg | ICSI Technical Report TR-04-002 | February 2004 | Speech | [PDF]
|
| Scaling Up: Learning Large-Scale Recognition Methods from Small-Scale Recognition Tasks | N. Morgan, B. Chen, Q. Zhu, and A. Stolcke | ICSI Technical Report tr-03-02. Also Special Workshop in Maui(SWIM) paper 218. | 2004 | Speech | [PDF]
|
| MLP-Based Feature Extraction for Speech Transcription | N. Morgan, A. Faria, S. Ravuri, and S. Zhao | Handbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press | 2010 | Speech | |
| Analytics for Experts | G. Friedland | Featured paper in ACM SIGMM Records, Vol. 1, Issue 1 | March 2009 | Speech | [PDF]
|
| Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
|
| Data-Driven Speaker and Subword Unit Clustering in Speech Processing | M. Hersch | EPFL Diploma Thesis, ICSI | March 2003 | Speech | [PDF]
|
| Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech | |
| Automatic Speech Recognition | H. Hermansky, and N. Morgan | Encyclopedia of Cognitive Science, Nature Publishing Group, London | 2003 | Speech | |
| Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech | |
| A Study in Machine Learning from Imbalanced Data for Sentence Boundary Detection in Speech | Y. Liu, N.V. Chawla, M.P. Harper, E. Shriberg, and A. Stolcke | Computer Speech and Language, Vol. 20, Issue 4, pp. 468-494 | October 2006 | Speech | [PDF]
|
| Midlevel Representations for Computational Auditory Scene Analysis: The Weft Element | D. Ellis and D. Rosenthal | Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272 | 1998 | Speech | |