Publication Search Results

Title | Author | Bibliographic | Date | Group | Links
RASTA Processing of Speech | H. Hermansky and N. Morgan | IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, Vol. 2, No. 4, pp. 578-589 | October 1994 | Speech
Connectionist Probability Estimators in HMM Speech Recognition | S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco | IEEE Transactions on Speech and Audio Processing, pp. II-161-174 | January 1993 | Speech
Continuous Speech Recognition by Connectionist Statistical Methods | H. Bourlard and N. Morgan | IEEE Transactions on Neural Networks, Vol. 4, No. 6, pp. 893-909 | November 1993 | Speech
Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
The ICSI RT-09 Speaker Diarization System | G. Friedland, A. Janin, D. Imseng, X. Anguera, L. Gottlieb, M. Huijbregts, M. Knox, and O. Vinyals | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 371-381 | February 2012 | Speech | [PDF]
Speaker Diarization: A Review of Recent Research | X. Anguera, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. Vinyals | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 356-370 | February 2012 | Speech | [PDF]
Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 7-13 | January 2012 | Speech | [PDF]
Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]
Tuning-Robust Initialization Methods for Speaker Diarization | D. Imseng and G. Friedland | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 8, pp. 2028-2037 | November 2010 | Speech | [PDF]
The CALO Meeting Assistant System | G. Tur, A. Stolcke, L. Voss, S. Peters, D. Hakkani-Tür, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1601-1611 | August 2010 | Speech | [PDF]
Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
Prosodic and Other Long-Term Features for Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 5, pp. 985-993 | July 2009 | Speech | [PDF]
Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning | 2011 | Speech | [PDF]
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single Microphone | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | IEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860 | May 2011 | Speech
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech | U. Guz, S. Cuendet, G. Tur, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329 | February 2010 | Speech | [PDF]
Acoustic Beamforming for Speaker Diarization of Meetings | X. Anguera, C. Wooters, and J. Hernando | IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 2011-2022 | September 2007 | Speech
Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies | Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1526-1540 | September 2006 | Speech | [PDF]
Introduction to the Special Issue on Processing Morphologically Rich Languages | R. Sarikaya, K. Kirchhoff, T. Schultz, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 861-862 | July 2009 | Speech | [PDF]
Special Section on New Frontiers in Rich Transcription | G. Friedland, J. Fiscus, T. Hain, and S. Furui (eds) | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, No. 2 | February 2012 | Speech
Why Is ASR Harder For Fast Speech And What Can We Do About It? | N. Mirghafori, E. Fosler, and N. Morgan | IEEE Snowbird Workshop '95 | 1995 | Speech | [PDF]
Transition-Based Statistical Training for ASR | N. Morgan, Y. Konig, S.L. Wu, and H. Bourlard | IEEE Snowbird Workshop '95 | 1995 | Speech | [PDF]
Updated MINDS Report on Speech Recognition and Understanding, Part 2 | J. Baker, L. Deng, S. Khudanpur, C.-H. Lee, J. Glass, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 4, pp. 78-85 | July 2009 | Speech | [PDF]
Research Developments and Directions in Speech Recognition and Understanding, Part 1 | J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80 | May 2009 | Speech
Speech Segmentation and Spoken Document Processing | M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tür, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters | IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69 | May 2008 | Speech | [PDF]
Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sönmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech
An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition | N. Morgan and H. Bourlard | IEEE Signal Processing Magazine, pp. 25-42 | May 1995 | Speech | [PDF]
A Training Algorithm for Statistical Sequence Recognition with Applications to Transition-Based Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | IEEE Signal Processing Letters, pp. 203-205 | July 1996 | Speech
The challenges of IT research in developing regions | E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauché, and S. Surana | IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23 | April 2006 | Speech
Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future | G. Friedland | IEEE International Conference on Semantic Computing, Irvine, California | September 2007 | Speech
Computers and Commerce: A Study of Technology and Management at Eckert-Mauchly Computer Company, Engineering Research Associates, and Remington Rand, 1946-1957 (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, No. 2, IEEE Computer Society, California, pp. 74-77 | June 2007 | Speech
The Digital Hand, Vol 2 - How Computers Changed the Work of the American Financial, Telecommunications, Media, and Entertainment Industries (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, Issue 3, IEEE Computer Society, California, pp. 72-75 | July 2007 | Speech | [PDF]
Probability Estimation by Feed-forward Networks in Continuous Speech Recognition | S. Renals, N. Morgan, and H. Bourlard | ICSI Technical Report TR-91-030. Also published in Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 309-318 | 1991 | Speech
Hill-Climbing Ensemble Feature Selection with a Larger Ensemble | D. Gelbart | ICSI Technical Report TR-09-001 | February 2009 | Speech | [PDF]
Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-033 | 1989 | Speech
Generalization and Parameter Estimation in Feedforward Nets: Some Experiments | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-017. Also published in Advances in Neural Information Processing Systems, Vol. II, pp. 630-637, 1990. | 1989 | Speech
Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]
Meeting Recorder Project: Dialog Act Labeling Guide | R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg | ICSI Technical Report TR-04-002 | February 2004 | Speech | [PDF]
Scaling Up: Learning Large-Scale Recognition Methods from Small-Scale Recognition Tasks | N. Morgan, B. Chen, Q. Zhu, and A. Stolcke | ICSI Technical Report tr-03-02. Also Special Workshop in Maui (SWIM) paper 218. | 2004 | Speech | [PDF]
MLP-Based Feature Extraction for Speech Transcription | N. Morgan, A. Faria, S. Ravuri, and S. Zhao | Handbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press | 2010 | Speech
Analytics for Experts | G. Friedland | Featured paper in ACM SIGMM Records, Vol. 1, Issue 1 | March 2009 | Speech | [PDF]
Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
Data-Driven Speaker and Subword Unit Clustering in Speech Processing | M. Hersch | EPFL Diploma Thesis, ICSI | March 2003 | Speech | [PDF]
Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech
Automatic Speech Recognition | H. Hermansky and N. Morgan | Encyclopedia of Cognitive Science, Nature Publishing Group, London | 2003 | Speech
Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech
A Study in Machine Learning from Imbalanced Data for Sentence Boundary Detection in Speech | Y. Liu, N.V. Chawla, M.P. Harper, E. Shriberg, and A. Stolcke | Computer Speech and Language, Vol. 20, Issue 4, pp. 468-494 | October 2006 | Speech | [PDF]
Midlevel Representations for Computational Auditory Scene Analysis: The Weft Element | D. Ellis and D. Rosenthal | Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272 | 1998 | Speech
