| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D Dissertation, University of Oldenberg, Germany | 2002 | Speech | |
| Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System | X. Anguera, C. Wooters, B. Peskin, and M. Aguilo | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 402-414 | July 2005 | Speech | [PDF]
|
| Robust Speaker Diarization for Short Speech Recordings | D. Imseng and G. Friedland | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437 | December 2009 | Speech | [PDF]
|
| Robust Speaker Diarization for Meetings: ICSI TR06 Meetings Evaluation System | X. Anguera, C. Wooters, and J. Pardo | Lecture Notes in Computer Science, Volume 4299, 2006, pp. 346-358, ISSN 0302-9743 | 2006 | Speech | [PDF]
|
| Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system | X. Anguera, C. Wooters, and J. Pardo | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1674-1677 | September 2006 | Speech | [PDF]
|
| Robust Features and Environmental Compensation: A Few Comments | N. Morgan | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 43-44 | 1997 | Speech | [PDF]
|
| Robust ASR Front-End Using Spectral-Based and Discriminant Features: Experiments on the Aurora Tasks | C. Benitez, L. Burget, B. Chen, S. Dupont, H. Garudadri, H. Hermansky, P. Jain, S. Kajarekar, and S. Sivadas | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, pp. 429-432 | September 2001 | Speech | [PDF]
|
| Review of P. Dev and W. Heinrichs, "Learning Medicine Through Collaboration and Action: Collaborative, Experimental, Networked Learning Environments" | G. Friedland | ACM Computing Reviews, CR136993 | June 2009 | Speech | |
| Review of L. Cairco, et al., "AVARI: Animated Virtual Agent Retrieving Information" | G. Friedland | ACM Computing Reviews, CR137225 | August 2009 | Speech | |
| Review of J. Nichols and B. Myers, "Creating a Lightweight User Interface Description Language: An Overview and Analysis of the Personal Universal Controller Project" | G. Friedland | ACM Computing Reviews, CR137773 | March 2010 | Speech | |
| Review of J. Ajmera, et al., "Two-Stream Indexing for Spoken Web Search" | G. Friedland | ACM Computing Reviews, CR139192 | June 2011 | Speech | |
| Review of G. Welch, "History: The Use of the Kalman Filter for Human Motion Tracking in Virtual Reality" | G. Friedland | ACM Computing Reviews, CR137162 | August 2009 | Speech | |
| Review of E. Villalon, “High-Dimensionality Data Reduction in Java” | G. Friedland | ACM Computing Reviews | March 2009 | Speech | |
| Review of E. Aguilar, "Animation and Performance Capture Using Digitized Models" | G. Friedland | ACM Computing Reviews, CR138181 | July 2010 | Speech | |
| Review of Cattelan, et al, "Watch-and-Comment as a Paradigm Toward Ubiquitous Interactive Video Editing" | G. Friedland | ACM Computer Reviews, CR136487 | October 2009 | Speech | |
| Review of C. Simon, et al., "Visual Event Recognition Using Decision Trees" | G. Friedland | ACM Computing Reviews, CR138638 | January 2011 | Speech | |
| Review of C. Mueller-Tomfelder, "Tabletops - Horizontal Interactive Displays" | G. Friedland | ACM Computing Reviews, CR138453 | October 2010 | Speech | [PDF]
|
| Review of A. Rahman, et al., "Spatial-Geometric Approach to Physical Mobile Interaction Based on Accelerometer and IR Sensory Data Fusion" | G. Friedland | ACM Computing Reviews, CR139264 | July 2011 | Speech | |
| Research Developments and Directions in Speech Recognition and Understanding, Part 1 | J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80 | May 2009 | Speech | |
| Reranking for Sentence Boundary Detection in Conversational Speech | B. Roark, Y. Liu, M. Harper, R. Stewart, M. Lease, M. Snover, Z. Shafran, B. Dorr, J. Hale, A. Krasnyanskaya, and L. Young | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 545-548 | May 2006 | Speech | |
| REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities in Connectionist Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the Advances in Neural Information Processing Systems 8 Conference (NIPS 8), Denver, Colorado, pp. 388-394 | November 1995 | Speech | |
| REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the 9th Annual Conference on Neural Information Processing Systems (NIPS 1995), Denver, Colorado | November 1995 | Speech | [PDF]
|
| Remap Modeling for Connectionist Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the 15th Annual Speech Research Symposium, Baltimore, Maryland | June 1995 | Speech | [PDF]
|
| REMAP - Experiments with Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Atlanta, Georgia | May 1996 | Speech | [PDF]
|
| Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Relevance of Time-Frequency Features for Phonetic and SpeakerChannel Classification | H.H. Yan, S. Sharma, S. van Vuuren, and H. Hermansky | Speech Communication,Vol. 1, No. 31, pp. 35-50 | May 2000 | Speech | [PDF]
|
| RelAtive SpecTrAl (RASTA) Processing in Speech Analysis | H. Hermansky and N. Morgan | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| Relating Frame Accuracy with Word Error in Hybrid ANN-HMM ASR | M. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications | M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis | Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788 | May 2006 | Speech | [PDF]
|
| Reduction of English Function Words in Switchboard | D. Jurafsky, A. Bell, E. Fosler-Lussier, C. Girand, and W. Raymond | Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP 98), Sydney, Australia, Vol. 7, p. 3111 | December 1998 | Speech | [PDF]
|
| Reducing the Effect of Room Acoustics on Human-Computer Interaction | D. Gelbart | Proceedings of the Applied Voice Input/Output Society (AVIOS 2002), San Jose, California | May 2002 | Speech | [PDF]
|
| Reducing Errors by Increasing the Error Rate: MLP Acoustic Modeling for Broadcast News Transcription | N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin, and B. Kingsbury | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Recognizing Reverberant Speech With RASTA-PLP | B. Kingsbury and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1259-1262 | April 1997 | Speech | [PDF]
|
| Recognition of Speech in Additive and Convolutional Noise Based on RASTA Spectral Processing | H. Hermansky, N. Morgan, and H.G. Hirsch | Proceedings of the IEEE Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. II-83-86 | 1993 | Speech | |
| Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
|
| Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
|
| RASTA-PLP Speech Analysis Technique | H. Hermansky, N. Morgan, A. Bayya, and P. Kohn | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. I-121-124 | 1992 | Speech | |
| RASTA Processing of Speech | H. Hermansky and N. Morgan | IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, Vol. 2, No. 4, pp. 578-589 | October 1994 | Speech | |
| RASTA Extensions: Robustness to Additive and Convolutional Noise | N. Morgan and H. Hermansky | Proceedings of the Workshop on Speech Processing in Adverse Conditions, pp. 115-118 | 1992 | Speech | |
| Qualcomm-ICSI-OGI Features for ASR | A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| QASR: Question Answering Using Semantic Roles for Speech Interface | S. Stenchikova, D. Hakkani-Tur, and G. Tur | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188 | September 2006 | Speech | |
| Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech | |
| Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-Location | L. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. Friedland | Proceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, Japan | October 2012 | Speech | [PDF]
|
| Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech | |
| Purity Algorithms for Speaker Diarization of Meetings Data | X. Anguera, C. Wooters and J. Hernando | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France | May 2006 | Speech | [PDF]
|
| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Prosody-Based Automatic Segmentation of Speech into Sentences and Topics | E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür | Speech Communications, T. Robinson and S. Rendals, eds., Vol. 32, Issue 1-2, pp. 127-154 | September 2000 | Speech | |
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog | J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | |