| On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval | R. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451 | December 2011 | Speech | [PDF]
|
| On the Origins of Speech Intelligibility in the Real World | S. Greenberg | Proceedings of the ESCA Workshop on Robust Speech Recognition, Pont-a-Mousson, France, pp. 23-32 | 1997 | Speech | [PDF]
|
| On the Use of Artificial Conversation Data for Speaker Recognition in Cars | L. Gottlieb and G. Friedland | Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128 | September 2009 | Speech | [PDF]
|
| On the Use of Spectro-Temporal Features in Noise-Additive Speech | S. Ravuri | UC Berkeley Master's thesis, Spring 2011 | 2011 | Speech | [PDF]
|
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea | October 2004 | Speech | [PDF]
|
| Opportunities and Challenges of Parallelizing Speech Recognition | J. Chong, G. Friedland, A. Janin, and N. Morgan | Proceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, California | June 2010 | Speech | [PDF]
|
| Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site | O. Cetin and E. Shriberg | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224 | May 2006 | Speech | [PDF]
|
| Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings | K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356 | April 2008 | Speech | [PDF]
|
| Packing the Meeting Summarization Knapsack | K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437 | September 2008 | Speech | [PDF]
|
| Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach | N. Mirghafori, N. Morgan, and H. Bourlard | Proceedings of the IEEE Neural Networks for Signal Processing Workshop (NNSP 94), Ermioni, Greece | September 1994 | Speech | [PDF]
|
| Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach | N. Mirghafori, N. Morgan, and H. Bourlard | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, Greece, pp. 289-298 | 1994 | Speech | |
| Parallelizing Speaker-Attributed Speech Recognition for Meeting Browsing | G. Friedland, J. Chong, and A. Janin | Proceedings of the 2010 IEEE International Symposium on Multimedia (ISM2010), Taiwan, pp. 121-128 | December 2010 | Speech | [PDF]
|
| Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition | L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 233-236 | April 2007 | Speech | [PDF]
|
| Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System | N. Mirghafori and M. Hebert | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
|
| Perceptually-Inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments | B. Kingsbury | Ph.D. Dissertation, University of California at Berkeley | December 1998 | Speech | [PDF]
|
| Performance Improvements Through Combining Phone- and Syllable-Length Information in Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, pp. 854-857 | November 1998 | Speech | [PDF]
|
| Personalized, Interactive Tag Recommendation for Flickr | N. Garg and I. Weber | Proceedings of the Second ACM International Conference on Recommender Systems (RecSys 2008), Lausanne, Switzerland, pp. 67-74 | October 2008 | Speech | [PDF]
|
| Phonetic Context in Hybrid HMM/MLP Continuous Speech Recognition | H. Bourlard, M. Cohen, P. Kohn, N. Morgan, and C. Wooters | Proceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 109-112 | 1991 | Speech | |
| Phonetic- and Speaker-Discriminant Features for Speaker Recognition | L. Stoll | UC Berkeley Master's Thesis | December 2006 | Speech | [PDF]
|
| Phrase and Word Level Strategies for Detecting Appositions in Speech | B. Favre and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2711-2714 | September 2009 | Speech | [PDF]
|
| Pitch-Based Emphasis Detection for Characterization of Meeting Recordings | L. Kennedy and D. Ellis | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands | November 2003 | Speech | [PDF]
|
| Precise Indoor Localization Using Smart Phones | E. Martin, O. Vinyals, G. Friedland, and R. Bajcsy | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 787-790 | October 2010 | Speech | [PDF]
|
| Prediction-driven Computational Auditory Scene Analysis for Dense Sound Mixtures | D. Ellis | Proceedings of the ESCA Workshop on the "Auditory Basis of Speech Perception," Keele University, Staffordshire, UK | 1996 | Speech | [PDF]
|
| Probability Estimation by Feed-forward Networks in Continuous Speech Recognition | S. Renals, N. Morgan, and H. Bourlard | ICSI Technical Report TR-91-030. Also published in Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, pp. 309-318 | 1991 | Speech | |
| Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System | A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | NIST ICASSP 2004 Meeting Recognition Workshop, Montreal | May 2004 | Speech | [PDF]
|
| Prosodic and Other Long-Term Features for Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 5, pp. 985-993 | July 2009 | Speech | [PDF]
|
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Prosodic Features and Feature Selection for Multi-lingual Sentence Segmentation | J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2585-2588 | August 2007 | Speech | [PDF]
|
| Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles | E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet | Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics | 2009 | Speech | [PDF]
|
| Prosodic Stress Revisited: Reassessing the Role of Fundamental Frequency | R. Silipo and S. Greenberg | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag | 2004 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog | J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | |
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Segmentation of Speech into Sentences and Topics | E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür | Speech Communication, T. Robinson and S. Renals, eds., Vol. 32, Issue 1-2, pp. 127-154 | September 2000 | Speech | |
| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Purity Algorithms for Speaker Diarization of Meetings Data | X. Anguera, C. Wooters, and J. Hernando | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France | May 2006 | Speech | [PDF]
|
| Pushing the Envelope - Aside | N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos | IEEE Signal Processing Magazine, Vol. 22, No. 5, pp. 81-88 | September 2005 | Speech | |
| Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-Location | L. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. Friedland | Proceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, Japan | October 2012 | Speech | [PDF]
|
| Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech | |
| QASR: Question Answering Using Semantic Roles for Speech Interface | S. Stenchikova, D. Hakkani-Tur, and G. Tur | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188 | September 2006 | Speech | |
| Qualcomm-ICSI-OGI Features for ASR | A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| RASTA Extensions: Robustness to Additive and Convolutional Noise | N. Morgan and H. Hermansky | Proceedings of the Workshop on Speech Processing in Adverse Conditions, pp. 115-118 | 1992 | Speech | |
| RASTA Processing of Speech | H. Hermansky and N. Morgan | IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, Vol. 2, No. 4, pp. 578-589 | October 1994 | Speech | |
| RASTA-PLP Speech Analysis Technique | H. Hermansky, N. Morgan, A. Bayya, and P. Kohn | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. I-121-124 | 1992 | Speech | |
| Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
|
| Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
|
| Recognition of Speech in Additive and Convolutional Noise Based on RASTA Spectral Processing | H. Hermansky, N. Morgan, and H.G. Hirsch | Proceedings of the IEEE Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. II-83-86 | 1993 | Speech | |
| Recognizing Reverberant Speech With RASTA-PLP | B. Kingsbury and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1259-1262 | April 1997 | Speech | [PDF]
|
| Reducing Errors by Increasing the Error Rate: MLP Acoustic Modeling for Broadcast News Transcription | N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin, and B. Kingsbury | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|