| Text Classification by Augmenting the Bag-of-Words Representation with Redundancy-Compensated Bigrams | C. Boulis and M. Ostendof | Proceedings of the SIAM International Conference on Data Mining at the Workshop on Feature Selection in Data Mining (SIAM-FSDM 2005), Newport Beach, California | April 2005 | Speech | [PDF]
|
| Combining Multiple Clustering Systems | C. Boulis and M. Ostendof | Proceedings of the 15th European Conference on Machine Learning (ECML/PKDD 2004), Pisa, Italy | September 2004 | Speech | [PDF]
|
| Robust ASR Front-End Using Spectral-Based and Discriminant Features: Experiments on the Aurora Tasks | C. Benitez, L. Burget, B. Chen, S. Dupont, H. Garudadri, H. Hermansky, P. Jain, S. Kajarekar, and S. Sivadas | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, pp. 429-432 | September 2001 | Speech | [PDF]
|
| Multiresolution Channel Normalization for ASR in Reverberant Environments | C. Avendano, S. Tibrewala, and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| There is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media | Benjamin Elizalde; Gerald Friedland; Howard Lei; Ajay Divakaran | Proceedings of the ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012 (MM'12), Nara, Japan, pp. 27-32 | October 2012 | Speech | [PDF]
|
| Learning Discriminant Narrow-Band Temporal Patterns for Automatic Recognition of Conversational Telephone Speech | B.Y. Chen | Ph.D. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Comparison of Grammar Based and Statistical Language Models Trained on the Same Data | B.A. Hockey and M. Rayner | Presented at the Workshop on Spoken Language Understanding at the 20th AIII National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania | July 2005 | Speech | |
| Tonotopic Multi-Layered Perceptron: A Neural Network for Learning | B. Y. Chen, Q. Zhu, and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 945-948 | March 2005 | Speech | [PDF]
|
| The Relationship Between Dialogue Acts and Hot Spots in Meetings | B. Wrede and E. Shriberg | Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands | December 2003 | Speech | [PDF]
|
| Spotting "Hot Spots" in Meetings: Human Judgments and Prosodic Cues | B. Wrede and E. Shriberg | Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2805-2808 | September 2003 | Speech | [PDF]
|
| Summarization- and Learning-Based Approaches to Information Distillation | B. Toth, D. Hakkani-Tur, and S. Yaman | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5306-5309 | March 2010 | Speech | [PDF]
|
| Comparing Different Flavors of Spectro-Temporal Features for ASR | B. T. Meyer, S. V. Ravuri, M. R. Schaedler, and N. Morgan | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 1269-1272 | August 2011 | Speech | [PDF]
|
| Improving Automatic Speech Recognition by Learning from Human Errors | B. T. Meyer | Proceedings of the 162nd Meeting of the Acoustical Society of America, San Diego, California | October 2011 | Speech | |
| Reranking for Sentence Boundary Detection in Conversational Speech | B. Roark, Y. Liu, M. Harper, R. Stewart, M. Lease, M. Snover, Z. Shafran, B. Dorr, J. Hale, A. Krasnyanskaya, and L. Young | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 545-548 | May 2006 | Speech | |
| Using Prosodic and Conversational Features for High-Performance Speaker Recognition: Report From JHU WS'02. | B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Hooking Up Spectro-Temporal Filters with Auditory-Inspired Representations for Robust Automatic Speech Recognition | B. Meyer, C. Spille, B. Kollmeier, and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| Combined Low Level and High Level Features for Out-of-Vocabulary Word Detection | B. Lecouteux, G. Linarès, and B. Favre | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1187-1190 | September 2009 | Speech | [PDF]
|
| Robust Speech Recognition Using the Modulation Spectrogram | B. Kingsbury, N. Morgan, and S. Greenberg | Speech Communication, Vol. 25, pp. 117-132 | 1998 | Speech | |
| Improving ASR Performance for Reverberant Speech | B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 87-90 | 1997 | Speech | [PDF]
|
| The IBM 2009 GALE Arabic Speech Transcription System | B. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675 | May 2011 | Speech | [PDF]
|
| Recognizing Reverberant Speech With RASTA-PLP | B. Kingsbury and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1259-1262 | April 1997 | Speech | [PDF]
|
| Perceptually-Inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments | B. Kingsbury | Ph.D Dissertation, University of California at Berkeley | December 1998 | Speech | [PDF]
|
| Speech and Audio Signal Processing: Processing and Perception of Speech and Music, 2nd Edition | B. Gold, N. Morgan, and D. Ellis | Wiley | November 2011 | Speech | |
| Speech and Audio Signal Processing | B. Gold and N. Morgan | Wiley Press, New York | 1999 | Speech | |
| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Efficient Sentence Segmentation Using Syntactic Features | B. Favre, D. Hakkani-Tur, S. Petrov, and D. Klein | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 77-80 | December 2008 | Speech | [PDF]
|
| Syntactically Informed Models for Comma Prediction | B. Favre, D. Hakkani-Tür, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4697-4700 | April 2009 | Speech | [PDF]
|
| Evaluation of Semantic Role Labeling and Dependency Parsing of Automatic Speech Recognition Output | B. Favre, B. Bohnet, D. Hakkani-Tür | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5342-5345 | March 2010 | Speech | [PDF]
|
| Phrase and Word Level Strategies for Detecting Appositions in Speech | B. Favre and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2711-2714 | September 2009 | Speech | [PDF]
|
| ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields | B. Favre and B. Bohnet | Proceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100 | August 2009 | Speech | [PDF]
|
| Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers | B. Chen, S. Chang, and S. Sivadas | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Long-Term Temporal Features for Conversational Speech Recognition | B. Chen, Q. Zhu, and N. Morgan | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Synchronous Parsing of Syntactic and Semantic Structures | B. Bohnet | Proceedings of Quatrième Conférence Internationale Sur La Théorie Sens-Texte (Fourth International Conference on Meaning-Text Theory, MTT’09), Montreal, Canada | June 2009 | Speech | [PDF]
|
| Efficient Parsing of Syntactic and Semantic Dependency Structures | B. Bohnet | Presented at the 13th Conference on Computational Natural Language Learning (CoNLL-2009), Boulder, Colorado | June 2009 | Speech | [PDF]
|
| Vocabulary and Language Model Adaptation Using Information Retrieval | B. Bigi, Y. Huang, and R. De Mori | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data? | A. Venkataraman, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780 | September 2005 | Speech | [PDF]
|
| Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475 | July 2005 | Speech | [PDF]
|
| The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, A. Janin, M. Magimai-Doss, C. Wooters, and J. Zheng | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 450-463 | May 2007 | Speech | [PDF]
|
| Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms | A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. Shriberg | Proceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262 | June 2010 | Speech | [PDF]
|
| MLLR Transforms as Features in Speaker Recognition | A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428 | September 2005 | Speech | |
| Leveraging Speaker Diarization for Meeting Recognition from Distant Microphones | A. Stolcke, G. Friedland, and D. Imseng | Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4390-4393 | March 2010 | Speech | [PDF]
|
| Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons | A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 321-324 | May 2006 | Speech | [PDF]
|
| Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System | A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | NIST ICASSP 2004 Meeting Recognition Workshop, Montreal | May 2004 | Speech | [PDF]
|
| Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW | A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu | IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744 | September 2006 | Speech | [PDF]
|
| Making the Most from Multiple Microphones in Meeting Recognition | A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995 | May 2011 | Speech | [PDF]
|
| Cross-Lingual Sentence Extraction for Information Distillation | A. Singla and D. Hakkani-Tur | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710 | September 2008 | Speech | [PDF]
|
| A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System: the Wernicke Project | A. Robinson, L. Almeida, J. Boite, H. Bourlard, F. Fallside, H. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J. Neto, S. Renals, M. Saerens, and C. Wooters | Proceedings of the Third European Conference on Speech Communication and Technology (Eurospeech '93), Berlin, Germany, pp. 1941-1944 | 1993 | Speech | |