| An Improved Approximation Algorithm for Vertex Cover with Hard Capacities | R. Gandhi, E. Halperin, S. Khuller, G. Kortsarz, and A. Srinivasan | Proceedings of the 30th International Colloquium on Automata, Languages and Programming (ICALP 2003), Eindhoven, The Netherlands, pp. 164-175 | June 2003 | Speech | [PDF]
|
| An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition | N. Morgan and H. Bourlard | IEEE Signal Processing Magazine, pp. 25-42 | May 1995 | Speech | [PDF]
|
| An Introduction to the Diagnostic Evaluation of the Switchboard-Corpus Automatic Speech Recognition Systems | S. Greenberg, S. Chang, and J. Hollenback | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| An Overview of the SPRACH System for the Transcription of Broadcast News | G. Cook, J. Christie, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson, and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Analytics for Experts | G. Friedland | Featured paper in ACM SIGMM Records, Vol. 1, Issue 1 | March 2009 | Speech | [PDF]
|
| Anchored Speech Recognition for Question Answering | S. Yaman, G. Tür, D. Vergyri, D. Hakkani-Tür, M. Harper, and W. Wang | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 265-268 | June 2009 | Speech | [PDF]
|
| Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
|
| Any Questions? Automatic Question Detection in Meetings | K. Boakye, B. Favre, and D. Hakkani-Tür | Proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489 | December 2009 | Speech | [PDF]
|
| Applications of Keyword-Constraining in Speaker Recognition | H. Lei | MS Thesis, University of California-Berkeley | July 2007 | Speech | [PDF]
|
| Appscio: A Software Environment for Semantic Multimedia Analysis | G. Friedland, E. Hensley, J. Schumacher, and R. Jain | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459 | August 2008 | Speech | [PDF]
|
| Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer Workshop | K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii | April 2007 | Speech | |
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot Interaction | A. Batliner, S. Steidl, and E. Nöth | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27 | May 2011 | Speech | [PDF]
|
| Asynchronous Binarization for Synchronous Grammars | J. DeNero, A. Pauls, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Audio Information Access from Meeting Rooms | S. Renals and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Audio-Based Semantic Concept Classification for Consumer Video | K. Lee and D. Ellis | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416 | August 2010 | Speech | [PDF]
|
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech | |
| Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech | |
| Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization | X. Anguera, C. Wooters, and J. Hernando | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256 | May 2006 | Speech | [PDF]
|
| Automatic Data Selection for MLP-Based Feature Extraction for ASR | C. Pelaez-Moreno, Q. Zhu, B. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 229-232 | September 2005 | Speech | [PDF]
|
| Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]
|
| Automatic Disfluency Identification in Conversational Speech Using Multiple Knowledge Sources | Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech | S. Cuendet, D. Hakkani-Tur, and E. Shriberg | Proceedings of Fourth International Conference on Machine Learning and Multimodal Interaction, Brno, Czech Republic, pp. 144-155 | June 2007 | Speech | [PDF]
|
| Automatic Labeling of Semantic Roles | D. Gildea and D. Jurafsky | The 38th Annual Meeting of the Association for Computational Linguistics (ACL-2000), Hong Kong, pp. 512-520 | October 2000 | Speech | [PDF]
|
| Automatic Laughter Detection Using Neural Networks | M. Knox and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2973-2976 | August 2007 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Automatic Phonetic Transcription of Spontaneous Speech American English | S. Chang, L. Shastri, and S. Greenberg | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues | D. Baron, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado, pp. 949-952 | September 2002 | Speech | [PDF]
|
| Automatic Speech Recognition | H. Hermansky, and N. Morgan | Encyclopedia of Cognitive Science, Nature Publishing Group, London | 2003 | Speech | |
| Automatic Speech Recognition with an Adaptation Model Motivated by Auditory Processing | M. Holmberg, D. Gelbart, and W. Hemmert | IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 1, pp. 44-49 | January 2006 | Speech | [PDF]
|
| Automatic Speech Recognition with Neural Spike Trains | M. Holmberg, D. Gelbart, U. Ramacher, and W. Hemmert | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal | September 2005 | Speech | [PDF]
|
| Automatic Tagging and Geo-Tagging in Video Collections and Communities | M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. Jones | Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy, April 2011 | April 2011 | Speech | [PDF]
|
| Automatic Transcription of Prosodic Stress for Spontaneous English Discourse | R. Silipo and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 2351-2354 | August 1999 | Speech | [PDF]
|
| Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings | X. Anguera, C. Wooters, J. Pardo, and J. Hernando | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 241-244 | April 2007 | Speech | [PDF]
|
| Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings | S. Bhagat, H. Carvey, and E. Shriberg | Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003), Barcelona, Spain | August 2003 | Speech | [PDF]
|
| Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech | |
| Backoff Model Training Using Partially Observed Data: Application to Dialog Act Tagging | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York City, New York, pp. 280-287 | June 2006 | Speech | [PDF]
|
| Best Papers from the 10th IEEE International Symposium on Multimedia | G. Friedland and S.-C. Shen, eds. | International Journal on Semantic Computing (IJSC), World Scientific, Vol. 3, Issue 2 | June 2009 | Speech | |
| Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC) | G. Friedland and C. Martell, eds. | International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3 | September 2008 | Speech | |
| Better Word Alignments with Supervised ITG Models | A. Haghighi, J. Blitzer, J. DeNero, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]
|
| Big Dumb Neural Nets: A Working Brute Force Approach to Speech Recognition | N. Morgan | Proceedings of the International Conference on Neural Networks, Vol. VII, pp. 4462-4465 | 1994 | Speech | |
| Bird Species Recognition Combining Acoustic and Sequence Modeling | M. Graciarena, M. Delplanche, E. Shriberg, and A. Stolcke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344 | May 2011 | Speech | [PDF]
|
| Building a Highly Accurate Mandarin Speech Recognizer | M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495 | December 2007 | Speech | [PDF]
|
| Building Multiple Pronunication Models for Novel Words using Exploratory Computational Phonology | G. Tajchman, E. Fosler, and D. Jurafsky | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF]
|
| Can Conversational Word Usage Be Used to Predict Speaker Demographics? | D. Gillick | Proceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan | September 2010 | Speech | [PDF]
|