| Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | |
| Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Why Has (Reasonably Accurate) Automatic Speech Recognition Been So Hard to Achieve? | S. Wegmann and L. Gillick | ArXiv.org under CoRR abs/1003.0206 | February 2010 | Speech | [PDF]
|
| Finding Difficult Speakers in Automatic Speaker Recognition | L. Stoll | UC Berkeley PhD thesis, Berkeley, California | December 2011 | Speech | [PDF]
|
| Gappy Phrasal Alignment by Agreement | M. Bansal, C. Quirk, and R. C. Moore | Proceedings of the 49th annual Meeting of the Association for Computational Linguistics, pp. 1308-1317 Portland, Oregon | June 2011 | Speech | [PDF]
|
| The Surprising Variance in Shortest-Derivation Parsing | M. Bansal and D. Klein | Proceedings of the 49th annual Meeting of the Association for Computational Linguistics, Portland, Oregon | June 2011 | Speech | [PDF]
|
| Web-Scale Features for Full-Scale Parsing | M. Bansal and D. Klein | Proceedings of the 49th annual Meeting of the Association for Computational Linguistics, pp. 693-702, Portland, Oregon | June 2011 | Speech | [PDF]
|
| Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]
|
| Semantic Computing and Privacy: A Case Study Using Inferred Geo-Tagging | G. Friedland and J. Choi | International Journal of Semantic Computing, Vol. 5, No. 1, pp. 79-93. Also Best Poster in the Electrical and Computer Science and Engineering Track at the Korean Student Technical and Leadership Conference, Chicago, Illinois, March 2012. DOI: 10.1142/S1793351X11001171 | March 2011 | Speech | [PDF]
|
| Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online Privacy | G. Friedland | Presented at the Language Technologies Institute Colloquium, Carnegie Mellon University, Pittsburgh, Pennsylvania | April 13 2012 | Speech | [PDF]
|
| From AUDREY to Siri: Is Speech Recognition A Solved Problem? | R. Pieraccini | Presented at the Mobile Voice Conference, San Francisco, California | March 2012 | Speech | [PDF]
|
| Syllable Models for Mandarin Speech Recognition: Exploiting Character Language Models | X. Liu, J. L. Hieronymus, M. J. F. Gales, and P. C. Woodland | In submission | 2012 | Speech | |
| Language Model Combination and Adaptation Using Weighted Finite State Transducers | X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas | March 2010 | Speech | |
| The Grammar of Hitting and Breaking | C. J. Fillmore | In Readings in English Transformational Grammar, R. Jacobs and P. Rosenbaum, eds., pp. 120-133, Georgetown University Press. | June 1970 | Speech | [PDF]
|
| Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech | |
| Where did I go Wrong?: Identifying Troublesome Segments for Speaker Diarization Systems | M. T. Knox, N. Mirghafori, and G. Friedland | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| Hooking Up Spectro-Temporal Filters with Auditory-Inspired Representations for Robust Automatic Speech Recognition | B. Meyer, C. Spille, B. Kollmeier, and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| Features Based on Auditory Physiology and Perception | R. M. Stern and N. Morgan | In Techniques for Noise Robustness in Automatic Speech Recognition, T. Virtanen, B. Raj, and R. Singh, Wiley Publishing | 2012 | Speech | |
| Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
|
| There is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media | Benjamin Elizalde; Gerald Friedland; Howard Lei; Ajay Divakaran | Proceedings of the ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012 (MM'12), Nara, Japan, pp. 27-32 | October 2012 | Speech | [PDF]
|
| Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-Location | L. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. Friedland | Proceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, Japan | October 2012 | Speech | [PDF]
|
| Longer Features: They Do a Speech Detector Good | TJ Tsai and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | |
| The 2012 ICSI/Berkeley Video Location Estimation System | J. Choi, V. Ekambaram, G. Friedland, and K. Ramchandran | Presented at the MediaEval 2012 Workshop, Pisa, Italy | October 2012 | Speech | [PDF]
|
| Semi-Autonomous Car Control Using Brain Computer Interfaces | D. Goehring, D. Latotzky, M. Wang, and R. Rojas | Proceedings of the 12th International Conference of Intelligent Autonomous Systems (IAS), Juju Island, Korea | June 2012 | Speech | |