Publication Search Results

Title | Author | Bibliographic | Date | Group | Links
Don't Multiply Lightly: Quantifying Problems with the Acoustic Model Assumptions in Speech Recognition | D. Gillick, L. Gillick, and S. Wegmann | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Big Island, Hawaii | December 2011 | Speech | [PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]

Introduction to the Special Section on Deep Learning for Speech and Language Processing | D. Yu, G. Hinton, N. Morgan, J.-T. Chien, and S. Sagayama | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 4-6 | January 2012 | Speech | [PDF]

Deep and Wide: Multiple Layers in Automatic Speech Recognition | N. Morgan | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 7-13 | January 2012 | Speech | [PDF]

Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Spectro-Temporal Gabor Features for Speaker Recognition | H. Lei, B. T. Meyer, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM Framework | D. Gillick, S. Wegmann, and L. Gillick | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johns | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning Streams | S. Ravuri and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech
Articulatory Features for Expressive Speech Synthesis | A. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaug | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Why Has (Reasonably Accurate) Automatic Speech Recognition Been So Hard to Achieve? | S. Wegmann and L. Gillick | ArXiv.org under CoRR abs/1003.0206 | February 2010 | Speech | [PDF]

Finding Difficult Speakers in Automatic Speaker Recognition | L. Stoll | UC Berkeley PhD thesis, Berkeley, California | December 2011 | Speech | [PDF]

Gappy Phrasal Alignment by Agreement | M. Bansal, C. Quirk, and R. C. Moore | Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pp. 1308-1317, Portland, Oregon | June 2011 | Speech | [PDF]

The Surprising Variance in Shortest-Derivation Parsing | M. Bansal and D. Klein | Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon | June 2011 | Speech | [PDF]

Web-Scale Features for Full-Scale Parsing | M. Bansal and D. Klein | Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pp. 693-702, Portland, Oregon | June 2011 | Speech | [PDF]

Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]

Semantic Computing and Privacy: A Case Study Using Inferred Geo-Tagging | G. Friedland and J. Choi | International Journal of Semantic Computing, Vol. 5, No. 1, pp. 79-93. Also Best Poster in the Electrical and Computer Science and Engineering Track at the Korean Student Technical and Leadership Conference, Chicago, Illinois, March 2012. DOI: 10.1142/S1793351X11001171 | March 2011 | Speech | [PDF]

Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online Privacy | G. Friedland | Presented at the Language Technologies Institute Colloquium, Carnegie Mellon University, Pittsburgh, Pennsylvania | April 13, 2012 | Speech | [PDF]

From AUDREY to Siri: Is Speech Recognition A Solved Problem? | R. Pieraccini | Presented at the Mobile Voice Conference, San Francisco, California | March 2012 | Speech | [PDF]

Syllable Models for Mandarin Speech Recognition: Exploiting Character Language Models | X. Liu, J. L. Hieronymus, M. J. F. Gales, and P. C. Woodland | In submission | 2012 | Speech
Language Model Combination and Adaptation Using Weighted Finite State Transducers | X. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. Woodland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas | March 2010 | Speech
The Grammar of Hitting and Breaking | C. J. Fillmore | In Readings in English Transformational Grammar, R. Jacobs and P. Rosenbaum, eds., pp. 120-133, Georgetown University Press | June 1970 | Speech | [PDF]

Exploiting Chinese Character Models to Improve Speech Recognition Performance | J. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. Woodland | Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UK | September 2009 | Speech
Where did I go Wrong?: Identifying Troublesome Segments for Speaker Diarization Systems | M. T. Knox, N. Mirghafori, and G. Friedland | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]

Hooking Up Spectro-Temporal Filters with Auditory-Inspired Representations for Robust Automatic Speech Recognition | B. Meyer, C. Spille, B. Kollmeier, and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]

Features Based on Auditory Physiology and Perception | R. M. Stern and N. Morgan | In Techniques for Noise Robustness in Automatic Speech Recognition, T. Virtanen, B. Raj, and R. Singh, eds., Wiley Publishing | 2012 | Speech
Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]

There is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced Media | B. Elizalde, G. Friedland, H. Lei, and A. Divakaran | Proceedings of the ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012 (MM'12), Nara, Japan, pp. 27-32 | October 2012 | Speech | [PDF]

Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-Location | L. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. Friedland | Proceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, Japan | October 2012 | Speech | [PDF]

Longer Features: They Do a Speech Detector Good | TJ Tsai and N. Morgan | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech
The 2012 ICSI/Berkeley Video Location Estimation System | J. Choi, V. Ekambaram, G. Friedland, and K. Ramchandran | Presented at the MediaEval 2012 Workshop, Pisa, Italy | October 2012 | Speech | [PDF]

Semi-Autonomous Car Control Using Brain Computer Interfaces | D. Goehring, D. Latotzky, M. Wang, and R. Rojas | Proceedings of the 12th International Conference of Intelligent Autonomous Systems (IAS), Jeju Island, Korea | June 2012 | Speech