Publication Search Results

TitleAuthorBibliographicDatesort ascendingGroupLinks
Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation TransformsA. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. ShribergProceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262June 2010Speech[PDF]

Cascaded Model Adaptation for Dialog Act Segmentation and TaggingU. Guz, G. Tur, D. Hakkani-Tür, and S. CuendetJournal of Computer Speech and Language, Vol. 24, Issue 2, pp. 289-306April 2010Speech
Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of SpeechJ. Kolar, Y. Liu, and E. ShribergSpeech Communication, Vol. 52, Issue 3, pp. 236-245March 2010Speech
An Adaptive Initialization Method for Speaker Diarization Based on Prosodic FeaturesD. Imseng and G. FriedlandProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4946-4949March 2010Speech[PDF]

Summarization- and Learning-Based Approaches to Information DistillationB. Toth, D. Hakkani-Tur, and S. YamanProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5306-5309March 2010Speech[PDF]

Comparing the Contributions of Context and Prosody in Text-Independent Dialog Act RecognitionK. Laskowski and E. ShribergProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5374-5377March 2010Speech[PDF]

A Comparison of Approaches for Modeling Prosodic Features in Speaker RecognitionL. Ferrer, N. Scheffer, and E. ShribergProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4414-4417March 2010Speech[PDF]

Acoustic Front-End Optimization for Bird Species RecognitionM. Graciarena, M. Delplanche, E. Shriberg, A. Stolcke, and L. FerrerProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 293-296March 2010Speech[PDF]

Leveraging Speaker Diarization for Meeting Recognition from Distant MicrophonesA. Stolcke, G. Friedland, and D. ImsengProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4390-4393March 2010Speech[PDF]

Cover Song Detection: From High Scores to General ClassificationS. Ravuri and D. EllisProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 65-68March 2010Speech[PDF]

Evaluation of Semantic Role Labeling and Dependency Parsing of Automatic Speech Recognition OutputB. Favre, B. Bohnet, D. Hakkani-TürProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5342-5345March 2010Speech[PDF]

Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based ClusteringK. Lee, D. Ellis, and A. LouiProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010March 2010Speech[PDF]

Review of J. Nichols and B. Myers, "Creating a Lightweight User Interface Description Language: An Overview and Analysis of the Personal Universal Controller Project"G. FriedlandACM Computing Reviews, CR137773March 2010Speech
Language Model Combination and Adaptation Using Weighted Finite State TransducersX. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. WoodlandProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, TexasMarch 2010Speech
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of SpeechU. Guz, S. Cuendet, G. Tur, and D. Hakkani-TürIEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329February 2010Speech[PDF]

Why Has (Reasonably Accurate) Automatic Speech Recognition Been So Hard to Achieve?S. Wegmann and L. GillickArXiv.org under CoRR abs/1003.0206February 2010Speech[PDF]

Speaker Recognition and DiarizationG. Friedland and D. van LeeuwenIn Semantic Computing, P. Sheu, H. Yu, C. V. Ramamamoorthy, A. K. Joshi, and L. A. Zadeh, eds., pp. 115-130, IEEE Press/Wiley 2010Speech
MLP-Based Feature Extraction for Speech TranscriptionN. Morgan, A. Faria, S. Ravuri, and S. ZhaoHandbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press 2010Speech
Computationally Efficient Clustering of Audio-Visual Meeting DataH. Hung, G. Friedland, and C. YeoIn Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etho, J. Luo, and L. Shao, eds., pp. 25-59 2010Speech
Robust Speaker Diarization for Short Speech RecordingsD. Imseng and G. FriedlandProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437December 2009Speech[PDF]

Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld EpisodesG. Friedland, L. Gottlieb, and A. JaninProceedings of the 11th IEEE International Symposium on Multimedia (ISM2009), San Diego, California, pp. 511-516December 2009Speech[PDF]

Any Questions? Automatic Question Detection in MeetingsK. Boakye, B. Favre, and D. Hakkani-TürProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489December 2009Speech[PDF]

Integrating Prosodic Features in Extractive Meeting SummarizationS. Xie, D. Hakkani-Tür, B. Favre, and Y. LiuProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391December 2009Speech[PDF]

Selected Papers from the Third IEEE International Conference on Semantic Computing (ICSC2009)G. Friedland and S. C. Shen, eds.International Journal on Semantic Computing, Vol. 3, Issue 4December 2009Speech
A View of the Parallel Computing LandscapeK. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. YelickCommunications of the ACM, Vol. 52, No. 10, pp. 56-67October 2009Speech[PDF]

IXIR: A Statistical Information Distillation SystemM. Levit, D. Hakkani-Tür, G. Tür, and D. GillickJournal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542October 2009Speech[PDF]

Visual Speaker Localization Aided by Acoustic ModelsG. Friedland, C. Yeo, and H. HungProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 195-202October 2009Speech[PDF]

Joke-o-Mat: Browsing Sitcoms Punchline by PunchlineG. Friedland, L. Gottlieb, and A. JaninProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 1115-1116October 2009Speech[PDF]

Review of Cattelan, et al, "Watch-and-Comment as a Paradigm Toward Ubiquitous Interactive Video Editing"G. FriedlandACM Computer Reviews, CR136487October 2009Speech
Hill-Climbing Feature Selection for Multi-Stream ASRD. Gelbart, N. Morgan, and A. TsymbalProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970September 2009Speech[PDF]

On the Use of Artificial Conversation Data for Speaker Recognition in CarsL. Gottlieb and G. FriedlandProceedings of the Third IEEE International Conference on Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128September 2009Speech[PDF]

Combining Semantic and Syntactic Information Sources for 5-W Question AnsweringS. Yaman, D. Hakkani-Tür, and G. TurProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2707-2710September 2009Speech[PDF]

Classification-Based Strategies for Combining Multiple 5-W Question Answering SystemsS. Yaman, D. Hakkani-Tür, G. Tur, R. Grishman, M. Harper, K. R. McKeown, A. Meyers, and K. SharmaProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2703-2706September 2009Speech[PDF]

Leveraging Sentence Weights in a Concept-Based Optimization Framework for Extractive Meeting SummarizationS. Xie, B. Favre, D. Hakkani-Tür, and Y. LiuProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1503-1506September 2009Speech[PDF]

ClusterRank: A Graph Based Method for Meeting SummarizationN. Garg, B. Favre, K. Riedhammer, and D. Hakkani-TürProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1499-1502September 2009Speech[PDF]

Phrase and Word Level Strategies for Detecting Appositions in SpeechB. Favre and D. Hakkani-TürProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2711-2714September 2009Speech[PDF]

Combined Low Level and High Level Features for Out-of-Vocabulary Word DetectionB. Lecouteux, G. Linarès, and B. FavreProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1187-1190September 2009Speech[PDF]

Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASRS. Y. Zhao, S. Ravuri, and N. MorganProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954September 2009Speech[PDF]

Importance of Nasality Measures for Speaker Recognition Data Selection and Performance PredictionH. Lei and E. Lopez-GonzaloProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891September 2009Speech[PDF]

Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker RecognitionH. Lei and E. Lopez-GonzaloProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326September 2009Speech[PDF]

Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?E. Shriberg, S. Kajarekar, and N. SchefferProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554September 2009Speech[PDF]

Modeling Other Talkers for Improved Dialog Act Recognition in MeetingsK. Laskowski and E. ShribergProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786September 2009Speech[PDF]

Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker VerificationM. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, and S. KajarekarProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2015-2018September 2009Speech
A Human Benchmark for Language RecognitionR. Orr and D. A. Van LeeuwenProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2175-2178September 2009Speech
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR SystemF. Valente, M. Magimai-Doss, C. Plahl, and S. RavuriProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966September 2009Speech[PDF]

An Anticorrelation Kernel for Subsystem Training in Multiple Classifier SystemsL. Ferrer, K. Sönmez, and E. ShribergJournal of Machine Learning Research, Vol. 10, pp. 2079-2114September 2009Speech[PDF]

Exploiting Chinese Character Models to Improve Speech Recognition PerformanceJ. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. WoodlandProceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UKSeptember 2009Speech
ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random FieldsB. Favre and B. BohnetProceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100August 2009Speech[PDF]

Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W TaskK. Parton, K. R. McKeown, R. Coyne, M. T. Diab, R. Grishman, D. Hakkani-Tür, M. Harper, H. Ji, W. Y. Ma, A. Meyers, S. Stolbach, A. Sun, G. Tur, W. Xu, and S. YamanProceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 423-431August 2009Speech[PDF]

Review of G. Welch, "History: The Use of the Kalman Filter for Human Motion Tracking in Virtual Reality"G. FriedlandACM Computing Reviews, CR137162August 2009Speech

Pages