Publication Search Results

TitleAuthorBibliographicDatesort descendingGroupLinks
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR SystemF. Valente, M. Magimai-Doss, C. Plahl, and S. RavuriProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966September 2009Speech[PDF]

An Anticorrelation Kernel for Subsystem Training in Multiple Classifier SystemsL. Ferrer, K. Sönmez, and E. ShribergJournal of Machine Learning Research, Vol. 10, pp. 2079-2114September 2009Speech[PDF]

Exploiting Chinese Character Models to Improve Speech Recognition PerformanceJ. L. Hieronymus, X. Liu, M. J. F. Gales, and P. C. WoodlandProceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), Brighton, UKSeptember 2009Speech
A View of the Parallel Computing LandscapeK. Asanović, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. YelickCommunications of the ACM, Vol. 52, No. 10, pp. 56-67October 2009Speech[PDF]

IXIR: A Statistical Information Distillation SystemM. Levit, D. Hakkani-Tür, G. Tür, and D. GillickJournal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542October 2009Speech[PDF]

Visual Speaker Localization Aided by Acoustic ModelsG. Friedland, C. Yeo, and H. HungProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 195-202October 2009Speech[PDF]

Joke-o-Mat: Browsing Sitcoms Punchline by PunchlineG. Friedland, L. Gottlieb, and A. JaninProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 1115-1116October 2009Speech[PDF]

Review of Cattelan, et al, "Watch-and-Comment as a Paradigm Toward Ubiquitous Interactive Video Editing"G. FriedlandACM Computer Reviews, CR136487October 2009Speech
Robust Speaker Diarization for Short Speech RecordingsD. Imseng and G. FriedlandProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 432-437December 2009Speech[PDF]

Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld EpisodesG. Friedland, L. Gottlieb, and A. JaninProceedings of the 11th IEEE International Symposium on Multimedia (ISM2009), San Diego, California, pp. 511-516December 2009Speech[PDF]

Any Questions? Automatic Question Detection in MeetingsK. Boakye, B. Favre, and D. Hakkani-TürProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 485-489December 2009Speech[PDF]

Integrating Prosodic Features in Extractive Meeting SummarizationS. Xie, D. Hakkani-Tür, B. Favre, and Y. LiuProceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy, pp. 387-391December 2009Speech[PDF]

Selected Papers from the Third IEEE International Conference on Semantic Computing (ICSC2009)G. Friedland and S. C. Shen, eds.International Journal on Semantic Computing, Vol. 3, Issue 4December 2009Speech
Speaker Recognition and DiarizationG. Friedland and D. van LeeuwenIn Semantic Computing, P. Sheu, H. Yu, C. V. Ramamamoorthy, A. K. Joshi, and L. A. Zadeh, eds., pp. 115-130, IEEE Press/Wiley 2010Speech
MLP-Based Feature Extraction for Speech TranscriptionN. Morgan, A. Faria, S. Ravuri, and S. ZhaoHandbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press 2010Speech
Computationally Efficient Clustering of Audio-Visual Meeting DataH. Hung, G. Friedland, and C. YeoIn Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etho, J. Luo, and L. Shao, eds., pp. 25-59 2010Speech
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of SpeechU. Guz, S. Cuendet, G. Tur, and D. Hakkani-TürIEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329February 2010Speech[PDF]

Why Has (Reasonably Accurate) Automatic Speech Recognition Been So Hard to Achieve?S. Wegmann and L. GillickArXiv.org under CoRR abs/1003.0206February 2010Speech[PDF]

Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of SpeechJ. Kolar, Y. Liu, and E. ShribergSpeech Communication, Vol. 52, Issue 3, pp. 236-245March 2010Speech
An Adaptive Initialization Method for Speaker Diarization Based on Prosodic FeaturesD. Imseng and G. FriedlandProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4946-4949March 2010Speech[PDF]

Summarization- and Learning-Based Approaches to Information DistillationB. Toth, D. Hakkani-Tur, and S. YamanProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5306-5309March 2010Speech[PDF]

Comparing the Contributions of Context and Prosody in Text-Independent Dialog Act RecognitionK. Laskowski and E. ShribergProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5374-5377March 2010Speech[PDF]

A Comparison of Approaches for Modeling Prosodic Features in Speaker RecognitionL. Ferrer, N. Scheffer, and E. ShribergProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4414-4417March 2010Speech[PDF]

Acoustic Front-End Optimization for Bird Species RecognitionM. Graciarena, M. Delplanche, E. Shriberg, A. Stolcke, and L. FerrerProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 293-296March 2010Speech[PDF]

Leveraging Speaker Diarization for Meeting Recognition from Distant MicrophonesA. Stolcke, G. Friedland, and D. ImsengProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 4390-4393March 2010Speech[PDF]

Cover Song Detection: From High Scores to General ClassificationS. Ravuri and D. EllisProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 65-68March 2010Speech[PDF]

Evaluation of Semantic Role Labeling and Dependency Parsing of Automatic Speech Recognition OutputB. Favre, B. Bohnet, D. Hakkani-TürProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 5342-5345March 2010Speech[PDF]

Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based ClusteringK. Lee, D. Ellis, and A. LouiProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010March 2010Speech[PDF]

Review of J. Nichols and B. Myers, "Creating a Lightweight User Interface Description Language: An Overview and Analysis of the Personal Universal Controller Project"G. FriedlandACM Computing Reviews, CR137773March 2010Speech
Language Model Combination and Adaptation Using Weighted Finite State TransducersX. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. WoodlandProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, TexasMarch 2010Speech
Cascaded Model Adaptation for Dialog Act Segmentation and TaggingU. Guz, G. Tur, D. Hakkani-Tür, and S. CuendetJournal of Computer Speech and Language, Vol. 24, Issue 2, pp. 289-306April 2010Speech
Hunting for Wolves in Speaker RecognitionL. Stoll and G. DoddingtonProceedings of the Speaker and Language Recognition Workshop (Odyssey 2010), Brno, Czech Republic, pp. 159-164June 2010Speech[PDF]

LDA Based Similarity Modeling for Question AnsweringA. Celikyilmaz, D. Hakkani-Tur, and G. TurProceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 1-9June 2010Speech[PDF]

A Graph-Based Semi-Supervised Learning for Question Semantic LabelingA. Celikyilmaz and D. Hakkani-TurProceedings of the Workshop on Semantic Search at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2010), Los Angeles, California, pp. 27-35June 2010Speech[PDF]

Opportunities and Challenges of Parallelizing Speech RecognitionJ. Chong, G. Friedland, A. Janin, and N. MorganProceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, CaliforniaJune 2010Speech[PDF]

Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation TransformsA. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. ShribergProceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262June 2010Speech[PDF]

A Hybrid Hierarchical Model for Multi-Document SummarizationA. Celikyilmaz and D. Hakkani-TürProceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, pp. 1149-1154July 2010Speech[PDF]

Review of E. Aguilar, "Animation and Performance Capture Using Digitized Models"G. FriedlandACM Computing Reviews, CR138181July 2010Speech
Simple, Accurate Parsing with an All-Fragments GrammarM. Bansal and D. KleinProceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, pp. 1098-1107July 2010Speech[PDF]

Audio-Based Semantic Concept Classification for Consumer VideoK. Lee and D. EllisIEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1406-1416August 2010Speech[PDF]

The CALO Meeting Assistant SystemG. Tur, A. Stolcke, L. Voss, S. Peters, D. Hakkani-Tür, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. YangIEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 6, pp. 1601-1611August 2010Speech[PDF]

Multimodal Indoor Localization: An Audio-Wireless-Based ApproachO. Vinyals, E. Martin, and G. FriedlandProceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125September 2010Speech[PDF]

A Hybrid Approach to Online Speaker DiarizationC. Vaquero, O. Vinyals, and G. FriedlandProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2642-2645September 2010Speech[PDF]

System Output Combination for Improved Speaker DiarizationS. Bozonnet, N. Evans, X. Anguera, O. Vinyals, G. Friedland, and C. FredouilleProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2642-2645September 2010Speech[PDF]

Multimodal Speaker Diarization Using Oriented Optical Flow HistogramsM. Knox and G. FriedlandProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 290-293September 2010Speech[PDF]

Discriminative Training for Hierarchical Clustering in Speaker DiarizationO. Vinyals, G. Friedland, and N. MorganProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2326-2329September 2010Speech[PDF]

Using Spectro-Temporal Features to Improve AFE Feature Extraction for ASRS. Ravuri and N. MorganProceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 1181-1184September 2010Speech
Can Conversational Word Usage Be Used to Predict Speaker Demographics?D. GillickProceedings of the 11th Internationational Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, JapanSeptember 2010Speech[PDF]

A Comparative Large Scale Study of MLP Features for Mandarin ASRF. Valente, M. Magimai Doss, C. Plahl, S. Ravuri, and W. WangProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2630-2363September 2010Speech[PDF]

Multimodal Location EstimationG. Friedland, O. Vinyals, and T. DarrellProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1245-1251October 2010Speech[PDF]

Pages