Publication Search Results

TitleAuthorBibliographicsort ascendingDateGroupLinks
Don't Multiply Lightly: Quantifying Problems with the Acoustic Model Assumptions in Speech RecognitionD. Gillick, L. Gillick, and S. WegmannProceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Big Island, HawaiiDecember 2011Speech[PDF]

Evaluating Long-term Spectral Subtraction for Reverberant ASRD. Gelbart and N. MorganProceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, ItalyDecember 2001Speech[PDF]

Meeting RecorderA. JaninProceedings of the Applied Voice Input/Output Society, San Jose, CaliforniaApril 2001Speech[PDF]

Improved Recognition by Combining Different Features and Different SystemsD.P.W. EllisProceedings of the Applied Voice Input/Output Society (AVIOS-2000), San Jose, CaliforniaMay 2000Speech[PDF]

Reducing the Effect of Room Acoustics on Human-Computer InteractionD. GelbartProceedings of the Applied Voice Input/Output Society (AVIOS 2002), San Jose, CaliforniaMay 2002Speech[PDF]

Multi-Stream Spectro-Temporal Features for Robust Speech RecognitionS. Y. Zhao and N. MorganProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901September 2008Speech[PDF]

Getting the Last Laugh: Automatic Laughter Segmentation in MeetingsM. Knox, N. Morgan, and N. MirghaforiProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800September 2008Speech[PDF]

Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech AuthorsK. Boakye, O. Vinyals, and G. FriedlandProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35September 2008Speech
Speech-Overlapped Acoustic Event Detection for Automotive ApplicationsC. Müller, J. I. Biel, E. Kim, and D. RosarioProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2590-2593September 2008Speech[PDF]

Packing the Meeting Summarization KnapsackK. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-TurProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437September 2008Speech[PDF]

Development of the SRI/Nightingale Arabic ASR systemD. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. MorganProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440September 2008Speech
SPERT-II: A Vector Microprocessor System and Its Application to Large Problems in Backpropagation TrainingJ. Wawrzynek, K. Asanovic, B. Kingsbury, J. Beck, D. Johnson, and N. MorganProceedings of the Advances in Neural Information Processing Systems 8 Conference (NIPS 8), Denver, Colorado, pp. 619-625. Also in IEEE Computer, Vol. 29, No. 3, pp 79-86, March 1996.November 1995Speech
REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech RecognitionY. Konig, H. Bourlard, and N. MorganProceedings of the Advances in Neural Information Processing Systems 8 Conference (NIPS 8), Denver, Colorado, pp. 388-394November 1995Speech
A Low-Cost Mobile Pointing and Drawing DeviceK. Jantz, G. Friedland, L. Knipping, and R. RojasProceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 121-122September 2007Speech
Educational Multimedia Systems: The Past, the Present, and a Glimpse into the FutureG. Friedland, W. Huerst, and L. KnippingProceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 1-4September 2007Speech
Pushing the Limits of Mechanical Turk: Qualifying the Crowd for Video Geo-LocationL. Gottlieb, J. Choi, P. Kelm, T. Sikora, and G. FriedlandProceedings of the ACM Workshop on Crowdsourcing for Multimedia (CrowdMM 2012), held in conjunction with ACM Multimedia 2012, pp. 23-28, Nara, JapanOctober 2012Speech[PDF]

When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem FeaturesA. Faria and N. MorganProceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577March 2008Speech[PDF]

Video2GPS: A Demo of Multimodal Location Estimation on Flickr VideosG. Friedland, J. Choi, and A. JaninProceedings of the ACM Multimedia Conference (MM'11), Scottsdale, ArizonaNovember 2011Speech[PDF]

Multimodal Location Estimation on Flickr VideosG. Friedland, J. Choi, H. Lei, and A. JaninProceedings of the ACM International Workshop on Social Media (WSM11), Scottsdale, ArizonaNovember 2011Speech[PDF]

Acoustic Super Models for Large Scale Video Event DetectionR. Mertens, H. Lei, L. Gottlieb, G. Friedland, and A. DivakaranProceedings of the ACM International Workshop on Events in Multimedia (EiMM11), Scottsdale, ArizonaNovember 2011Speech[PDF]

There is No Data Like Less Data: Percepts for Video Concept Detection on Consumer-Produced MediaBenjamin Elizalde; Gerald Friedland; Howard Lei; Ajay DivakaranProceedings of the ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012 (MM'12), Nara, Japan, pp. 27-32October 2012Speech[PDF]

Automatic Tagging and Geo-Tagging in Video Collections and CommunitiesM. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. JonesProceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy, April 2011April 2011Speech[PDF]

Precise Indoor Localization Using Smart PhonesE. Martin, O. Vinyals, G. Friedland, and R. BajcsyProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 787-790October 2010Speech[PDF]

Joke-O-Mat HD: Browsing Sitcoms with Human Derived TranscriptsA. Janin, L. Gottlieb, and G. FriedlandProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1591-1594October 2010Speech[PDF]

Multimodal Location EstimationG. Friedland, O. Vinyals, and T. DarrellProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1245-1251October 2010Speech[PDF]

Visual Speaker Localization Aided by Acoustic ModelsG. Friedland, C. Yeo, and H. HungProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 195-202October 2009Speech[PDF]

Joke-o-Mat: Browsing Sitcoms Punchline by PunchlineG. Friedland, L. Gottlieb, and A. JaninProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 1115-1116October 2009Speech[PDF]

Multimodal Interfaces for Automotive Applications (MIAA)C. Müller and G. FriedlandProceedings of the ACM International Conference on Intelligent User Interfaces (IUI 2009), Sanibel, Florida, pp. 493-494February 2009Speech
Mutaphrase: Paraphrasing with FrameNetM. Ellsworth and A. JaninProceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing (TextEntail), Prague, Czech Republic, pp. 143-150June 2007Speech[PDF]

Multimedia Information Extraction RoadmapG. Myers, G. Tür, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, and D. Hakkani-TürProceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, VirginiaNovember 2008Speech[PDF]

Multi-Stream Speaker Diarization Systems for the Meetings DomainA. Gallardo-Antolin, X. Anguera, and C. WootersProceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006—ICSLP), Philadelphia, Pennsylvania, pp. 2186-2189September 2006Speech[PDF]

Friends and Enemies: A Novel Initialization for Speaker DiarizationX. Anguera, C. Wooters, and J. HernandoProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 689-692September 2006Speech[PDF]

Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time DifferencesJ. Pardo, X. Anguera, and C. WootersProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197September 2006Speech[PDF]

On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party MeetingsJ. Kolar, E. Shriberg, and Y. LiuProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017September 2006Speech[PDF]

Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty MeetingsK. Boakye and A. StolckeProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965September 2006Speech[PDF]

Robust Speaker Diarization for Meetings: ICSI RT06s evaluation systemX. Anguera, C. Wooters, and J. PardoProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1674-1677September 2006Speech[PDF]

Within-Class Covariance Normalization for SVM-Based Speaker RecognitionA. O. Hatch, S. Kajarekar, and A. StolckeProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1471-1474September 2006Speech[PDF]

QASR: Question Answering Using Semantic Roles for Speech InterfaceS. Stenchikova, D. Hakkani-Tur, and G. TurProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188September 2006Speech
The ICSI+ Muilti-Lingual Sentence Segmentation SystemM. Zimmerman, D. Hakkani-Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, and Y. LiuProceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 117-120September 2006Speech
Effects of Vocal Effort and Speaking Style on Text-Independent Speaker VerificationE. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. GoodmanProceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612September 2008Speech[PDF]

Source Separation Based on Binaural Cues and Source Model ConstraintsR. Weiss, M. Mandel, and D. EllisProceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422September 2008Speech[PDF]

The Case for Automatic Higher-Level Features in Forensic Speaker RecognitionE. Shriberg and A. StolckeProceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1509-1512September 2008Speech[PDF]

Meeting Acts: A Labeling System for Group Interaction in MeetingsR. Bates, P. Menning, E. Willingham, and C. KuyperProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, PortugalSeptember 2005Speech[PDF]

Using Symbolic Prominence to Help Design Feature Subsets for Topic Classification and Clustering of Natural Human-Human ConversationsC. Boulis and M. OstendofProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, PortugalSeptember 2005Speech[PDF]

The Effects of Speech Recognition and Punctuation on Information Extraction PerformanceJ. Makhoul, A. Baron, I. Bulyko, L. Nguyen, L. Ramshaw, D. Stallard, R. Schwartz, and B. XiangProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 57-60September 2005Speech
Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency DetectionY. Liu, E. Shriberg, A. Stolcke, and M. HarperProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 3313-3316September 2005Speech
Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?A. Venkataraman, Y. Liu, E. Shriberg, and A. StolckeProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780September 2005Speech[PDF]

MLLR Transforms as Features in Speaker RecognitionA. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. VenkataramanProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428September 2005Speech
Automatic Data Selection for MLP-Based Feature Extraction for ASRC. Pelaez-Moreno, Q. Zhu, B. Chen, and N. MorganProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 229-232September 2005Speech[PDF]

Using MLP Features in SRI's Conversational Speech Recognition SystemQ. Zhu, A. Stolcke, B.Y. Chen, and N. MorganProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2141-2144September 2005Speech[PDF]

Pages