Publication Search Results

Titlesort descendingAuthorBibliographicDateGroupLinks
Cover Song Detection: From High Scores to General ClassificationS. Ravuri and D. EllisProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, pp. 65-68March 2010Speech[PDF]

Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer PerceptronsA. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. VergyriProceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 321-324May 2006Speech[PDF]

Cross-Genre Feature Comparisons for Spoken Sentence SegmentationS. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, and B. FavreProceedings of International Conference on Semantic Computing, IEEE Computer Society, pp. 265-274, Irvine, California. Also published in International Journal of Semantic Computing, Volume 1, Issue 3, World Scientific, USA, pp. 335-346September 2007Speech[PDF]

Cross-Lingual Sentence Extraction for Information DistillationA. Singla and D. Hakkani-TurProceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710September 2008Speech[PDF]

CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model ApplicationsH. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. FoxProceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, CaliforniaMay 2011Speech[PDF]

Current Research in Acoustically Robust Speech RecognitionN. MorganProceedings of American Voice Input/Output Society (AVIOS), pp. 207-214September 1994Speech
Cybercasing the Joint: Language Technologies, Multimedia Retrieval, and Online PrivacyG. FriedlandPresented at the Language Technologies Institute Colloquium, Carnegie Mellon University, Pittsburgh, PennsylvaniaApril 13 2012Speech[PDF]

Data Selection with Kurtosis and Nasality features for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756August 2011Speech[PDF]

Data-Driven Design of RASTA-like FiltersS. van Vuuren and H. HermanskyProceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, GreeceSeptember 1997Speech
Data-Driven Extensions to HMM Statistical DependenciesJ. BilmesProceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72November 1998Speech[PDF]

Data-Driven Modulation Filter Design Under Adverse Acoustic Conditions and Using Phonetic and Syllabic UnitsM.L. ShireProceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. III-1123-1126September 1999Speech[PDF]

Data-driven RASTA Filters in ReverberationM. Shire and B. ChenProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630June 2000Speech[PDF]

Data-Driven Speaker and Subword Unit Clustering in Speech ProcessingM. HerschEPFL Diploma Thesis, ICSIMarch 2003Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

Decoding Speech in the Presence of Other Sound SourcesJ. Barker, M. Cooke, and D. EllisProceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, ChinaOctober 2000Speech[PDF]

Deep and Wide: Multiple Layers in Automatic Speech RecognitionN. MorganIEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning 2011Speech[PDF]

Deep and Wide: Multiple Layers in Automatic Speech RecognitionN. MorganIEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 1, pp. 7-13January 2012Speech[PDF]

Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification SystemM. Hebert and N. MirghaforiProceedings of IEEE ICASSP, MontrealMay 2004Speech[PDF]

Detecting Categories in News Video Using Acoustic, Speech, and Image FeaturesS. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. MalikPresented at the NIST TREC Video Retrieval Workshop, Gaithersburg, MarylandNovember 2006Speech[PDF]

Detecting Deception Using Critical SegmentsF. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, and A. StolckeProceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2281-2284August 2007Speech[PDF]

Detecting Local Semantic Concepts in Environmental Sounds Using Markov Model Based ClusteringK. Lee, D. Ellis, and A. LouiProceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), Dallas, Texas, March 2010March 2010Speech[PDF]

Detecting Music in Ambient Audio by Long-Window AutocorrelationK. Lee and D. EllisProceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12April 2008Speech[PDF]

Detection and Compensation of Sensor Malfunction in Time Delay Based Direction of Arrival EstimationT. Pirinen, J. Yli-Hietanen, P. Pertilä, and A. VisaProceedings of IEEE ISCAS, VancouverMay 2004Speech[PDF]

Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled DataD. Hillard, M. Ostendorf, and E. ShribergProceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, CanadaMay 2003Speech[PDF]

Development of the SRI/Nightingale Arabic ASR systemD. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. MorganProceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440September 2008Speech
Dialocalizaton: Acoustic Speaker Diarization and Visual Localization as Joint Optimization ProblemG. Friedland, C. Yeo, and H. HungACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 6, No. 4, Article 27November 2010Speech[PDF]

Dialog Act Tagging Using Graphical ModelsG. Ji and J. BilmesProceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, Vol. 1, pp. 33-36March 2005Speech[PDF]

Digit Recognition with Stochastic Perceptual ModelsN. Morgan, S.L. Wu, and H. BourlardProceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, SpainSeptember 1995Speech[PDF]

Direct Modeling of Prosody: An Overview of Applications in Automatic Speech ProcessingE. Shriberg and A. StolckeProceedings of the International Conference on Speech Prosody, Nara, Japan, March 2004.March 2004Speech[PDF]

Discourse Segmentation of Multi-party ConversationM. Galley, K. McKeown, E. Fosler-Lussier, and H. JingProceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL-03), Sapporo, JapanJuly 2003Speech[PDF]

Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech RecognitionM. ShirePh.D Dissertation, University of California at Berkeley, Fall 2000 2000Speech[PDF]

Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error CriterionO. Vinyals, L. Deng, D. Yu, and A. AceroProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448April 2009Speech[PDF]

Discriminative Training for Hierarchical Clustering in Speaker DiarizationO. Vinyals, G. Friedland, and N. MorganProceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2326-2329September 2010Speech[PDF]

Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM FrameworkD. Gillick and S. Wegmann, L. GillickProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?A. Venkataraman, Y. Liu, E. Shriberg, and A. StolckeProceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780September 2005Speech[PDF]

Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?E. Shriberg, S. Kajarekar, and N. SchefferProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554September 2009Speech[PDF]

Don't Multiply Lightly: Quantifying Problems with the Acoustic Model Assumptions in Speech RecognitionD. Gillick, L. Gillick, and S. WegmannProceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Big Island, HawaiiDecember 2011Speech[PDF]

Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech RecognitionD. Gelbart and N. MorganProceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, ColoradoSeptember 2002Speech[PDF]

Duration and Pronunciation Conditioned Lexical Modeling for Speaker VerificationG. Tur, E. Shriberg, A. Stolcke, and S. KajarekarProceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech--Eurospeech 2008), Antwerp, Belgium, pp. 2049-2052August 2007Speech[PDF]

Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence ValuesK. Kirchhoff and J. BilmesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696March 1999Speech[PDF]

Dynamic Pronunciation Models for Autmoatic Speech RecognitionE. Fosler-LussierPh.D. Thesis, UC Berkeley, Fall 1999, ICSI Technical Report TR-99-015September 1999Speech[PDF]

Dynamic Pronunciation Models for Automatic Speech RecognitionE. Fosler-LussierPh.D Dissertation, University of California at BerkeleyAugust 1999Speech[PDF]

Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning StreamsS. Ravuri and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech
Educational MultimediaG. Friedland, L. Knipping, and W. Huerst (guest editors)Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008July 2008Speech[PDF]

Educational Multimedia Systems: The Past, the Present, and a Glimpse into the FutureG. Friedland, W. Huerst, and L. KnippingProceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 1-4September 2007Speech
EEG Signal Compression Based on Classified Signature and Envelope Vector SetsH. Gurkan, U. Guz, and B.S. YarmanProceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain, pp. 420-423August 2007Speech
Effective Arabic Dialect Classification Using Diverse Phonotactic ModelsM. Akbacak, D. Vergyri, A. Stolcke, N. Scheffer, and A. MandalProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 737-740August 2011Speech[PDF]

Effects of Speaking Rate and Word Frequency on Conversational PronunciationsE. Fosler-Lussier and N. MorganSpeech Communication Vol. 29, No. 2-4, pp. 137-158November 1999Speech[PDF]

Effects of Speaking Rate and Word Predictability on Conversational PronunciationsE. Fosler-Lussier and N. MorganProceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, NetherlandsMay 1998Speech[PDF]

Pages