Publication Search Results

Title | Author | Bibliographic | Date | Group | Links
Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326 | September 2009 | Speech | [PDF]

Word-Conditioned Phone N-Grams for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256 | April 2007 | Speech | [PDF]

Word-Conditioned HMM Supervectors for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 746-749 | August 2007 | Speech | [PDF]

Comparisons of Recent Speaker Recognition Approaches Based on Word Conditioning | H. Lei and N. Mirghafori | Proceedings of Odyssey 2008, Stellenbosch, South Africa | January 2008 | Speech | [PDF]

Data Selection with Kurtosis and Nasality Features for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756 | August 2011 | Speech | [PDF]

Spectro-Temporal Gabor Features for Speaker Recognition | H. Lei, B. T. Meyer, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

User Verification: Matching the Uploaders of Videos Across Accounts | H. Lei, J. Choi, A. Janin, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407 | May 2011 | Speech | [PDF]

Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Using Boosting to Improve a Hybrid HMM/Neural Network Speech Recognizer | H. Schwenk | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012 | March 1999 | Speech | [PDF]

The Value of Auditory Offset Adaptation and Appropriate Acoustic Modeling | H. Wang, D. Gelbart, H.G. Hirsch, and W. Hemmert | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 902-905 | September 2008 | Speech | [PDF]

Relevance of Time-Frequency Features for Phonetic and Speaker-Channel Classification | H.H. Yang, S. Sharma, S. van Vuuren, and H. Hermansky | Speech Communication, Vol. 1, No. 31, pp. 35-50 | May 2000 | Speech | [PDF]

Search for Information Bearing Components in Speech | H.H. Yang and H. Hermansky | Advances in Neural Information Processing Systems, Vol. 12, S.A. Solla, T.K. Leen and K.-R. Muller, eds., MIT Press | 2000 | Speech
Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech
Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling Using Class-Dependent Mixtures | I. Bulyko, M. Ostendorf, and A. Stolcke | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9 | May 2003 | Speech | [PDF]

The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language Researchers | J. A. Edwards | Proceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11 | May 2004 | Speech | [PDF]

A Robust Speaker Clustering Algorithm | J. Ajmera and C. Wooters | Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands | December 2003 | Speech | [PDF]

Unknown-Multiple Speaker Clustering Using HMM | J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | May 2002 | Speech
Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog | J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech
Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]

Research Developments and Directions in Speech Recognition and Understanding, Part 1 | J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80 | May 2009 | Speech
Updated MINDS Report on Speech Recognition and Understanding, Part 2 | J. Baker, L. Deng, S. Khudanpur, C.-H. Lee, J. Glass, N. Morgan, and D. O'Shaughnessy | IEEE Signal Processing Magazine, Vol. 26, No. 4, pp. 78-85 | July 2009 | Speech | [PDF]

Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore Decoder | J. Barker, M. Cooke, and D. Ellis | Proceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, Denmark | September 2001 | Speech
Decoding Speech in the Presence of Other Sound Sources | J. Barker, M. Cooke, and D. Ellis | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]

A Multi-DSP Ring Array for Connectionist Simulations | J. Beck, N. Morgan, A. Allman, and J. Beer | Proceedings of 23rd Asilomar Conference on Signals, Systems & Computers | 1989 | Speech
Natural Statistical Models for Automatic Speech Recognition | J. Bilmes | Ph.D. Thesis, University of California at Berkeley, Fall 1999. Also ICSI Technical Report TR-99-016 | October 1999 | Speech | [PDF]

Buried Markov Models for Speech Recognition | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716 | March 1999 | Speech | [PDF]

Data-Driven Extensions to HMM Statistical Dependencies | J. Bilmes | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72 | November 1998 | Speech | [PDF]

Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional Modeling | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472 | May 1998 | Speech | [PDF]

Joint Distributional Modeling with Cross-Correlation Based Features | J. Bilmes | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU-97), Santa Barbara, California, pp. 148-155 | 1997 | Speech | [PDF]

Factored Language Models and Generalized Parallel Backoff | J. Bilmes and K. Kirchhoff | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1 | May 2003 | Speech | [PDF]

Stochastic Perceptual Speech Models with Durational Dependence | J. Bilmes, N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]

The 2010 ICSI Video Location Estimation System | J. Choi, A. Janin, and G. Friedland | Proceedings of the MediaEval 2010 Workshop, Pisa, Italy | October 2010 | Speech | [PDF]

Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]

The 2011 ICSI Video Location Estimation System | J. Choi, H. Lei, and G. Friedland | Proceedings of the MediaEval 2011 Workshop, Pisa, Italy | September 2011 | Speech | [PDF]

The 2012 ICSI/Berkeley Video Location Estimation System | J. Choi, V. Ekambaram, G. Friedland, and K. Ramchandran | Presented at the MediaEval 2012 Workshop, Pisa, Italy | October 2012 | Speech | [PDF]

Opportunities and Challenges of Parallelizing Speech Recognition | J. Chong, G. Friedland, A. Janin, and N. Morgan | Proceedings of the Second USENIX Workshop on Hot Topics in Parallelism (HotPar '10), Berkeley, California | June 2010 | Speech | [PDF]

Sampling Alignment Structure Under a Bayesian Translation Model | J. DeNero, A. Bouchard-Côté, and D. Klein | Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323 | October 2008 | Speech | [PDF]

Asynchronous Binarization for Synchronous Grammars | J. DeNero, A. Pauls, and D. Klein | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]

Fast Consensus Decoding over Translation Forests | J. DeNero, D. Chiang, and K. Knight | Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore | August 2009 | Speech | [PDF]

Efficient Parsing for Transducer Grammars | J. DeNero, M. Bansal, A. Pauls, and D. Klein | Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 227-235 | May 2009 | Speech | [PDF]

Chapter 17: The Transcription of Discourse | J. Edwards | The Handbook of Discourse Analysis, D. Schiffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348 | 2001 | Speech
Prosodic Features and Feature Selection for Multi-lingual Sentence Segmentation | J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2585-2588 | August 2007 | Speech | [PDF]

How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]

Integrating RASTA-PLP into Speech Recognition | J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch, and G. Tong | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, pp. I-421-424 | 1994 | Speech
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of 9th International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic, pp. 629-636 | September 2006 | Speech | [PDF]

On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017 | September 2006 | Speech | [PDF]

Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1621-1624 | August 2007 | Speech | [PDF]

Genre Effects on Automatic Sentence Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversations | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704 | April 2009 | Speech | [PDF]
