Publication Search Results

TitleAuthorsort ascendingBibliographicDateGroupLinks
Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

Stochastic Perceptual Speech Models with Durational DependenceJ. Bilmes, N. Morgan, S.L. Wu, and H. BourlardProceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania 1996Speech[PDF]

Factored Language Models and Generalized Parallel BackoffJ. Bilmes and K. KirchhoffProceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1May 2003Speech[PDF]

Natural Statistical Models for Automatic Speech RecognitionJ. BilmesPh.D. Thesis, University of California at Berkeley, Fall 1999. Also ICSI Technical Report TR-99-016October 1999Speech[PDF]

Buried Markov Models for Speech RecognitionJ. BilmesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716March 1999Speech[PDF]

Data-Driven Extensions to HMM Statistical DependenciesJ. BilmesProceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72November 1998Speech[PDF]

Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional ModelingJ. BilmesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472May 1998Speech[PDF]

Joint Distributional Modeling with Cross-Correlation Based FeaturesJ. BilmesProceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU-97), Santa Barbara, California, pp.148-155 1997Speech[PDF]

A Multi-DSP Ring Array for Connectionist SimulationsJ. Beck, N. Morgan, A. Allman, and J. BeerProceedings of 23rd Asilomar Conference on Signals, Systems & Computers 1989Speech
Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore DecoderJ. Barker, M. Cooke, and D. EllisProceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, DenmarkSeptember 2001Speech
Decoding Speech in the Presence of Other Sound SourcesJ. Barker, M. Cooke, and D. EllisProceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, ChinaOctober 2000Speech[PDF]

Updated MINDS Report on Speech Recognition and Understanding, Part 2J. Baker, L. Deng, S. Khudanpur, C.-H. Lee, J. Glass, N. Morgan, and D. O'ShgughnessyIEEE Signal Processing Magazine, Vol. 26, No. 4, pp. 78-85July 2009Speech[PDF]

Research Developments and Directions in Speech Recognition and Understanding, Part 1J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'ShaughnessyIEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80May 2009Speech
Automatic Dialog Act Segmentation and Classification in Multiparty MeetingsJ. Ang, Y. Liu, and E. ShribergProceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064March 2005Speech[PDF]

Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer DialogJ. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. StolckeProceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, ColoradoSeptember 2002Speech
Unknown-Multiple Speaker Clustering Using HMMJ. Ajmera, H. Bourlard, I. Lapidot, and I. McCowanProceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, ColoradoMay 2002Speech
A Robust Speaker Clustering AlgorithmJ. Ajmera and C. WootersProceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin IslandsDecember 2003Speech[PDF]

The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language ResearchersJ. A. EdwardsProceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11May 2004Speech[PDF]

Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixturesI. Bulyko, M. Ostendorf, and A. StolckeProceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9May 2003Speech[PDF]

Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual InformationH.H. Yang, S. van Vuuren, and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, ArizonaMarch 1999Speech
Search for Information Bearing Components in SpeechH.H. Yang and H. HermanskyAdvances in Neural Information Processing Systems, Vol. 12, S.A. Solla, T.K. Leen and K.-R. Muller, eds., MIT Press 2000Speech
Relevance of Time-Frequency Features for Phonetic and SpeakerChannel ClassificationH.H. Yan, S. Sharma, S. van Vuuren, and H. HermanskySpeech Communication,Vol. 1, No. 31, pp. 35-50May 2000Speech[PDF]

The Value of Auditory Offset Adaptation and Appropriate Acoustic ModelingH. Wang, D. Gelbart, H.G. Hirsch, and W. HemmertProceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 902-905September 2008Speech[PDF]

Using Boosting to Improve a Hybrid HMM/Neural Network Speech RecognizerH. SchwenkProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012March 1999Speech[PDF]

Multimodal City-Verification on Flickr Videos Using Acoustic and Textual FeaturesH. Lei, J. Choi, and G. FriedlandProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

User Verification: Matching the Uploaders of Videos Across AccountsH. Lei, J. Choi, A. Janin, and G. FriedlandProceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407May 2011Speech[PDF]

Spectro-Temporal Gabor Features for Speaker RecognitionH. Lei, B. T. Meyer, and N. MirghaforiProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Word-Conditioned Phone N-Grams for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256April 2007Speech[PDF]

Word-Conditioned HMM Supervectors for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 746-749August 2007Speech[PDF]

Comparisons of Recent Speaker Recognition Approaches Based on Word ConditioningH. Lei and N. MirghaforiProceedings of Odyssey 2008, Stellenbosch, South AfricaJanuary 2008Speech[PDF]

Data Selection with Kurtosis and Nasality features for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756August 2011Speech[PDF]

Importance of Nasality Measures for Speaker Recognition Data Selection and Performance PredictionH. Lei and E. Lopez-GonzaloProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891September 2009Speech[PDF]

Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker RecognitionH. Lei and E. Lopez-GonzaloProceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326September 2009Speech[PDF]

ICSI System Description for SRE2008 SubmissionH. Lei and D.V. LeeuwenSpeaker Recognition Evaluation 2008, National Institute of Standards and Technology 2008Speech[PDF]

Applications of Keyword-Constraining in Speaker RecognitionH. LeiMS Thesis, University of California-BerkeleyJuly 2007Speech[PDF]

Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker RecognitionH. LeiProceedings of the Third IAPR/IEEE International Conference on Biometrics (ICB 2009), Alghero, ItalyJune 2009Speech[PDF]

Structured Approaches to Data Selection for Speaker RecognitionH. LeiUC Berkeley dissertationDecember 2010Speech[PDF]

Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization StrategiesH. Hung, Y. Huang, G. Friedland, and D. Gatica-PerezProceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 2197-2200April 2008Speech[PDF]

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single MicrophoneH. Hung, Y. Huang, G. Friedland, and D. Gatica-PerezIEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860May 2011Speech
Computationally Efficient Clustering of Audio-Visual Meeting DataH. Hung, G. Friedland, and C. YeoIn Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etho, J. Luo, and L. Shao, eds., pp. 25-59 2010Speech
Using Audio and Video Features to Classify the Most Dominant Person in MeetingsH. Hung, D. Jayagopi, C. Yeo, G. Friedland, S. Ba, J-M. Odobez, K. Ramchandran, N. Mirghafori, and D. Gatica-PerezProceedings of ACM Multimedia 2007, Augsburg, Germany, pp. 835-838September 2007Speech
Towards Audio-Visual On-Line Diarization of Participants in Group MeetingsH. Hung and G. FriedlandProceedings of European Conference on Computer Vision (ECCV), Marseille, FranceOctober 2008Speech[PDF]

Recognition of Speech in Additive and Convolutional Noise Based on RASTA Spectral ProcessingH. Hermansky, N. Morgan, and H.G. HirschProceedings of the IEEE Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. II-83-86 1993Speech
The Challenge of Inverse-E: The RASTA-PLP MethodH. Hermansky, N. Morgan, A. Bayya, and P. KohnProceedings of the 25th Asilomar Conference on Signals, Systems, & Computers, Pacific Grove, California, pp. 800-804November 1991Speech
RASTA-PLP Speech Analysis TechniqueH. Hermansky, N. Morgan, A. Bayya, and P. KohnProceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. I-121-124 1992Speech
Tandem Connectionist Feature Stream Extraction for Conventional HMM SystemsH. Hermansky, D. Ellis, and S. SharmaProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1635-1638June 2000Speech[PDF]

Automatic Speech RecognitionH. Hermansky, and N. MorganEncyclopedia of Cognitive Science, Nature Publishing Group, London 2003Speech
Compensation for the effect of the communication channel in Perceptual Linear Predictive (PLP) analysis of speechH. Hermansky, A. Bayya, N. Morgan, P. KohnProceedings of the Second European Conference on Speech Communication and Technology (Eurospeech '91), Genova, Italy, pp. 1367-1370 1991Speech
Temporal Patterns (TRAPS) in ASR of Noisy SpeechH. Hermansky and S. SharmaProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, ArizonaMarch 1999Speech
Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition ResearchH. Hermansky and N. MorganJournal of Negative Results in Speech and Audio Sciences, Vol. 1, Issue 1 2004Speech[PDF]

Pages