Publication Search Results

Title | Author | Bibliographic | Date | Group | Links
Multispeaker Speech Activity Detection for the ICSI Meeting Recorder | T. Pfau, D. Ellis, and A. Stolcke | Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy, pp. 107-110 | December 2001 | Speech | [PDF]

Multiresolution Channel Normalization for ASR in Reverberant Environments | C. Avendano, S. Tibrewala, and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech |
Multiple-State Context-Dependent Phonetic Modeling with MLPs | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech |
Multiple-Pronunciation Lexical Modeling in a Speaker Independent Speech Understanding System | C. Wooters and A. Stolcke | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1963-1966 | September 1994 | Speech | [PDF]

Multimodal Speaker Diarization Using Oriented Optical Flow Histograms | M. Knox and G. Friedland | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 290-293 | September 2010 | Speech | [PDF]

Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech |
Multimodal Location Estimation on Flickr Videos | G. Friedland, J. Choi, H. Lei, and A. Janin | Proceedings of the ACM International Workshop on Social Media (WSM11), Scottsdale, Arizona | November 2011 | Speech | [PDF]

Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]

Multimodal Location Estimation | G. Friedland, O. Vinyals, and T. Darrell | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1245-1251 | October 2010 | Speech | [PDF]

Multimodal Interfaces for Automotive Applications (MIAA) | C. Müller and G. Friedland | Proceedings of the ACM International Conference on Intelligent User Interfaces (IUI 2009), Sanibel, Florida, pp. 493-494 | February 2009 | Speech |
Multimodal Indoor Localization: An Audio-Wireless-Based Approach | O. Vinyals, E. Martin, and G. Friedland | Proceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125 | September 2010 | Speech | [PDF]

Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]

Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech |
Multimedia Technologies for E-learning | G. Friedland and L. Knipping (editors) | Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., United Kingdom | March 2007 | Speech |
Multimedia Information Extraction Roadmap | G. Myers, G. Tür, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, and D. Hakkani-Tür | Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, Virginia | November 2008 | Speech | [PDF]

Multimedia Education—Can We Find Unity in Diversity? | G. Friedland, W. Hürst, and L. Knipping | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1115-1116 | October 2008 | Speech | [PDF]

Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]

Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future | G. Friedland | IEEE International Conference on Semantic Computing, Irvine, California | September 2007 | Speech |
Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol.5, pp. 641-644 | May 2004 | Speech | [PDF]

Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech | U. Guz, S. Cuendet, G. Tur, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329 | February 2010 | Speech | [PDF]

Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASR | S. Y. Zhao, S. Ravuri, and N. Morgan | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954 | September 2009 | Speech | [PDF]

Multi-stream Speech Recognition: Ready for Prime Time? | A. Janin, D. Ellis, and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. II-591-594 | September 1999 | Speech | [PDF]

Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]

Multi-Stream Speaker Diarization Systems for the Meetings Domain | A. Gallardo-Antolin, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006—ICSLP), Philadelphia, Pennsylvania, pp. 2186-2189 | September 2006 | Speech | [PDF]

Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]

Multi-Speaker Language Modeling | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts, pp. 133-136 | May 2004 | Speech | [PDF]

Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech |
Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]

Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features | G. Friedland, H. Hung, and C. Yeo | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072 | April 2009 | Speech | [PDF]

Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms | M. Ferras Font | M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain | July 2005 | Speech | [PDF]

Multi-Level Decision Trees for Static and Dynamic Pronunciation Models | E. Fosler-Lussier | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. I-463-466 | September 1999 | Speech | [PDF]

Multi-Channel Source Separation by Factorial HMMs | M.J. Reyes-Gomez, B. Raj, and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]

Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages | M. Creutz, T. Hirsimäki, M. Kurimo, A. Puurula, J. Pylkkönen, V. Siivola, M. Varjokallio, E. Arisoy, M. Saraclar, and A. Stolcke | ACM Transactions on Speech and Language Processing, Vol. 5, Issue 1, pp. 1-29 | December 2007 | Speech | [PDF]

Modulation Spectrogram Features for Speaker Diarization | O. Vinyals and G. Friedland | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 630-633 | September 2008 | Speech |
Modeling Prosodic Feature Sequences for Speaker Recognition | E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke | Speech Communication, Vol. 46, Issues 3-4, pp. 455-472 | July 2005 | Speech |
Modeling Other Talkers for Improved Dialog Act Recognition in Meetings | K. Laskowski and E. Shriberg | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786 | September 2009 | Speech | [PDF]

Modeling NERFs for Speaker Recognition | S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke | Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2004), Toledo, Spain, pp. 51-56 | May 2004 | Speech | [PDF]

Modeling Dynamics in Connectionist Speech Recognition - the Time Index Model | Y. Konig and N. Morgan | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1523-1526 | September 1994 | Speech | [PDF]

Modeling Dynamic Prosodic Variation for Speaker Verification | K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, Vol. 7, p. 3189 | November 1998 | Speech |
Modeling Consistency in a Speaker Independent Continuous Speech Recognition System | Y. Konig, N. Morgan, C. Wooters, V. Abrash, M. Cohen, and H. Franco | Advances in Neural Information Processing Systems, Vol. V, pp. 682-687 | 1993 | Speech |
Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization | X. Anguera, T. Shinozaki, C. Wooters, and J. Hernando | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4 pp. 273-276 | April 2007 | Speech | [PDF]

Model Adaptation for Sentence Segmentation from Speech | S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105 | December 2006 | Speech | [PDF]

Model Adaptation for Dialog Act Tagging | G. Tur, U. Guz, and D. Hakkani-Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 94-97 | December 2006 | Speech | [PDF]

MLP-Based Feature Extraction for Speech Transcription | N. Morgan, A. Faria, S. Ravuri, and S. Zhao | Handbook of Natural Language Processing and Machine Translation, J. Olive, ed., Springer, in press | 2010 | Speech |
MLLR Transforms as Features in Speaker Recognition | A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428 | September 2005 | Speech |
Midlevel Representations for Computational Auditory Scene Analysis: The Weft Element | D. Ellis and D. Rosenthal | Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272 | 1998 | Speech |
Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition | H. Bourlard and N. Morgan | ICSI Technical Report TR-089-033 | 1989 | Speech |
Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition | H. Bourlard and N. Morgan | Artificial Neural Networks: Advances and Applications | 1990 | Speech |
Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition | H. Lei and E. Lopez-Gonzalo | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326 | September 2009 | Speech | [PDF]

Meetings About Meetings: Research at ICSI on Speech in Multiparty Conversations | N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
