| Multi-Channel Source Separation by Factorial HMMs | M.J. Reyes-gomez, B. Raj, and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Multi-Level Decision Trees for Static and Dynamic Pronunciation Models | E. Fosler-Lussier | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. I-463-466 | September 1999 | Speech | [PDF]
|
| Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms | M. Ferras Font | M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain | July 2005 | Speech | [PDF]
|
| Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features | G. Friedland, H. Hung, and C. Yeo | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072 | April 2009 | Speech | [PDF]
|
| Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]
|
| Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech | |
| Multi-Speaker Language Modeling | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts, pp. 133-136 | May 2004 | Speech | [PDF]
|
| Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]
|
| Multi-Stream Speaker Diarization Systems for the Meetings Domain | A. Gallardo-Antolin, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006—ICSLP), Philadelphia, Pennsylvania, pp. 2186-2189 | September 2006 | Speech | [PDF]
|
| Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]
|
| Multi-stream Speech Recognition: Ready for Prime Time? | A. Janin, D. Ellis, and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. II-591-594 | September 1999 | Speech | [PDF]
|
| Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASR | S. Y. Zhao, S. Ravuri, and N. Morgan | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954 | September 2009 | Speech | [PDF]
|
| Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech | U. Guz, S. Cuendet, G. Tur, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Vol. 18, Issue 2, pp. 320-329 | February 2010 | Speech | [PDF]
|
| Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol.5, pp. 641-644 | May 2004 | Speech | [PDF]
|
| Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future | G. Friedland | IEEE International Conference on Semantic Computing, Irvine, California | September 2007 | Speech | |
| Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
|
| Multimedia Education—Can We Find Unity in Diversity? | G. Friedland, W. Hürst, and L. Knipping | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1115-1116 | October 2008 | Speech | [PDF]
|
| Multimedia Information Extraction Roadmap | G. Myers, G. Tür, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, and D. Hakkani-Tür | Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, Virginia | November 2008 | Speech | [PDF]
|
| Multimedia Technologies for E-learning | G. Friedland and L. Knipping (editors) | Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., United Kingdom | March 2007 | Speech | |
| Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech | |
| Multimodal City-Verification on Flickr Videos Using Acoustic and Textual Features | H. Lei, J. Choi, and G. Friedland | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Multimodal Indoor Localization: An Audio-Wireless-Based Approach | O. Vinyals, E. Martin, and G. Friedland | Proceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125 | September 2010 | Speech | [PDF]
|
| Multimodal Interfaces for Automotive Applications (MIAA) | C. Müller and G. Friedland | Proceedings of the ACM International Conference on Intelligent User Interfaces (IUI 2009), Sanibel, Florida, pp. 493-494 | February 2009 | Speech | |
| Multimodal Location Estimation | G. Friedland, O. Vinyals, and T. Darrell | Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1245-1251 | October 2010 | Speech | [PDF]
|
| Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training Data | J. Choi, G. Friedland, V. Ekambaram, and K. Ramchandran | Proceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48 | July 2012 | Speech | [PDF]
|
| Multimodal Location Estimation on Flickr Videos | G. Friedland, J. Choi, H. Lei, and A. Janin | Proceedings of the ACM International Workshop on Social Media (WSM11), Scottsdale, Arizona | November 2011 | Speech | [PDF]
|
| Multimodal Model Integration for Sentence Unit Detection | L. Chen, Y. Liu, M. Harper, and E. Shriberg | Sixth International Conference on Multimodal Interfaces, October 2004 | 2004 | Speech | |
| Multimodal Speaker Diarization Using Oriented Optical Flow Histograms | M. Knox and G. Friedland | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 290-293 | September 2010 | Speech | [PDF]
|
| Multiple-Pronunciation Lexical Modeling in a Speaker Independent Speech Understanding System | C. Wooters and A. Stolcke | Proceedings of the Third International Conference on Spoken Language Processing (ICSLP 94), Yokohama, Japan, pp. 1963-1966 | September 1994 | Speech | [PDF]
|
| Multiple-State Context-Dependent Phonetic Modeling with MLPs | M. Cohen, H. Franco, N. Morgan, D. Rumelhart, and V. Abrash | Proceedings of the Speech Research Symposium XII, Rutgers University, Camden, New Jersey | 1992 | Speech | |
| Multiresolution Channel Normalization for ASR in Reverberant Environments | C. Avendano, S. Tibrewala, and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Multispeaker Speech Activity Detection for the ICSI Meeting Recorder | T. Pfau, D. Ellis, and A. Stolcke | Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU 2001),
Madonna di Campiglio, Italy, pp. 107-110 | December 2001 | Speech | [PDF]
|
| Mutaphrase: Paraphrasing with FrameNet | M. Ellsworth and A. Janin | Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing (TextEntail), Prague, Czech Republic, pp. 143-150 | June 2007 | Speech | [PDF]
|
| Name-Aware Speech Recognition for Interactive Question Answering | S. Stoyanchev, G. Tur, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116 | April 2008 | Speech | [PDF]
|
| Narrative Theme Navigation for Sitcoms Supported by Fan-Generated Scripts | G. Friedland, A. Janin, and L. Gottlieb | To appear in Multimedia Tools and Applications, Springer | 2012 | Speech | [PDF]
|
| Narrative-Theme Navigation for Sitcoms Supported by Fan-Generated Scripts | G. Friedland, L. Gottlieb, and A. Janin | Proceedings of the Third International Workshop on Automated Information Extraction in Media Production (AIEMPro '10) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 3-8 | October 2010 | Speech | [PDF]
|
| Natural Statistical Models for Automatic Speech Recognition | J. Bilmes | Ph.D. Thesis, University of California at Berkeley, Fall 1999. Also ICSI Technical Report TR-99-016 | October 1999 | Speech | [PDF]
|
| Neural nets and hidden Markov models: Review and Generalizations | H. Bourlard, N. Morgan, and S. Renals | Speech Communication, Vol. 11, No.2-3, pp. 237-246 | 1992 | Speech | |
| Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition | H. Bourlard and N. Morgan | Proceedings of the the International Joint Conference on Neural Networks (IJCNN '91), Singapore | 1991 | Speech | |
| Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition | H. Bourlard and N. Morgan | Proceedings of the International Joint Conference on Neural Networks (IJCNN '91), Singapore | 1991 | Speech | |
| Neural Networks for Statistical Recognition of Continuous Speech | N. Morgan and H. Bourlard | Proceedings of IEEE, Vol. 83, Issue 5, pp. 742-770 | May 1995 | Speech | |
| Noise Robust Speaker Identification for Spontaneous Arabic Speech | M. Graciarena, S. Kajarekar, A. Stolcke, and E. Shriberg | Proceedings of the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 245-248 | April 2007 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Not Just What, But Also When: Guided Automatic Pronunciation Modeling for Broadcast News | E. Fosler-Lussier and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization | N. Mirghafori and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 1017-1020 | May 2006 | Speech | [PDF]
|
| Object Cut and Paste in Images and Videos | G. Friedland, K. Jantz, T. Lenz, F. Wiesel, and R. Rojas | International Journal of Semantic Computing, World Scientific, Vol. 1, Issue 2, pp. 221-247, USA | July 2007 | Speech | |
| Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| On Data-Derived Temporal Processing in Speech Feature Extraction | M. Shire and B. Chen | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| On Reversing the Generation Process in Optimality Theory | E. Fosler | Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL-96), Santa Cruz, California | 1996 | Speech | [PDF]
|
| On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017 | September 2006 | Speech | [PDF]
|