| From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System | N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, and M. Ostendorf | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| ICSI's 2005 Speaker Recognition System | N. Mirghafori, A. O. Hatch, S. Stafford, K. Boakye, D. Gillick, and B. Peskin | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 23-28 | November 2005 | Speech | [PDF]
|
| Sooner or Later: Exploring Asynchrony in Multi-Band Speech Recognition | N. Mirghafori and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, Vol. 2, pp. 595-598 | September 1999 | Speech | [PDF]
|
| Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers | N. Mirghafori and N. Morgan | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 743-746. | 1998 | Speech | [PDF]
|
| Transmissions and Transitions: A Study of Two Common Assumptions in Multi-Band ASR | N. Mirghafori and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 713-716 | 1998 | Speech | [PDF]
|
| Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System | N. Mirghafori and M. Hebert | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization | N. Mirghafori and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 1017-1020 | May 2006 | Speech | [PDF]
|
| A Multi-Band Approach to Automatic Speech Recognition | N. Mirghafori | Ph.D. Dissertation, University of California at Berkeley, December 1998. Also ICSI Technical Report, TR-99-004, January 1999 | December 1998 | Speech | [PDF]
|
| Clap Detection and Discrimination for Rhythm Therapy | N. Lesser and D.P.W. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 37-40 | March 2005 | Speech | [PDF]
|
| Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis | N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli | Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696. | October 2008 | Speech | [PDF]
|
| ClusterRank: A Graph Based Method for Meeting Summarization | N. Garg, B. Favre, K. Riedhammer, and D. Hakkani-Tür | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1499-1502 | September 2009 | Speech | [PDF]
|
| Personalized, Interactive Tag Recommendation for Flickr | N. Garg and I. Weber | Proceedings of the Second ACM International Conference on Recommender Systems (RecSys 2008), Lausanne, Switzerland, pp. 67-74 | October 2008 | Speech | [PDF]
|
| Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]
|
| Data-Driven Modulation Filter Design Under Adverse Acoustic Conditions and Using Phonetic and Syllabic Units | M.L. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. III-1123-1126 | September 1999 | Speech | [PDF]
|
| Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol. 5, pp. 641-644 | May 2004 | Speech | [PDF]
|
| Multi-Channel Source Separation by Factorial HMMs | M.J. Reyes-Gomez, B. Raj, and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| A* Based Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 215-219 | November 2005 | Speech | [PDF]
|
| Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193 | July 2005 | Speech | [PDF]
|
| Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, A. Stolcke, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 581-584 | May 2006 | Speech | [PDF]
|
| The ICSI+ Multi-Lingual Sentence Segmentation System | M. Zimmermann, D. Hakkani-Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, and Y. Liu | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 117-120 | September 2006 | Speech | |
| A Dutch Treatment of an Elitist Approach to Articulatory-Acoustic Feature Classification | M. Wester, S. Greenberg, and S. Chang | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| A Comparison of Data-Derived and Knowledge-Based Modeling of Pronunciation Variation | M. Wester and E. Fosler-Lussier | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Where did I go Wrong?: Identifying Troublesome Segments for Speaker Diarization Systems | M. T. Knox, N. Mirghafori, and G. Friedland | Proceedings of the 13th Annual Conference of the International Speech Communication Association (InterSpeech 2012), Portland, Oregon | September 2012 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| On Data-Derived Temporal Processing in Speech Feature Extraction | M. Shire and B. Chen | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
|
| Data-driven RASTA Filters in Reverberation | M. Shire and B. Chen | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630 | June 2000 | Speech | [PDF]
|
| Relating Frame Accuracy with Word Error in Hybrid ANN-HMM ASR | M. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition | M. Shire | Ph.D. Dissertation, University of California at Berkeley, Fall 2000 | 2000 | Speech | [PDF]
|
| REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications | M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis | Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788 | May 2006 | Speech | [PDF]
|
| Japanese Speech Understanding Using Grammar Specialization | M. Rayner, N. Chatzichrisafis, P. Bouillon, Y. Nakao, H. Isahara, K. Kanzaki, B. A. Hockey, M. Santaholma, and M. Starlander | Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, Canada, pp. 26-27 | October 2005 | Speech | |
| A Voice-Enabled Procedure Browser for the International Space Station | M. Rayner, B.A. Hockey, N. Chatzichrisafis, K. Farrell, and J.M. Renders | Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 29-32 (interactive poster and demo track) | June 2005 | Speech | |
| Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler | M. Rayner, B.A. Hockey, and P. Bouillon | CSLI Press | May 2006 | Speech | |
| How to Build a Spoken Dialog System with Limited (or No) Resources | M. Plauché, O. Cetin, and N. Udhyakumar | Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India | January 2007 | Speech | |
| Speech Recognition for Illiterate Access to Information and Technology | M. Plauché, N. Udhyakumar, C. Wooters, J. Pal, and D. Ramachandran | Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06), Berkeley, California, pp. 83-92 | May 2006 | Speech | [PDF]
|
| Tamil Market: A spoken dialog system for rural India | M. Plauché and M. Prabaker | Working Papers in Computer-Human Interfaces | April 2006 | Speech | [PDF]
|
| Speech Segmentation and Spoken Document Processing | M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, H. Ji, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters | IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69 | May 2008 | Speech | [PDF]
|
| Entropy Based Classifier Combination for Sentence Segmentation | M. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192 | April 2007 | Speech | [PDF]
|
| Selecting On-topic Sentences from Natural Language Corpora | M. Levit, E. Boschee, and M. Freedman | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2793-2796 | August 2007 | Speech | |
| IXIR: A Statistical Information Distillation System | M. Levit, D. Hakkani-Tür, G. Tür, and D. Gillick | Journal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542 | October 2009 | Speech | [PDF]
|
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man, and Cybernetics, Part B, Vol. 37, No. 3, IEEE Systems, Man, and Cybernetics Society, pp. 667-679 | June 2007 | Speech | |
| Automatic Tagging and Geo-Tagging in Video Collections and Communities | M. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. Jones | Proceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy, April 2011 | April 2011 | Speech | [PDF]
|
| Getting the Last Laugh: Automatic Laughter Segmentation in Meetings | M. Knox, N. Morgan, and N. Mirghafori | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800 | September 2008 | Speech | [PDF]
|
| Automatic Laughter Detection Using Neural Networks | M. Knox and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2973-2976 | August 2007 | Speech | [PDF]
|
| Multimodal Speaker Diarization Using Oriented Optical Flow Histograms | M. Knox and G. Friedland | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 290-293 | September 2010 | Speech | [PDF]
|
| Improving Word Accuracy with Gabor Feature Extraction | M. Kleinschmidt and D. Gelbart | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D. Dissertation, University of Oldenburg, Germany | 2002 | Speech | |
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|
| Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections | M. Huijbregts, C. Wooters, and R. Ordelman | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2925-2928 | August 2007 | Speech | |
| The Blame Game: Performance Analysis of Speaker Diarization System Components | M. Huijbregts and C. Wooters | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1857-1860 | August 2007 | Speech | |
| Automatic Speech Recognition with Neural Spike Trains | M. Holmberg, D. Gelbart, U. Ramacher, and W. Hemmert | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
|