| Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony | S. Greenberg and T. Arai | Proceedings of the Joint Meeting of the 137th Acoustical Society of America and the 16th International Congress on Acoustics (ICA/ASA), Seattle, Washington, pp. 2677-2678 | June 1998 | Speech | [PDF]
|
| Speech Intelligibility in the Presence of Cross-Channel Spectral Asynchrony | T. Arai and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936 | May 1998 | Speech | [PDF]
|
| Speech Intelligibility Derived From Exceedingly Sparse Spectral Information | S. Greenberg, T. Arai, and R. Silipo | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 74-77 | November 1998 | Speech | [PDF]
|
| Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information | K.W. Grant and S. Greenberg | Proceedings of the International Conference on Auditory-Visual Speech Processing Workshop (AVSP 2001), Scheelsminde, Denmark | September 2001 | Speech | [PDF]
|
| Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition | M. Holmberg, D. Gelbart, and W. Hemmert | Speech Communication, Vol. 49, Issue 12, pp. 917-932 | December 2007 | Speech | |
| Speech Data Modeling at WS96: The Questionable Parameter Group | N. Morgan | Proceedings of the International Conference on Spoken Language Processing (ICSLP 96), Addendum, Philadelphia, Pennsylvania, pp 30-31 | 1996 | Speech | |
| Speech and Audio Signal Processing: Processing and Perception of Speech and Music, 2nd Edition | B. Gold, N. Morgan, and D. Ellis | Wiley | November 2011 | Speech | |
| Speech and Audio Signal Processing | B. Gold and N. Morgan | Wiley Press, New York | 1999 | Speech | |
| Spectro-Temporal Gabor Features for Speaker Recognition | H. Lei, B. T. Meyer, and N. Mirghafori | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
|
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|
| Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of the 9th Annual Conference of the International Speech Communication
Association (Interspeech 2008), Brisbane, Australia | September 2008 | Speech | |
| Spectral Basis Functions from Discriminant Analysis | H. Hermansky and N. Malayath | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia | November 1998 | Speech | |
| Special Section on New Frontiers in Rich Transcription | G. Friedland, J. Fiscus, T. Hain, and S. Furui (eds) | IEEE Transactions in Audio, Speech, and Language Processing, Vol. 20, No. 2 | February 2012 | Speech | |
| Speaking in Shorthand - A Syllable-Centric Perspective for Understanding Pronunciation Variation | S. Greenberg | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kekrade, Netherlands, pp. 47-56 | 1998 | Speech | [PDF]
|
| Speaker Recogntion in the Text-Independent Domain Using Keyword Hidden Markov Models | K. Boakye | M.S. Thesis, University of California at Berkeley | May 2005 | Speech | [PDF]
|
| Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
|
| Speaker Recognition Via Nonlinear Discriminant Features | L. Stoll, J. Frankel, and N. Mirghafori | Proceedings of the International Speech Communication Association Tutorial and Research Workshop on Non-Linear Speech Processing (NOLISP 2007), Paris, France, pp. 27-30 | May 2007 | Speech | [PDF]
|
| Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection | T. Bocklet and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4525-4528 | April 2009 | Speech | [PDF]
|
| Speaker Recognition Using Prosodic and Lexical Features | S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke, H. Bratt, and R. R. Gadde | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands, pp. 19-24 | November 2003 | Speech | [PDF]
|
| Speaker Recognition and Diarization | G. Friedland and D. van Leeuwen | In Semantic Computing, P. Sheu, H. Yu, C. V. Ramamamoorthy, A. K. Joshi, and L. A. Zadeh, eds., pp. 115-130, IEEE Press/Wiley | 2010 | Speech | |
| Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap | O. Cetin and E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 357-360 | May 2006 | Speech | [PDF]
|
| Speaker Diarization: A Review of Recent Research | X. Anguera, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. Vinyals | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 356-370 | February 2012 | Speech | [PDF]
|
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|
| Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences | J. Pardo, X. Anguera, and C. Wooters | Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197 | September 2006 | Speech | [PDF]
|
| Speaker Diarization for Multi-Party Meetings Using Acoustic Fusion | X. Anguera, C. Wooters, and J. Hernando | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 426-461 | November 2005 | Speech | [PDF]
|
| Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences | J.M. Pardo, X Anguera, and C. Wooters | Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264 | May 2006 | Speech | [PDF]
|
| Speaker Diarization | G. Friedland | In Speech and Audio Signal Processing, 2nd edition, B. Gold, N. Morgan, D. Ellis, eds., Wiley | 2011 | Speech | |
| Speaker Diarization | G. Friedland and F. Valente | In Multimodal Signal Processing: Human Interactions in Meetings, S. Reynals, H. Bourlard, J. Carletta, and A. Popescu-Belis, eds., Cambridge University Press | June 2012 | Speech | |
| Speaker Detection Without Models | D. Gillick, S. Stafford, and B. Peskin | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760 | March 2005 | Speech | [PDF]
|
| Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1621-1624 | August 2007 | Speech | [PDF]
|
| Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of Speech | J. Kolar, Y. Liu, and E. Shriberg | Speech Communication, Vol. 52, Issue 3, pp. 236-245 | March 2010 | Speech | |
| SPAM: Experiments with Digit Recognition | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the 15th Annual Speech Research Symposium, Baltimore, Maryland | June 1995 | Speech | [PDF]
|
| Source Separation Based on Binaural Cues and Source Model Constraints | R. Weiss, M. Mandel, and D. Ellis | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422 | September 2008 | Speech | [PDF]
|
| Sooner or Later: Exploring Asynchrony in Multi-Band Speech Recognition | N. Mirghafori and N. Morgan | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, Vol. 2, pp. 595-598 | September 1999 | Speech | [PDF]
|
| SmartKom English: From Robust Recognition to Felicitous Interaction | D. Gelbart, J. Bryants, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan | In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer | November 2004 | Speech | [PDF]
|
| Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech Recognition | D. Ellis and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016 | March 1999 | Speech | [PDF]
|
| Simple, Accurate Parsing with an All-Fragments Grammar | M. Bansal and D. Klein | Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, pp. 1098-1107 | July 2010 | Speech | [PDF]
|
| Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research | H. Hermansky and N. Morgan | Journal of Negative Results in Speech and Audio Sciences, Vol. 1, Issue 1 | 2004 | Speech | [PDF]
|
| Should Recognizers Have Ears? | H. Hermansky | Proceedings of ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, pp. 1-10 | April 1997 | Speech | |
| Semi-Autonomous Car Control Using Brain Computer Interfaces | D. Goehring, D. Latotzky, M. Wang, and R. Rojas | Proceedings of the 12th International Conference of Intelligent Autonomous Systems (IAS), Juju Island, Korea | June 2012 | Speech | |
| Semantic Computing and Privacy: A Case Study Using Inferred Geo-Tagging | G. Friedland and J. Choi | International Journal of Semantic Computing, Vol. 5, No. 1, pp. 79-93. Also Best Poster in the Electrical and Computer Science and Engineering Track at the Korean Student Technical and Leadership Conference, Chicago, Illinois, March 2012. DOI: 10.1142/S1793351X11001171 | March 2011 | Speech | [PDF]
|
| Selecting On-topic Sentences from Natural Language Corpora | M. Levit, E. Boschee, and M. Freedman | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2793-2796 | August 2007 | Speech | |
| Selected Papers from the Third IEEE International Conference on Semantic Computing (ICSC2009) | G. Friedland and S. C. Shen, eds. | International Journal on Semantic Computing, Vol. 3, Issue 4 | December 2009 | Speech | |
| Selected Papers from the 11th IEEE International Symposium on Multimedia (ISM2009) | G. Friedland and M.-L. Shyu, eds. | International Journal on Semantic Computing, Vol. 4, No. 2 | November 2010 | Speech | |
| Search for Information Bearing Components in Speech | H.H. Yang and H. Hermansky | Advances in Neural Information Processing Systems, Vol. 12, S.A. Solla, T.K. Leen and K.-R. Muller, eds., MIT Press | 2000 | Speech | |
| Scaling Up: Learning Large-Scale Recognition Methods from Small-Scale Recognition Tasks | N. Morgan, B. Chen, Q. Zhu, and A. Stolcke | ICSI Technical Report tr-03-02. Also Special Workshop in Maui(SWIM) paper 218. | 2004 | Speech | [PDF]
|
| Scaling a Hybrid HMM/MLP System for Large Vocabulary CSR | N. Morgan, G. Tajchman, N. Mirghafori, Y. Konig, and C. Wooters | ARPA Spoken Language Technology Workshop, Morgan Kaufmann, pp. 123-124 | 1994 | Speech | |
| Sampling Alignment Structure Under a Bayesian Translation Model | J. DeNero, A. Bouchard-Côté, and D. Klein | Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323 | October 2008 | Speech | [PDF]
|
| Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis | N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli | Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696. | October 2008 | Speech | [PDF]
|
| Robust Speech Recognition Using the Modulation Spectrogram | B. Kingsbury, N. Morgan, and S. Greenberg | Speech Communication, Vol. 25, pp. 117-132 | 1998 | Speech | |