| The Relation Between Speech Intelligibility and the Complex Modulation Spectrum | S. Greenberg and T. Arai | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Vowel Height is Intimately Associated with Stress Accent in Spontaneous American English Discourse | L. Hitchcock and S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| A Dutch Treatment of an Elitist Approach to Articulatory-Acoustic Feature Classification | M. Wester, S. Greenberg, and S. Chang | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Speech Intelligibility Derived From Asynchronous Processing of Auditory-Visual Information | K.W. Grant and S. Greenberg | Proceedings of the International Conference on Auditory-Visual Speech Processing Workshop (AVSP 2001), Scheelsminde, Denmark | September 2001 | Speech | [PDF]
|
| The Relation Between Stress Accent and Vocalic Identity in Spontaneous American English Discourse | S. Greenberg, S. Chang, and L. Hitchcock | Proceedings of ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | |
| Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | [PDF]
|
| Multispeaker Speech Activity Detection for the ICSI Meeting Recorder | T. Pfau, D. Ellis, and A. Stolcke | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy, pp. 107-110 | December 2001 | Speech | [PDF]
|
| Evaluating Long-term Spectral Subtraction for Reverberant ASR | D. Gelbart and N. Morgan | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy | December 2001 | Speech | [PDF]
|
| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D. Dissertation, University of Oldenburg, Germany | 2002 | Speech | |
| The Relation of Stress Accent to Pronunciation Variation in Spontaneous American English Discourse | S. Greenberg, H.M. Carvey, and L. Hitchcock | Proceedings of the International Conference on Speech Prosody 2002, Aix-en-Provence, France | April 2002 | Speech | |
| A New Speaker Change Detection Method for Two-Speaker Segmentation | A. Adami, S. Kajarekar, and H. Hermansky | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Unknown-Multiple Speaker Clustering Using HMM | J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | May 2002 | Speech | |
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Reducing the Effect of Room Acoustics on Human-Computer Interaction | D. Gelbart | Proceedings of the Applied Voice Input/Output Society (AVIOS 2002), San Jose, California | May 2002 | Speech | [PDF]
|
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Using Prosodic and Lexical Information for Speaker Identification | F. Weber, L. Manganaro, B. Peskin, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Qualcomm-ICSI-OGI Features for ASR | A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog | J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | |
| Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues | D. Baron, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado, pp. 949-952 | September 2002 | Speech | [PDF]
|
| A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition | S. Chang | Ph.D. Thesis, University of California at Berkeley. Also ICSI Technical Report TR-02-007 | September 2002 | Speech | [PDF]
|
| Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition | D. Gelbart and N. Morgan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|
| Improving Word Accuracy with Gabor Feature Extraction | M. Kleinschmidt and D. Gelbart | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| What's New in Government-Sponsored Speech Recognition Research | N. Morgan | Speech Technology Magazine, Vol. 7, No. 5 | September 2002 | Speech | |
| Speech Modeling Using Variational Bayesian Mixture of Gaussians | P. Somervuo | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Automatic Speech Recognition | H. Hermansky and N. Morgan | Encyclopedia of Cognitive Science, Nature Publishing Group, London | 2003 | Speech | |
| Word Fragments Identification Using Acoustic-Prosodic Features in Conversational Speech | Y. Liu | Proceedings of HLT/NAACL, Student Session, Edmonton, Alberta | 2003 | Speech | |
| Data-Driven Speaker and Subword Unit Clustering in Speech Processing | M. Hersch | EPFL Diploma Thesis, ICSI | March 2003 | Speech | [PDF]
|
| The ICSI Meeting Corpus | A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Meetings About Meetings: Research at ICSI on Speech in Multiparty Conversations | N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Using Prosodic and Conversational Features for High-Performance Speaker Recognition: Report From JHU WS'02 | B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Audio Information Access from Meeting Rooms | S. Renals and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| The SuperSID Project: Exploiting High-Level Information for High-Accuracy Speaker Recognition | D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Experiments with Linear and Nonlinear Feature Transformations in HMM Based Phone Recognition | P. Somervuo | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Multi-Channel Source Separation by Factorial HMMs | M.J. Reyes-Gomez, B. Raj, and D. Ellis | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong | April 2003 | Speech | [PDF]
|
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|
| Factored Language Models and Generalized Parallel Backoff | J. Bilmes and K. Kirchhoff | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1 | May 2003 | Speech | [PDF]
|
| Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures | I. Bulyko, M. Ostendorf, and A. Stolcke | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9 | May 2003 | Speech | [PDF]
|
| An Improved Approximation Algorithm for Vertex Cover with Hard Capacities | R. Gandhi, E. Halperin, S. Khuller, G. Kortsarz, and A. Srinivasan | Proceedings of the 30th International Colloquium on Automata, Languages and Programming (ICALP 2003), Eindhoven, The Netherlands, pp. 164-175 | June 2003 | Speech | [PDF]
|
| Discourse Segmentation of Multi-party Conversation | M. Galley, K. McKeown, E. Fosler-Lussier, and H. Jing | Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL-03), Sapporo, Japan | July 2003 | Speech | [PDF]
|
| Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings | S. Bhagat, H. Carvey, and E. Shriberg | Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003), Barcelona, Spain | August 2003 | Speech | [PDF]
|
| Improving Word Sense Disambiguation in Lexical Chaining | M. Galley and K. McKeown | Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI 03), Acapulco, Mexico, pp. 1486-1488 | August 2003 | Speech | [PDF]
|
| Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers | B. Chen, S. Chang, and S. Sivadas | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Far-Field ASR on Inexpensive Microphones | L. Docio, D. Gelbart, and N. Morgan | Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2141-2144 | September 2003 | Speech | [PDF]
|
| Automatic Disfluency Identification in Conversational Speech Using Multiple Knowledge Sources | Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Feature Transformations and Combinations for Improving ASR Performance | P. Somervuo, B. Chen, and Q. Zhu | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Spotting "Hot Spots" in Meetings: Human Judgments and Prosodic Cues | B. Wrede and E. Shriberg | Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2805-2808 | September 2003 | Speech | [PDF]
|
| Speaker Recognition Using Prosodic and Lexical Features | S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke, H. Bratt, and R. R. Gadde | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands, pp. 19-24 | November 2003 | Speech | [PDF]
|
| Pitch-Based Emphasis Detection for Characterization of Meeting Recordings | L. Kennedy and D. Ellis | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2003), St. Thomas, Virgin Islands | November 2003 | Speech | [PDF]
|