| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| The ICSI Meeting Corpus: Close-Talking and Far-Field, Multi-Channel Transcriptions for Speech and Language Researchers | J. A. Edwards | Proceedings of the Workshop on Compiling and Processing Spoken Language Corpora at the Fourth International Conference on Language Resources and Evaluation (LREC 2004), pp. 8-11 | May 2004 | Speech | [PDF]
|
| Auditory-Based Automatic Speech Recognition | W. Hemmert, M. Holmberg, and D. Gelbart | Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Vocabulary and Language Model Adaptation Using Information Retrieval | B. Bigi, Y. Huang, and R. De Mori | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Learning Long-Term Temporal Features in LVCSR Using Neural Networks | B. Chen, Q. Zhu, and N. Morgan | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan. and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004. | October 2004 | Speech | [PDF]
|
| Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing | E. Shriberg and A. Stolcke | Proceedings of the International Conference on Speech Prosody, Nara, Japan, March 2004. | March 2004 | Speech | [PDF]
|
| Prosody Modeling for Automatic Speech Recognition and Understanding | E. Shriberg and A. Stolcke | Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag. | 2004 | Speech | [PDF]
|
| Qualcomm-ICSI-OGI Features for ASR | A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| A New Speaker Change Detection Method for Two-Speaker Segmentation | A. Adami, S. Kajarekar, and H. Hermansky | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Unknown-Multiple Speaker Clustering Using HMM | J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | May 2002 | Speech | |
| Prosodic Cues For Emotion Recognition In Communicator Dialogs | J.C. Ang | M.S. Thesis, University of California at Berkeley | December 2002 | Speech | [PDF]
|
| Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog | J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | |
| Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus | D. Baron | M.S. Thesis, University of California at Berkeley | May 2002 | Speech | [PDF]
|
| Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues | D. Baron, E. Shriberg, and A. Stolcke | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado, pp. 949-952 | September 2002 | Speech | [PDF]
|
| A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition | S. Chang | Ph.D. Thesis, University of California at Berkeley. Also ICSI Technical Report TR-02-007 | September 2002 | Speech | [PDF]
|
| Reducing the Effect of Room Acoustics on Human-Computer Interaction | D. Gelbart | Proceedings of the Applied Voice Input/Output Society (AVIOS 2002), San Jose, California | May 2002 | Speech | [PDF]
|
| Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition | D. Gelbart and N. Morgan | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| The Relation of Stress Accent to Pronunciation Variation in Spontaneous American English Discourse | S. Greenberg, H.M. Carvey, and L. Hitchcock | Proceedings of the International Conference on Speech Prosody 2002, Aix-en-Provence, France | April 2002 | Speech | |
| Robust Speech Recognition Based on Spectro-Temporal Processing | M. Kleinschmidt | Ph.D Dissertation, University of Oldenberg, Germany | 2002 | Speech | |
| Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition | M. Kleinschmidt | Proceedings of the Triennial Forum Acusticum 2002, Seville, Spain | September 2002 | Speech | [PDF]
|
| Improving Word Accuracy with Gabor Feature Extraction | M. Kleinschmidt and D. Gelbart | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| What's New in Government-Sponsored Speech Recognition Research | N. Morgan | Speech Technology Magazine, Vol. 7, No. 5 | September 2002 | Speech | |
| Hierarchical Tandem Feature Extraction | S. Sivadas and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Speech Modeling Using Variational Bayesian Mixture of Gaussians | P. Somervuo | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
|
| Using Prosodic and Lexical Information for Speaker Identification | F. Weber, L. Manganaro, B. Peskin, and E. Shriberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, Florida | May 2002 | Speech | [PDF]
|
| Combining Bottom-Up and Top-Down Constraints for Robust ASR: The Multiscore Decoder | J. Barker, M. Cooke, and D. Ellis | Proceedings of the Workshop on Consistent and Reliable Acoustic Cues (CRAC-2001), Aalborg, Denmark | September 2001 | Speech | |
| Chapter 17: The Transcription of Discourse | J. Edwards | The Handbook of Discourse Analysis, D. Shriffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348 | 2001 | Speech | |
| Investigations Into Tandem Acoustic Modeling for the Aurora Taks | D.P.W. Ellis and M. Reyes | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | |
| Tandem Acoustic Modeling in Large-Vocabulary Recognition | D. Ellis, R. Singh, and S. Sivadas | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| The Relation Between Stress Accent and Vocalic Identity in Spontaneous American English Discourse | S. Greenberg, S. Chang, and L. Hitchcock | Proceedings of ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | |
| A Study of Two Dimensional Linear Descriminants For ASR | S. Kajarekar, B. Yegnanarayana, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| Multispeaker Speech Activity Detection for the ICSI Meeting Recorder | T. Pfau, D. Ellis, and A. Stolcke | Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU 2001),
Madonna di Campiglio, Italy, pp. 107-110 | December 2001 | Speech | [PDF]
|
| Evaluating Long-term Spectral Subtraction for Reverberant ASR | D. Gelbart and N. Morgan | Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU 2001), Madonna di Campiglio, Italy | December 2001 | Speech | [PDF]
|
| Relating Frame Accuracy with Word Error in Hybrid ANN-HMM ASR | M. Shire | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, New Jersey | October 2001 | Speech | [PDF]
|
| SpeechCorder, The Portable Meeting Recorder | A. Janin and N. Morgan | Proceedings of the Workshop on Hands-Free Speech Communication, Kyoto, Japan | April 2001 | Speech | [PDF]
|
| Meeting Recorder | A. Janin | Proceedings of the Applied Voice Input/Output Society, San Jose, California | April 2001 | Speech | [PDF]
|
| Robust ASR Front-End Using Spectral-Based and Discriminant Features: Experiments on the Aurora Tasks | C. Benitez, L. Burget, B. Chen, S. Dupont, H. Garudadri, H. Hermansky, P. Jain, S. Kajarekar, and S. Sivadas | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, pp. 429-432 | September 2001 | Speech | [PDF]
|
| Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation | E. Shriberg, A. Stolcke, and D. Baron | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| An Elitist Approach to Articulatory-Acoustic Feature Classification | S. Chang, S. Greenberg, and M. Wester | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| From Here to Utility - Melding Phonetic Insight with Speech Technology | S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Whither Speech Technology? - A Twenty-First Century Perspective | S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| The Relation Between Speech Intelligibility and the Complex Modulation Spectrum | S. Greenberg and T. Arai | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Vowel Height is Intimately Associated with Stress Accent in Spontaneous American English Discourse | L. Hitchcock and S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| A Dutch Treatment of an Elitist Approach to Articulatory-Acoustic Feature Classification | M. Wester, S. Greenberg, and S. Chang | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
|
| Multi-Stream ASR trained with Heterogeneous Reverberant Environments | M.L. Shire | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | [PDF]
|
| Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System | W. Warren | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, Utah | May 2001 | Speech | |
| The Meeting Project at ICSI | N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke | Proceedings of the Human Language Technologies Conference, San Diego, California | March 2001 | Speech | [PDF]
|
| Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information | K.W. Grant and S. Greenberg | Proceedings of the International Conference on Auditory-Visual Speech Processing Workshop (AVSP 2001), Scheelsminde, Denmark | September 2001 | Speech | [PDF]
|