| Model Adaptation for Sentence Segmentation from Speech | S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105 | December 2006 | Speech | [PDF]
|
| The ICSI Meeting Project: Resources and Research | A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede | Proceedings of the ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada | May 2004 | Speech | [PDF]
|
| Backoff Model Training Using Partially Observed Data: Application to Dialog Act Tagging | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York City, New York, pp. 280-287 | June 2006 | Speech | [PDF]
|
| Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures | I. Bulyko, M. Ostendorf, and A. Stolcke | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9 | May 2003 | Speech | [PDF]
|
| Factored Language Models and Generalized Parallel Backoff | J. Bilmes and K. Kirchhoff | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1 | May 2003 | Speech | [PDF]
|
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|
| Multi-Speaker Language Modeling | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts, pp. 133-136 | May 2004 | Speech | [PDF]
|
| The ICSI Meeting Recorder Dialog Act (MRDA) Corpus | E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts | April 2004 | Speech | [PDF]
|
| The Meeting Project at ICSI | N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke | Proceedings of the Human Language Technologies Conference, San Diego, California | March 2001 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Stochastic Perceptual Speech Models with Durational Dependence | J. Bilmes, N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Insights Into Spoken Language Gleaned from Phonetic Transcriptions of the Switchboard Corpus | S. Greenberg, J. Hollenback, and D. Ellis | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| Multimodal Indoor Localization: An Audio-Wireless-Based Approach | O. Vinyals, E. Martin, and G. Friedland | Proceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125 | September 2010 | Speech | [PDF]
|
| Digit Recognition with Stochastic Perceptual Models | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Building Multiple Pronunication Models for Novel Words using Exploratory Computational Phonology | G. Tajchman, E. Fosler, and D. Jurafsky | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities in Connectionist Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Fast Speakers in Large Vocabulary Continuous Speech Recognition: Analysis & Antidotes | N. Mirghafori, E. Fosler, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Long-Term Temporal Features for Conversational Speech Recognition | B. Chen, Q. Zhu, and N. Morgan | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Tandem Connectionist Feature Extraction for Conversational Speech Recognition | Q. Zhu, B. Chen, N. Morgan, and A.Stolcke | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Speech Recognition for Illiterate Access to Information and Technology | M. Plauché, N. Udhyakummar, C. Wooters, J. Pal, and D. Ramachadran | Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06), Berkeley, California, pp. 83-92 | May 2006 | Speech | [PDF]
|
| How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]
|
| Modeling Dynamic Prosodic Variation for Speaker Verification | K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, Vol. 7, p. 3189 | November 1998 | Speech | |
| Performance Improvements Through Combining Phone- and Syllable-Length Information in Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, pp. 854-857 | November 1998 | Speech | [PDF]
|
| Spectral Basis Functions from Discriminant Analysis | H. Hermansky and N. Malayath | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia | November 1998 | Speech | |
| Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers | N. Mirghafori and N. Morgan | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 743-746. | 1998 | Speech | [PDF]
|
| Speech Intelligibility Derived From Exceedingly Sparse Spectral Information | S. Greenberg, T. Arai, and R. Silipo | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 74-77 | November 1998 | Speech | [PDF]
|
| Data-Driven Extensions to HMM Statistical Dependencies | J. Bilmes | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72 | November 1998 | Speech | [PDF]
|
| REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications | M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis | Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788 | May 2006 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| Speech Recognition Using On-line Estimation of Speaking Rate | N. Morgan, E. Fosler, and N. Mirghafori | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 4, pp. 2079-2082 | September 1997 | Speech | [PDF]
|
| The Temporal Properties of Spoken Japanese Are Similar to Those of English | T. Arai and S. Greenberg | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 2, pp. 1011-1014 | September 1997 | Speech | [PDF]
|
| Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems | L. Hennebert, C. Ris, H. Bourlard, S Renals, and N. Morgan | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, pp. 1951-1954 | September 1997 | Speech | |
| Multiresolution Channel Normalization for ASR in Reverberant Environments | C. Avendano, S. Tibrewala, and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Data-Driven Design of RASTA-like Filters | S. van Vuuren and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| EEG Signal Compression Based on Classified Signature and Envelope Vector Sets | H. Gurkan, U. Guz, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain, pp. 420-423 | August 2007 | Speech | |
| A New Algorithm for High Speed Speech and Audio Coding | U. Guz, H. Gurkan, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain | August 2007 | Speech | |
| Understanding Speech Understanding | S. Greenberg | Proceedings of the ESCA Workshop on the "Auditory Basis of Speech Perception," Keele University, Staffordshire, UK, pp. 1-8 | 1996 | Speech | [PDF]
|
| Prediction-driven Computational Auditory Scene Analysis for Dense Sound Mixtures | D. Ellis | Proceedings of the ESCA Workshop on the "Auditory Basis of Speech Perception," Keele University, Staffordshire, UK | 1996 | Speech | [PDF]
|
| Effects of Speaking Rate and Word Predictability on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands | May 1998 | Speech | [PDF]
|
| Speaking in Shorthand - A Syllable-Centric Perspective for Understanding Pronunciation Variation | S. Greenberg | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kekrade, Netherlands, pp. 47-56 | 1998 | Speech | [PDF]
|
| Improving ASR Performance for Reverberant Speech | B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 87-90 | 1997 | Speech | [PDF]
|
| Robust Features and Environmental Compensation: A Few Comments | N. Morgan | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 43-44 | 1997 | Speech | [PDF]
|
| On the Origins of Speech Intelligibility in the Real World | S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 23-32 | 1997 | Speech | [PDF]
|
| Spotting "Hot Spots" in Meetings: Human Judgments and Prosodic Cues | B. Wrede and E. Shriberg | Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2805-2808 | September 2003 | Speech | [PDF]
|
| Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004. | November 2004 | Speech | [PDF]
|
| Not Just What, But Also When: Guided Automatic Pronunciation Modeling for Broadcast News | E. Fosler-Lussier and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| Reducing Errors by Increasing the Error Rate: MLP Acoustic Modeling for Broadcast News Transcription | N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin, and B. Kingsbury | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| An Overview of the SPRACH System for the Transcription of Broadcast News | G. Cook, J. Christie, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson, and G. Williams | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia | February 1999 | Speech | [PDF]
|
| The Uninvited Guest: Information's Role in Guiding the Production of Spontaneous Speech | S. Greenberg and E. Fosler-Lussier | Proceedings of the Crest Workshop on Models of Speech Production: Motor Planning and Articulatory Modelling, Kloster Seeon, Germany | May 2000 | Speech | [PDF]
|