| The Temporal Properties of Spoken Japanese Are Similar to Those of English | T. Arai and S. Greenberg | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 2, pp. 1011-1014 | September 1997 | Speech | [PDF]
|
| Speech Recognition Using On-line Estimation of Speaking Rate | N. Morgan, E. Fosler, and N. Mirghafori | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 4, pp. 2079-2082 | September 1997 | Speech | [PDF]
|
| Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation | J. Choi and G. Friedland | Proceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246 | September 2011 | Speech | [PDF]
|
| REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications | M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis | Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788 | May 2006 | Speech | [PDF]
|
| Data-Driven Extensions to HMM Statistical Dependencies | J. Bilmes | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72 | November 1998 | Speech | [PDF]
|
| Speech Intelligibility Derived From Exceedingly Sparse Spectral Information | S. Greenberg, T. Arai, and R. Silipo | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 74-77 | November 1998 | Speech | [PDF]
|
| Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers | N. Mirghafori and N. Morgan | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 743-746. | 1998 | Speech | [PDF]
|
| Spectral Basis Functions from Discriminant Analysis | H. Hermansky and N. Malayath | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia | November 1998 | Speech | |
| Performance Improvements Through Combining Phone- and Syllable-Length Information in Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, pp. 854-857 | November 1998 | Speech | [PDF]
|
| Modeling Dynamic Prosodic Variation for Speaker Verification | K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, Vol. 7, p. 3189 | November 1998 | Speech | |
| How Good Is the Crowd at "Real" WSD? | J. Hong and C. F. Baker | Proceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, Oregon | June 2011 | Speech | [PDF]
|
| Speech Recognition for Illiterate Access to Information and Technology | M. Plauché, N. Udhyakummar, C. Wooters, J. Pal, and D. Ramachadran | Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06), Berkeley, California, pp. 83-92 | May 2006 | Speech | [PDF]
|
| Long-Term Temporal Features for Conversational Speech Recognition | B. Chen, Q. Zhu, and N. Morgan | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Tandem Connectionist Feature Extraction for Conversational Speech Recognition | Q. Zhu, B. Chen, N. Morgan, and A.Stolcke | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech | |
| Digit Recognition with Stochastic Perceptual Models | N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Building Multiple Pronunication Models for Novel Words using Exploratory Computational Phonology | G. Tajchman, E. Fosler, and D. Jurafsky | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities in Connectionist Speech Recognition | H. Bourlard, Y. Konig, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Fast Speakers in Large Vocabulary Continuous Speech Recognition: Analysis & Antidotes | N. Mirghafori, E. Fosler, and N. Morgan | Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain | September 1995 | Speech | [PDF]
|
| Multimodal Indoor Localization: An Audio-Wireless-Based Approach | O. Vinyals, E. Martin, and G. Friedland | Proceedings of the Fourth IEEE International Conference on Semantic Computing (ICSC-2010), Pittsburgh, Pennsylvania, pp. 120-125 | September 2010 | Speech | [PDF]
|
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Stochastic Perceptual Speech Models with Durational Dependence | J. Bilmes, N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Insights Into Spoken Language Gleaned from Phonetic Transcriptions of the Switchboard Corpus | S. Greenberg, J. Hollenback, and D. Ellis | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| The Meeting Project at ICSI | N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke | Proceedings of the Human Language Technologies Conference, San Diego, California | March 2001 | Speech | [PDF]
|
| The ICSI Meeting Recorder Dialog Act (MRDA) Corpus | E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts | April 2004 | Speech | [PDF]
|
| Multi-Speaker Language Modeling | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference at the North American Chapter of the Association for Computational Linguistics, Boston, Massachusetts, pp. 133-136 | May 2004 | Speech | [PDF]
|
| Detection of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data | D. Hillard, M. Ostendorf, and E. Shriberg | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada | May 2003 | Speech | [PDF]
|
| Factored Language Models and Generalized Parallel Backoff | J. Bilmes and K. Kirchhoff | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, p. 1 | May 2003 | Speech | [PDF]
|
| Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures | I. Bulyko, M. Ostendorf, and A. Stolcke | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2003), Edmonton, Canada, Vol. 2, pp. 7-9 | May 2003 | Speech | [PDF]
|
| Backoff Model Training Using Partially Observed Data: Application to Dialog Act Tagging | G. Ji and J. Bilmes | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL 2006), New York City, New York, pp. 280-287 | June 2006 | Speech | [PDF]
|
| The ICSI Meeting Project: Resources and Research | A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede | Proceedings of the ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada | May 2004 | Speech | [PDF]
|
| Model Adaptation for Sentence Segmentation from Speech | S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105 | December 2006 | Speech | [PDF]
|
| Let's DISCOH: Collecting an Annotated Open Corpus with Dialogue Acts and Reward Signals for Natural Language Helpdesks | G. Andeani, D. Di Fabbrizio, M. Gilbert, D. Gillick, D. Hakkani-Tur, and O. Lemon | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 218-221 | December 2006 | Speech | [PDF]
|
| Impact of Automatic Comma Prediction on POS/Name Tagging of Speech | D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61 | December 2006 | Speech | [PDF]
|
| Model Adaptation for Dialog Act Tagging | G. Tur, U. Guz, and D. Hakkani-Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 94-97 | December 2006 | Speech | [PDF]
|
| Fast Speaker Diarization Using a High-Level Scripting Language | E. Gonina, G. Friedland, H. Cook, and K. Keutzer | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), Big Island, Hawaii | December 2011 | Speech | [PDF]
|
| Recognition of Speech in Additive and Convolutional Noise Based on RASTA Spectral Processing | H. Hermansky, N. Morgan, and H.G. Hirsch | Proceedings of the IEEE Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. II-83-86 | 1993 | Speech | |
| Continuous Speech Recognition Using Multilayer Perceptrons with Hidden Markov Models | H. Bourlard and N. Morgan | Proceedings of the IEEE International Conference of Acoustics, Speech & Signal Processing (ICASSP 1990), Albuquerque, New Mexico | 1990 | Speech | |
| User Verification: Matching the Uploaders of Videos Across Accounts | H. Lei, J. Choi, A. Janin, and G. Friedland | Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407 | May 2011 | Speech | [PDF]
|
| Connectionist Probability Estimation in the Decipher Speech Recognition System | S. Renals, N. Morgan, M. Cohen H. Bourlard, and H. Franco | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP 1992), pp. I-601-604 | 1992 | Speech | [PDF]
|
| Supervised and Unsupervised Clustering of the Speaker Space for Connectionist Speech Recognition | Y. Konig and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech & Signal Processing, Minneapolis, Minnesota, pp. I-545-548 | 1993 | Speech | |
| Stochastic Perceptual Models of Speech | N. Morgan, H. Bourlard, S. Greenberg, H. Hermansky, and S.L. Wu. | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Using A Stochastic Context-Free Grammar as a Language Model for Speech Recognition | D. Jurafsky, C. Wooters, J. Segal, A. Stolcke, E. Fosler, G. Tajchman, and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 95), Detroit, Michigan | May 1995 | Speech | [PDF]
|
| Multiband Audio Modeling for Single-Channel Acoustic Source Separation | M.J. Reyes-Gomez, D. Ellis, and N. Jojic | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), Montreal, Canada, Vol.5, pp. 641-644 | May 2004 | Speech | [PDF]
|
| Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
|
| Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional Modeling | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472 | May 1998 | Speech | [PDF]
|
| Transmissions and Transitions: A Study of Two Common Assumptions in Multi-Band ASR | N. Mirghafori and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 713-716 | 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Combining Multiple Estimators of Speaking Rate | N. Morgan and E. Fosler-Lussier | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 729-732 | May 1998 | Speech | [PDF]
|
| Temporal Patterns (TRAPS) in ASR of Noisy Speech | H. Hermansky and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |