| Speech and Audio Signal Processing | B. Gold and N. Morgan | Wiley Press, New York | 1999 | Speech | |
| Temporal Patterns (TRAPS) in ASR of Noisy Speech | H. Hermansky and S. Sharma | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Temporal Signal Processing for ASR | N. Morgan | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 9-16 | 1999 | Speech | |
| Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information | H.H. Yang, S. van Vuuren, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona | March 1999 | Speech | |
| Speech Recognition with Dynamic Bayesian Networks | G. Zweig | Ph.D Dissertation, University of California at Berkeley, Spring 1998 | 1998 | Speech | [PDF]
|
| Speech Intelligibility in the Presence of Cross-Channel Spectral Asynchrony | T. Arai and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936 | May 1998 | Speech | [PDF]
|
| Data-Driven Extensions to HMM Statistical Dependencies | J. Bilmes | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 69-72 | November 1998 | Speech | [PDF]
|
| Maximum Mutual Information Based Reduction Strategies for Cross-Correlation Based Joint Distributional Modeling | J. Bilmes | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 469-472 | May 1998 | Speech | [PDF]
|
| Midlevel Representations for Computational Auditory Scene Analysis: The Weft Element | D. Ellis and D. Rosenthal | Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272 | 1998 | Speech | |
| Effects of Speaking Rate and Word Predictability on Conversational Pronunciations | E. Fosler-Lussier and N. Morgan | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands | May 1998 | Speech | [PDF]
|
| Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
|
| Speaking in Shorthand - A Syllable-Centric Perspective for Understanding Pronunciation Variation | S. Greenberg | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kekrade, Netherlands, pp. 47-56 | 1998 | Speech | [PDF]
|
| Speech Intelligibility is Highly Tolerant of Cross-Channel Spectral Asynchrony | S. Greenberg and T. Arai | Proceedings of the Joint Meeting of the 137th Acoustical Society of America and the 16th International Congress on Acoustics (ICA/ASA), Seattle, Washington, pp. 2677-2678 | June 1998 | Speech | [PDF]
|
| Speech Intelligibility Derived From Exceedingly Sparse Spectral Information | S. Greenberg, T. Arai, and R. Silipo | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 74-77 | November 1998 | Speech | [PDF]
|
| Robust Speech Recognition Using the Modulation Spectrogram | B. Kingsbury, N. Morgan, and S. Greenberg | Speech Communication, Vol. 25, pp. 117-132 | 1998 | Speech | |
| Perceptually-Inspired Signal Processing Strategies for Robust Speech Recognition in Reverberant Environments | B. Kingsbury | Ph.D Dissertation, University of California at Berkeley | December 1998 | Speech | [PDF]
|
| A Multi-Band Approach to Automatic Speech Recognition | N. Mirghafori | Ph.D Dissertation, University of California at Berkeley, December 1998. Also ICSI Technical Report, TR-99-004, January 1999 | December 1998 | Speech | [PDF]
|
| Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers | N. Mirghafori and N. Morgan | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP '98), Sydney, Australia, pp. 743-746. | 1998 | Speech | [PDF]
|
| Transmissions and Transitions: A Study of Two Common Assumptions in Multi-Band ASR | N. Mirghafori and N. Morgan | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 713-716 | 1998 | Speech | [PDF]
|
| Combining Multiple Estimators of Speaking Rate | N. Morgan and E. Fosler-Lussier | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 729-732 | May 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu | Ph.D. Thesis, University of California at Berkeley, Spring 1998. Also ICSI Technical Report TR-98-014 | 1998 | Speech | [PDF]
|
| Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 721-724 | May 1998 | Speech | [PDF]
|
| Performance Improvements Through Combining Phone- and Syllable-Length Information in Automatic Speech Recognition | S.L. Wu, B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, pp. 854-857 | November 1998 | Speech | [PDF]
|
| Training Neural Networks with SPERT-II | K. Asanovic, J. Beck, D. Johnson, B. Kingsbury, N. Morgan, and J. Wawrzynek | Chapter in Parallel Architectures for Artificial Networks - Paradigms and Implementations, eds. N. Sundararajan and P. Saratchandran, IEEE Computer Society Press, pp. 345-364 | 1998 | Speech | |
| Connectionist Techniques for Speech Recognition | H. Bourlard and N. Morgan | Article in the Survey on the State of the Art in Human Language Technology, ed. R. Cole, Cambridge University Press, pp. 356-361 | 1998 | Speech | |
| Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions | H. Bourlard and N. Morgan | Adaptive Processing of Sequences and Data Structures, C.L. Giles and M. Gori (Eds.), pp. 389-417, Lecture Notes in Artificial Intelligence (1387), Springer | 1998 | Speech | |
| Spectral Basis Functions from Discriminant Analysis | H. Hermansky and N. Malayath | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia | November 1998 | Speech | |
| Modeling Dynamic Prosodic Variation for Speaker Verification | K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub | Proceedings of the Fifth International Conference on Spoken Language Processing (ICSLP'98), Sydney, Australia, Vol. 7, p. 3189 | November 1998 | Speech | |
| The Temporal Properties of Spoken Japanese Are Similar to Those of English | T. Arai and S. Greenberg | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 2, pp. 1011-1014 | September 1997 | Speech | [PDF]
|
| Speech Recognition Using On-line Estimation of Speaking Rate | N. Morgan, E. Fosler, and N. Mirghafori | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, Vol. 4, pp. 2079-2082 | September 1997 | Speech | [PDF]
|
| On the Origins of Speech Intelligibility in the Real World | S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 23-32 | 1997 | Speech | [PDF]
|
| Robust Features and Environmental Compensation: A Few Comments | N. Morgan | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 43-44 | 1997 | Speech | [PDF]
|
| Improving ASR Performance for Reverberant Speech | B. Kingsbury, N. Morgan, and S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 87-90 | 1997 | Speech | [PDF]
|
| A Space-Time Theory of Pitch and Timbre Based on Cortical Expansion of the Cochlea Traveling Wave Delay | S. Greenberg, D. Poeppel, and T. Roberts | Proceedings of the 11th International Symposium on Hearing, Grantham, United Kingdom | August 1997 | Speech | [PDF]
|
| The Modulation Spectrogram: In Pursuit of an Invariant Representation of Speech | S. Greenberg and B. Kingsbury | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 3, pp. 1647-1650 | April 1997 | Speech | [PDF]
|
| Recognizing Reverberant Speech With RASTA-PLP | B. Kingsbury and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1259-1262 | April 1997 | Speech | [PDF]
|
| Integrating Syllable Boundary Information Into Speech Recognition | S.L. Wu, M. Shire, S. Greenberg, and N. Morgan | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 987-990 | April 1997 | Speech | [PDF]
|
| The Weft: A Representation for Periodic Sounds | D. Ellis | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 2, pp. 1307-1310 | April 1997 | Speech | [PDF]
|
| Computational Auditory Scene Analysis Exploiting Speech-Recognition Knowledge | D. Ellis | Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4 | October 1997 | Speech | [PDF]
|
| Joint Distributional Modeling with Cross-Correlation Based Features | J. Bilmes | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings (ASRU-97), Santa Barbara, California, pp.148-155 | 1997 | Speech | [PDF]
|
| Multiresolution Channel Normalization for ASR in Reverberant Environments | C. Avendano, S. Tibrewala, and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems | L. Hennebert, C. Ris, H. Bourlard, S Renals, and N. Morgan | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece, pp. 1951-1954 | September 1997 | Speech | |
| Should Recognizers Have Ears? | H. Hermansky | Proceedings of ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, pp. 1-10 | April 1997 | Speech | |
| Switchboard-DAMSL Labeling Project Coder's Manual | D. Jurafsky, E. Shriberg, and D. Biasca | Technical Report 97-02, University of Colorado, Institute of Cognitive Science, Boulder, Colorado | 1997 | Speech | [PDF]
|
| Data-Driven Design of RASTA-like Filters | S. van Vuuren and H. Hermansky | Proceedings of the Fifth European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece | September 1997 | Speech | |
| Towards Robustness to Fast Speech in ASR | N. Mirghafori, E. Fosler, and N. Morgan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Atlanta, Georgia | 1996 | Speech | [PDF]
|
| REMAP - Experiments with Speech Recognition | Y. Konig, H. Bourlard, and N. Morgan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Atlanta, Georgia | May 1996 | Speech | [PDF]
|
| On Reversing the Generation Process in Optimality Theory | E. Fosler | Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL-96), Santa Cruz, California | 1996 | Speech | [PDF]
|
| Automatic Learning of Word Pronunciation from Data | E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles, and M. Saraclar | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|
| Stochastic Perceptual Speech Models with Durational Dependence | J. Bilmes, N. Morgan, S.L. Wu, and H. Bourlard | Proceedings of the Fourth International Conference on Spoken Language Processing (CSLP-96), Philadelphia, Pennsylvania | 1996 | Speech | [PDF]
|