| The Blame Game: Performance Analysis of Speaker Diarization System Components | M. Huijbregts and C. Wooters | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1857-1860 | August 2007 | Speech | |
| Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections | M. Huijbregts, C. Wooters, and R. Ordelman | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2925-2928 | August 2007 | Speech | |
| Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings | J. Kolar, E. Shriberg, and Y. Liu | Proceedings of 9th International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic, pp. 629-636 | September 2006 | Speech | [PDF]
|
| Using Audio and Video Features to Classify the Most Dominant Person in Meetings | H. Hung, D. Jayagopi, C. Yeo, G. Friedland, S. Ba, J-M. Odobez, K. Ramchandran, N. Mirghafori, and D. Gatica-Perez | Proceedings of ACM Multimedia 2007, Augsburg, Germany, pp. 835-838 | September 2007 | Speech | |
| Current Research in Acoustically Robust Speech Recognition | N. Morgan | Proceedings of American Voice Input/Output Society (AVIOS), pp. 207-214 | September 1994 | Speech | |
| Multispeaker Speech Activity Detection for the ICSI Meeting Recorder | T. Pfau, D. Ellis, and A. Stolcke | Proceedings of Automatic Speech Recognition and Understanding Workshop (ASRU 2001),
Madonna di Campiglio, Italy, pp. 107-110 | December 2001 | Speech | [PDF]
|
| Sampling Alignment Structure Under a Bayesian Translation Model | J. DeNero, A. Bouchard-Côté, and D. Klein | Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323 | October 2008 | Speech | [PDF]
|
| Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech | Y. Liu, A. Stolcke, E. Shriberg, and M. Harper | Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona | July 2004 | Speech | [PDF]
|
| Far-Field ASR on Inexpensive Microphones | L. Docio, D. Gelbart, and N. Morgan | Proceedings of Eighth European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2141-2144 | September 2003 | Speech | [PDF]
|
| Should Recognizers Have Ears? | H. Hermansky | Proceedings of ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, pp. 1-10 | April 1997 | Speech | |
| Towards Audio-Visual On-Line Diarization of Participants in Group Meetings | H. Hung and G. Friedland | Proceedings of European Conference on Computer Vision (ECCV), Marseille, France | October 2008 | Speech | [PDF]
|
| Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers | B. Chen, S. Chang, and S. Sivadas | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Automatic Disfluency Identification in Conversational Speech Using Multiple Knowledge Sources | Y. Liu, E. Shriberg, and A. Stolcke | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Feature Transformations and Combinations for Improving ASR Performance | P. Somervuo, B. Chen, and Q. Zhu | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
|
| Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System | C. Wooters, J. Fung, B. Peskin, and X. Anguera | Proceedings of Fall 2004 Rich Transcription Workshop (RT-04F), Nov. 2004 | November 2004 | Speech | [PDF]
|
| Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech | S. Cuendet, D. Hakkani-Tur, and E. Shriberg | Proceedings of Fourth International Conference on Machine Learning and Multimodal Interaction, Brno, Czech Republic, pp. 144-155 | June 2007 | Speech | [PDF]
|
| Improving Automatic Sentence Boundary Detection with Confusion Networks | D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg | Proceedings of HLT-NAACL Conference, Boston | April 2004 | Speech | [PDF]
|
| Word Fragments Identification Using Acoustic-Prosodic Features in Conversational Speech | Y. Liu | Proceedings of HLT/NAACL, Student Session, Edmonton, Alberta | 2003 | Speech | |
| The ICSI Meeting Corpus | A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Meetings About Meetings: Research at ICSI on Speech in Multiparty Conversations | N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Using Prosodic and Conversational Features for High-Performance Speaker Recognition: Report From JHU WS'02. | B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| The SuperSID Project: Exploiting High-Level Information for High-Accuracy Speaker Recognition | D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Experiments with Linear and Nonlinear Feature Transformations in HMM Based Phone Recognition | P. Somervuo | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
|
| Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification System | M. Hebert and N. Mirghafori | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System | N. Mirghafori and M. Hebert | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| TRAPping Conversational Speech: Extending TRAP/Tandem Approaches to Conversational Telephone Speech Recognition | N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke | Proceedings of IEEE ICASSP, Montreal | May 2004 | Speech | [PDF]
|
| Integrating RASTA-PLP into Speech Recognition | J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch, and G. Tong | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, pp. I-421-424 | 1994 | Speech | |
| RASTA-PLP Speech Analysis Technique | H. Hermansky, N. Morgan, A. Bayya, and P. Kohn | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. I-121-124 | 1992 | Speech | |
| CDNN: A Context Dependent Neural Network for Continuous Speech Recognition | H. Bourlard, N. Morgan, C. Wooters, and S. Renals | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, San Francisco, California, pp. II-349-352 | 1992 | Speech | |
| Continuous Speech Recognition Using PLP Analysis with Multilayer Perceptrons | N. Morgan, H. Hermansky, H. Bourlard, P. Kohn, and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech & Signal Processing, Toronto, Canada, pp. 49-52 | 1991 | Speech | |
| Automatic Dialog Act Segmentation and Classification in Multiparty Meetings | J. Ang, Y. Liu, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064 | March 2005 | Speech | [PDF]
|
| Improved Phonetic Speaker Recognition Using Lattice Decoding | A. O. Hatch, B. Peskin, and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172 | March 2005 | Speech | [PDF]
|
| Clap Detection and Discrimination for Rhythm Therapy | N. Lesser and D.P.W. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 37-40 | March 2005 | Speech | [PDF]
|
| Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales | O. Cetin and M. Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668 | March 2005 | Speech | |
| Speaker Detection Without Models | D. Gillick, S. Stafford, and B. Peskin | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760 | March 2005 | Speech | [PDF]
|
| Tonotopic Multi-Layered Perceptron: A Neural Network for Learning | B. Y. Chen, Q. Zhu, and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 945-948 | March 2005 | Speech | [PDF]
|
| Structural Metadata Research in the EARS Program | Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, and M. Harper | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 957-960 | March 2005 | Speech | [PDF]
|
| Dialog Act Tagging Using Graphical Models | G. Ji and J. Bilmes | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, Vol. 1, pp. 33-36 | March 2005 | Speech | [PDF]
|
| Purity Algorithms for Speaker Diarization of Meetings Data | X. Anguera, C. Wooters and J. Hernando | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France | May 2006 | Speech | [PDF]
|
| Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization | N. Mirghafori and C. Wooters | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 1017-1020 | May 2006 | Speech | [PDF]
|
| Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons | A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 321-324 | May 2006 | Speech | [PDF]
|
| Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap | O. Cetin and E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 357-360 | May 2006 | Speech | [PDF]
|
| Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition | A. O. Hatch and A. Stolcke | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 585-588 | May 2006 | Speech | [PDF]
|
| Reranking for Sentence Boundary Detection in Conversational Speech | B. Roark, Y. Liu, M. Harper, R. Stewart, M. Lease, M. Snover, Z. Shafran, B. Dorr, J. Hale, A. Krasnyanskaya, and L. Young | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 545-548 | May 2006 | Speech | |
| Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings | M. Zimmermann, A. Stolcke, E.E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 581-584 | May 2006 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings | K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356 | April 2008 | Speech | [PDF]
|
| Temporal Masking for Bit-Rate Reduction in Audio Codec based on Frequency Domain Linear Prediction | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4781-4784 | April 2008 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| Using Corpus and Knowledge-Based Similarity Measure in Maximum Marginal Relevance for Meeting Summarization | S. Xie and Y. Liu | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4985-4988 | March 2008 | Speech | [PDF]
|