| Multimedia Education—Can We Find Unity in Diversity? | G. Friedland, W. Hürst, and L. Knipping | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1115-1116 | October 2008 | Speech | [PDF]
|
| Personalized, Interactive Tag Recommendation for Flickr | N. Garg and I. Weber | Proceedings of the Second ACM International Conference on Recommender Systems (RecSys 2008), Lausanne, Switzerland, pp. 67-74 | October 2008 | Speech | [PDF]
|
| Packing the Meeting Summarization Knapsack | K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437 | September 2008 | Speech | [PDF]
|
| Speech-Overlapped Acoustic Event Detection for Automotive Applications | C. Müller, J. I. Biel, E. Kim, and D. Rosario | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2590-2593 | September 2008 | Speech | [PDF]
|
| Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]
|
| Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech Authors | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35 | September 2008 | Speech | |
| Getting the Last Laugh: Automatic Laughter Segmentation in Meetings | M. Knox, N. Morgan, and N. Mirghafori | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800 | September 2008 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| Cross-Lingual Sentence Extraction for Information Distillation | A. Singla and D. Hakkani-Tur | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710 | September 2008 | Speech | [PDF]
|
| The Value of Auditory Offset Adaptation and Appropriate Acoustic Modeling | H. Wang, D. Gelbart, H.G. Hirsch, and W. Hemmert | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 902-905 | September 2008 | Speech | [PDF]
|
| Unsupervised Learning of Edit Parameters for Matching Name Variants | D. Gillick, D. Hakkani-Tur, and M. Levit. | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 467-470 | September 2008 | Speech | [PDF]
|
| Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC) | G. Friedland and C. Martell, eds. | International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3 | September 2008 | Speech | |
| Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
|
| Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification | E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612 | September 2008 | Speech | [PDF]
|
| The Case for Automatic Higher-Level Features in Forensic Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1509-1512 | September 2008 | Speech | [PDF]
|
| Source Separation Based on Binaural Cues and Source Model Constraints | R. Weiss, M. Mandel, and D. Ellis | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422 | September 2008 | Speech | [PDF]
|
| Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of the 9th Annual Conference of the International Speech Communication
Association (Interspeech 2008), Brisbane, Australia | September 2008 | Speech | |
| Modulation Spectrogram Features for Speaker Diarization | O. Vinyals and G. Friedland | Proceedings of the 9th Annual Conference of the International Speech Communication
Association (Interspeech 2008), Brisbane, Australia, pp. 630-633 | September 2008 | Speech | |
| Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings | O. Vinyals and G. Friedland | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, pp. 426-431 | August 2008 | Speech | [PDF]
|
| Appscio: A Software Environment for Semantic Multimedia Analysis | G. Friedland, E. Hensley, J. Schumacher, and R. Jain | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459 | August 2008 | Speech | [PDF]
|
| Educational Multimedia | G. Friedland, L. Knipping, and W. Huerst (guest editors) | Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008 | July 2008 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Speech Segmentation and Spoken Document Processing | M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters | IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69 | May 2008 | Speech | [PDF]
|
| Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech | |
| Corrected Tandem Features for Acoustic Model Training | A. Faria and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740 | April 2008 | Speech | [PDF]
|
| Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 2197-2200 | April 2008 | Speech | [PDF]
|
| Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings | K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356 | April 2008 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Name-Aware Speech Recognition for Interactive Question Answering | S. Stoyanchev, G. Tur, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116 | April 2008 | Speech | [PDF]
|
| System Combination Using Auxiliary Information for Speaker Verification | L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4853-4856 | April 2008 | Speech | [PDF]
|
| Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification | F. Yang, G. Tur, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944 | April 2008 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
|
| Detecting Music in Ambient Audio by Long-Window Autocorrelation | K. Lee and D. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12 | April 2008 | Speech | [PDF]
|
| Temporal Masking for Bit-Rate Reduction in Audio Codec based on Frequency Domain Linear Prediction | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4781-4784 | April 2008 | Speech | [PDF]
|
| When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem Features | A. Faria and N. Morgan | Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577 | March 2008 | Speech | [PDF]
|
| Using Corpus and Knowledge-Based Similarity Measure in Maximum Marginal Relevance for Meeting Summarization | S. Xie and Y. Liu | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4985-4988 | March 2008 | Speech | [PDF]
|
| Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
|
| Comparisons of Recent Speaker Recognition Approaches Based on Word Conditioning | H. Lei and N. Mirghafori | Proceedings of Odyssey 2008, Stellenbosch, South Africa | January 2008 | Speech | [PDF]
|
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization | Y. Huang, O. Vinyals, G. Friedland, C. Müller, N. Mirghafori, and C. Wooters | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 693-698 | December 2007 | Speech | [PDF]
|
| Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition | M. Holmberg, D. Gelbart, and W. Hemmert | Speech Communication, Vol. 49, Issue 12, pp. 917-932 | December 2007 | Speech | |
| Building a Highly Accurate Mandarin Speech Recognizer | M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495 | December 2007 | Speech | [PDF]
|
| Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages | M. Creutz, T. Hirsimäki, M. Kurimo, A. Puurula, J. Pylkkönen, V. Siivola, M. Varjokallio, E. Arisoy, M. Saraclar, and A. Stolcke | ACM Transactions on Speech and Language Processing, Vol. 5, Issue 1, pp. 1-29 | December 2007 | Speech | [PDF]
|
| Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices | A. Lüning, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375 | December 2007 | Speech | |
| Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech | |
| Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
|
| Acoustic Beamforming for Speaker Diarization of Meetings | X. Anguera, C. Wooters, and J. Hernando | IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 2011-2022 | September 2007 | Speech | |
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|