| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Name-Aware Speech Recognition for Interactive Question Answering | S. Stoyanchev, G. Tur, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116 | April 2008 | Speech | [PDF]
|
| System Combination Using Auxiliary Information for Speaker Verification | L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4853-4856 | April 2008 | Speech | [PDF]
|
| Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification | F. Yang, G. Tur, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944 | April 2008 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
|
| Detecting Music in Ambient Audio by Long-Window Autocorrelation | K. Lee and D. Ellis | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12 | April 2008 | Speech | [PDF]
|
| Temporal Masking for Bit-Rate Reduction in Audio Codec based on Frequency Domain Linear Prediction | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4781-4784 | April 2008 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Speech Segmentation and Spoken Document Processing | M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters | IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69 | May 2008 | Speech | [PDF]
|
| Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech | |
| Educational Multimedia | G. Friedland, L. Knipping, and W. Huerst (guest editors) | Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008 | July 2008 | Speech | [PDF]
|
| Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings | O. Vinyals and G. Friedland | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, pp. 426-431 | August 2008 | Speech | [PDF]
|
| Appscio: A Software Environment for Semantic Multimedia Analysis | G. Friedland, E. Hensley, J. Schumacher, and R. Jain | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459 | August 2008 | Speech | [PDF]
|
| Packing the Meeting Summarization Knapsack | K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437 | September 2008 | Speech | [PDF]
|
| Speech-Overlapped Acoustic Event Detection for Automotive Applications | C. Müller, J. I. Biel, E. Kim, and D. Rosario | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2590-2593 | September 2008 | Speech | [PDF]
|
| Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]
|
| Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech Authors | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35 | September 2008 | Speech | |
| Getting the Last Laugh: Automatic Laughter Segmentation in Meetings | M. Knox, N. Morgan, and N. Mirghafori | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800 | September 2008 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| Cross-Lingual Sentence Extraction for Information Distillation | A. Singla and D. Hakkani-Tur | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710 | September 2008 | Speech | [PDF]
|
| The Value of Auditory Offset Adaptation and Appropriate Acoustic Modeling | H. Wang, D. Gelbart, H.G. Hirsch, and W. Hemmert | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 902-905 | September 2008 | Speech | [PDF]
|
| Unsupervised Learning of Edit Parameters for Matching Name Variants | D. Gillick, D. Hakkani-Tur, and M. Levit. | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 467-470 | September 2008 | Speech | [PDF]
|
| Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC) | G. Friedland and C. Martell, eds. | International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3 | September 2008 | Speech | |
| Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
|
| Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification | E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612 | September 2008 | Speech | [PDF]
|
| The Case for Automatic Higher-Level Features in Forensic Speaker Recognition | E. Shriberg and A. Stolcke | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1509-1512 | September 2008 | Speech | [PDF]
|
| Source Separation Based on Binaural Cues and Source Model Constraints | R. Weiss, M. Mandel, and D. Ellis | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422 | September 2008 | Speech | [PDF]
|
| Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of the 9th Annual Conference of the International Speech Communication
Association (Interspeech 2008), Brisbane, Australia | September 2008 | Speech | |
| Modulation Spectrogram Features for Speaker Diarization | O. Vinyals and G. Friedland | Proceedings of the 9th Annual Conference of the International Speech Communication
Association (Interspeech 2008), Brisbane, Australia, pp. 630-633 | September 2008 | Speech | |
| Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis | N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli | Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696. | October 2008 | Speech | [PDF]
|
| Live Speaker Identification in Conversations | G. Friedland and O. Vinyals | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1017-1018 | October 2008 | Speech | [PDF]
|
| Towards Audio-Visual On-Line Diarization of Participants in Group Meetings | H. Hung and G. Friedland | Proceedings of European Conference on Computer Vision (ECCV), Marseille, France | October 2008 | Speech | [PDF]
|
| Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech | |
| Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]
|
| Sampling Alignment Structure Under a Bayesian Translation Model | J. DeNero, A. Bouchard-Côté, and D. Klein | Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323 | October 2008 | Speech | [PDF]
|
| Multimedia Education—Can We Find Unity in Diversity? | G. Friedland, W. Hürst, and L. Knipping | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1115-1116 | October 2008 | Speech | [PDF]
|
| Personalized, Interactive Tag Recommendation for Flickr | N. Garg and I. Weber | Proceedings of the Second ACM International Conference on Recommender Systems (RecSys 2008), Lausanne, Switzerland, pp. 67-74 | October 2008 | Speech | [PDF]
|
| The ICSI Summarization System at TAC 2008 | D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of Text Analysis Conference (TAC), Gaithersburg, Maryland | November 2008 | Speech | [PDF]
|
| Multimedia Information Extraction Roadmap | G. Myers, G. Tür, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, and D. Hakkani-Tür | Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, Virginia | November 2008 | Speech | [PDF]
|
| A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives | S. Yaman and C.-H. Lee | Journal of Signal Processing Systems, MLSP special issue | November 2008 | Speech | [PDF]
|
| A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy | O. Vinyals and G. Friedland | Proceedings of the 10th IEEE International Symposium on Multimedia, Berkeley, California, pp. 61-65 | December 2008 | Speech | [PDF]
|
| A Keyphrase Based Approach to Interactive Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 153-156 | December 2008 | Speech | [PDF]
|
| Efficient Sentence Segmentation Using Syntactic Features | B. Favre, D. Hakkani-Tur, S. Petrov, and D. Klein | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 77-80 | December 2008 | Speech | [PDF]
|
| The CALO Meeting Speech Recognition and Understanding System | G. Tur, A. Stolcke, L. Voss, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Hankkani-Tur, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, S. Peters, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 69-72 | December 2008 | Speech | [PDF]
|
| Ensemble Feature Selection for Multi-stream Automatic Speech Recognition | D. Gelbart | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Audio Segmentation for Meetings Speech Processing | K. A. Boakye | UC Berkeley dissertation | December 2008 | Speech | [PDF]
|
| Efficient Data Selection for Machine Translation | A. Mandal, D. Vergyri, W. Wang, J. Zheng, A. Stolcke, G. Tür, D. Hakkani-Tür, and N. Fazil Ayan | Proceedings of IEEE/ACL Workshop on Spoken Language Technologies (SLT), Goa, India, pp. 261-264 | December 2008 | Speech | [PDF]
|
| Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles | E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet | Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics | 2009 | Speech | [PDF]
|
| Can We Escape the Trough of Disillusionment?--A Perspective on E-learning Technology Research from the ACM Workshop on Educational Multimedia and Multimedia Education | G. Friedland, L. Knipping, W. Huerst, and M. Muhlhauser | ACM E-Learn Journal | February 2009 | Speech | |