| EEG Signal Compression Based on Classified Signature and Envelope Vector Sets | H. Gurkan, U. Guz, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain, pp. 420-423 | August 2007 | Speech | |
| Selecting On-topic Sentences from Natural Language Corpora | M. Levit, E. Boschee, and M. Freedman | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2793-2796 | August 2007 | Speech | |
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679 | June 2007 | Speech | |
| A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization | Y. Huang, O. Vinyals, G. Friedland, C. Müller, N. Mirghafori, and C. Wooters | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 693-698 | December 2007 | Speech | [PDF]
|
| Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition | M. Holmberg, D. Gelbart, and W. Hemmert | Speech Communication, Vol. 49, Issue 12, pp. 917-932 | December 2007 | Speech | |
| Corrected Tandem Features for Acoustic Model Training | A. Faria and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740 | April 2008 | Speech | [PDF]
|
| When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem Features | A. Faria and N. Morgan | Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577 | March 2008 | Speech | [PDF]
|
| Building a Highly Accurate Mandarin Speech Recognizer | M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495 | December 2007 | Speech | [PDF]
|
| Detecting Categories in News Video Using Acoustic, Speech, and Image Features | S. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. Malik | Presented at the NIST TREC Video Retrieval Workshop, Gaithersburg, Maryland | November 2006 | Speech | [PDF]
|
| Accent Classification for Speech Recognition | A. Faria | Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293 | July 2005 | Speech | [PDF]
|
| Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies | H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 2197-2200 | April 2008 | Speech | [PDF]
|
| Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings | K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356 | April 2008 | Speech | [PDF]
|
| An Iterative Unsupervised Learning Method for Information Distillation | K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952 | April 2008 | Speech | [PDF]
|
| Punctuating Speech For Information Extraction | B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016 | April 2008 | Speech | [PDF]
|
| Name-Aware Speech Recognition for Interactive Question Answering | S. Stoyanchev, G. Tur, and D. Hakkani-Tür | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116 | April 2008 | Speech | [PDF]
|
| System Combination Using Auxiliary Information for Speaker Verification | L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4853-4856 | April 2008 | Speech | [PDF]
|
| Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification | F. Yang, G. Tur, and E. Shriberg | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944 | April 2008 | Speech | [PDF]
|
| Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages | M. Creutz, T. Hirsimäki, M. Kurimo, A. Puurula, J. Pylkkönen, V. Siivola, M. Varjokallio, E. Arisoy, M. Saraclar, and A. Stolcke | ACM Transactions on Speech and Language Processing, Vol. 5, Issue 1, pp. 1-29 | December 2007 | Speech | [PDF]
|
| Nonparametric Feature Normalization for SVM-Based Speaker Verification | A. Stolcke, S. Kajarekar, and L. Ferrer | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580 | April 2008 | Speech | [PDF]
|
| Educational Multimedia | G. Friedland, L. Knipping, and W. Huerst (guest editors) | Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008 | July 2008 | Speech | [PDF]
|
| Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough | G. Friedland, L. Knipping, and W. Huerst | IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82 | April 2008 | Speech | [PDF]
|
| Multimedia Technologies for E-Learning 2007 | G. Friedland, L. Knipping, and N. Ludwig (eds.) | Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4 | November 2007 | Speech | |
| Anthropocentric Video Segmentation for Lecture Webcasts | G. Friedland and R. Rojas | EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9 | January 2008 | Speech | [PDF]
|
| Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices | A. Lüning, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375 | December 2007 | Speech | |
| Packing the Meeting Summarization Knapsack | K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2434-2437 | September 2008 | Speech | [PDF]
|
| Speech-Overlapped Acoustic Event Detection for Automotive Applications | C. Müller, J. I. Biel, E. Kim, and D. Rosario | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2590-2593 | September 2008 | Speech | [PDF]
|
| Multi-Stream Spectro-Temporal Features for Robust Speech Recognition | S. Y. Zhao and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 898-901 | September 2008 | Speech | [PDF]
|
| Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech Authors | K. Boakye, O. Vinyals, and G. Friedland | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 32-35 | September 2008 | Speech | |
| Getting the Last Laugh: Automatic Laughter Segmentation in Meetings | M. Knox, N. Morgan, and N. Mirghafori | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 797-800 | September 2008 | Speech | [PDF]
|
| Development of the SRI/Nightingale Arabic ASR system | D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan | Proceedings of the Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 1437-1440 | September 2008 | Speech | |
| Cross-Lingual Sentence Extraction for Information Distillation | A. Singla and D. Hakkani-Tur | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 2707-2710 | September 2008 | Speech | [PDF]
|
| ICSI System Description for SRE2008 Submission | H. Lei and D.V. Leeuwen | Speaker Recognition Evaluation 2008, National Institute of Standards and Technology | 2008 | Speech | [PDF]
|
| The Value of Auditory Offset Adaptation and Appropriate Acoustic Modeling | H. Wang, D. Gelbart, H.G. Hirsch, and W. Hemmert | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 902-905 | September 2008 | Speech | [PDF]
|
| Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis | N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli | Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696. | October 2008 | Speech | [PDF]
|
| Unsupervised Learning of Edit Parameters for Matching Name Variants | D. Gillick, D. Hakkani-Tur, and M. Levit. | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia, pp. 467-470 | September 2008 | Speech | [PDF]
|
| Automatic Laughter Segmentation | M. T. Knox | Master's report | May 2008 | Speech | [PDF]
|
| Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings | O. Vinyals and G. Friedland | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, pp. 426-431 | August 2008 | Speech | [PDF]
|
| Appscio: A Software Environment for Semantic Multimedia Analysis | G. Friedland, E. Hensley, J. Schumacher, and R. Jain | Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459 | August 2008 | Speech | [PDF]
|
| Live Speaker Identification in Conversations | G. Friedland and O. Vinyals | Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1017-1018 | October 2008 | Speech | [PDF]
|
| Towards Audio-Visual On-Line Diarization of Participants in Group Meetings | H. Hung and G. Friedland | Proceedings of European Conference on Computer Vision (ECCV), Marseille, France | October 2008 | Speech | [PDF]
|
| A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy | O. Vinyals and G. Friedland | Proceedings of the 10th IEEE International Symposium on Multimedia, Berkeley, California, pp. 61-65 | December 2008 | Speech | [PDF]
|
| Can We Escape the Trough of Disillusionment?--A Perspective on E-learning Technology Research from the ACM Workshop on Educational Multimedia and Multimedia Education | G. Friedland, L. Knipping, W. Huerst, and M. Muhlhauser | ACM E-Learn Journal | February 2009 | Speech | |
| Automated Lecture Recording | G. Friedland, L. Knipping, and W. Huerst | Encyclopedia of Multimedia, B. Furht, ed., Springer | October 2008 | Speech | |
| Speaker Recognition and Diarization | G. Friedland and D. van Leeuwen | In Semantic Computing, P. Sheu, H. Yu, C. V. Ramamamoorthy, A. K. Joshi, and L. A. Zadeh, eds., pp. 115-130, IEEE Press/Wiley | 2010 | Speech | |
| Comparisons of Recent Speaker Recognition Approaches Based on Word Conditioning | H. Lei and N. Mirghafori | Proceedings of Odyssey 2008, Stellenbosch, South Africa | January 2008 | Speech | [PDF]
|
| Applications of Keyword-Constraining in Speaker Recognition | H. Lei | MS Thesis, University of California-Berkeley | July 2007 | Speech | [PDF]
|
| A Keyphrase Based Approach to Interactive Meeting Summarization | K. Riedhammer, B. Favre, and D. Hakkani-Tur | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 153-156 | December 2008 | Speech | [PDF]
|
| Prosodic and Other Long-Term Features for Speaker Diarization | G. Friedland, O. Vinyals, Y. Huang, and C. Müller | IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 5, pp. 985-993 | July 2009 | Speech | [PDF]
|
| Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features | G. Friedland, H. Hung, and C. Yeo | ICSI Technical Report TR-08-007, October 2008 | October 2008 | Speech | [PDF]
|
| Efficient Sentence Segmentation Using Syntactic Features | B. Favre, D. Hakkani-Tur, S. Petrov, and D. Klein | Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 77-80 | December 2008 | Speech | [PDF]
|