Publication Details

Title: Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features
Author: G. Friedland, H. Hung, and C. Yeo
Group: Speech
Date: April 2009
PDF: http://www.icsi.berkeley.edu/pubs/speech/multimodalspeaker09.pdf

Overview:
Copyright 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish any copyrighted component of this material for advertising or promotional purposes; in new collective works for resale or redistribution to servers or lists; or in other works must be obtained from the IEEE. Contact Manager, Copyrights and Permissions, IEEE Service Center, 445 Hoes Lane, P.O. Box 1331, Piscataway, NJ 08855-1331, ph. 908-562-3966.

Acknowledgements:
This work was supported by funding provided by the European Integrated Project on Augmented Multiparty Interaction with Distance Access (AMIDA) and by the Swiss National Science Foundation (SNSF) via the Swiss National Center of Competence in Research on Interactive Multimodal Information Management (IM2).

Bibliographic Information:
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072

Bibliographic Reference:
G. Friedland, H. Hung, and C. Yeo. Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072, April 2009