| Robust Speaker Diarization for Meetings: ICSI TR06 Meetings Evaluation System | X. Anguera, C. Wooters, and J. Pardo | Lecture Notes in Computer Science, Volume 4299, 2006, pp. 346-358, ISSN 0302-9743 | 2006 | Speech | [PDF]
|
| Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings | X. Anguera, C. Wooters, J. Pardo, and J. Hernando | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 241-244 | April 2007 | Speech | [PDF]
|
| Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization | X. Anguera, T. Shinozaki, C. Wooters, and J. Hernando | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4 pp. 273-276 | April 2007 | Speech | [PDF]
|
| A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition | O. Cheng, J. Dines, and M. Magimai Doss | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 345-348 | April 2007 | Speech | [PDF]
|
| Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus | L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy | Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 759-762 | 2004 | Speech | [PDF]
|
| Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition | L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez | Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 233-236 | April 2007 | Speech | [PDF]
|
| Noise Robust Speaker Identification for Spontaneous Arabic Speech | M. Graciarena, S. Kajarekar, A. Stolcke, and E. Shriberg | Proceedings of the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 245-248 | April 2007 | Speech | [PDF]
|
| The ICSI RT07s Speaker Diarization System | C. Wooters and M. Huijbregts | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 509-519 | May 2007 | Speech | [PDF]
|
| The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System | A. Stolcke, X. Anguera, K. Boakye, O. Cetin, A. Janin, M. Magimai-Doss, C. Wooters, and J. Zheng | Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 450-463 | May 2007 | Speech | [PDF]
|
| Automatic Laughter Detection Using Neural Networks | M. Knox and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2973-2976 | August 2007 | Speech | [PDF]
|
| Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks | D. Hakkani-Tur, G. Tur, and M. Levit | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 330-333 | August 2007 | Speech | [PDF]
|
| Co-training Using Prosodic and Lexical Information for Sentence Segmentation | U. Guz, S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2597-2600 | August 2007 | Speech | [PDF]
|
| Detecting Deception Using Critical Segments | F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2281-2284 | August 2007 | Speech | [PDF]
|
| fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains | J. Zheng and A. Stolcke | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1573-1576 | August 2007 | Speech | [PDF]
|
| Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms | A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg | IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 1987-1998 | September 2007 | Speech | [PDF]
|
| Word-Conditioned HMM Supervectors for Speaker Recognition | H. Lei and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 746-749 | August 2007 | Speech | [PDF]
|
| Higher Level Features in Speaker Recognition | E. Shriberg | Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin | 2007 | Speech | |
| Prosodic Features and Feature Selection for Multi-lingual Sentence Segmentation | J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2585-2588 | August 2007 | Speech | [PDF]
|
| Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification | G. Tur, E. Shriberg, A. Stolcke, and S. Kajarekar | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech--Eurospeech 2008), Antwerp, Belgium, pp. 2049-2052 | August 2007 | Speech | [PDF]
|
| A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification | L. Ferrer, K. Sonmez, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 738-741 | August 2007 | Speech | [PDF]
|
| Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings | J. Kolar, Y. Liu, and E. Shriberg | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1621-1624 | August 2007 | Speech | [PDF]
|
| A Text-constrained Prosodic System for Speaker Verification | E. Shriberg and L. Ferrer | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1226-1229 | August 2007 | Speech | [PDF]
|
| Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age | C. Müller and F. Burkhardt | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2277-2280 | August 2007 | Speech | |
| The Digital Hand, Vol 2 - How Computers Changed the Work of the American Financial, Telecommunications, Media, and Entertainment Industries (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, Issue 3, IEEE Computer Society, California, pp. 72-75 | July 2007 | Speech | [PDF]
|
| Object Cut and Paste in Images and Videos | G. Friedland, K. Jantz, T. Lenz, F. Wiesel, and R. Rojas | International Journal of Semantic Computing, World Scientific, Vol. 1, Issue 2, pp. 221-247, USA | July 2007 | Speech | |
| Computers and Commerce: A Study of Technology and Management at Eckert-Mauchly Computer Company, Engineering Research Associates, and Remington Rand, 1946-1957 (book review) | G. Friedland | IEEE Annals of the History of Computing, Vol. 29, No. 2, IEEE Computer Society, California, pp. 74-77 | June 2007 | Speech | |
| Multimedia Technologies for E-learning | G. Friedland and L. Knipping (editors) | Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., United Kingdom | March 2007 | Speech | |
| Speaker Recognition Via Nonlinear Discriminant Features | L. Stoll, J. Frankel, and N. Mirghafori | Proceedings of the International Speech Communication Association Tutorial and Research Workshop on Non-Linear Speech Processing (NOLISP 2007), Paris, France, pp. 27-30 | May 2007 | Speech | [PDF]
|
| Phonetic- and Speaker-Discriminant Features for Speaker Recognition | L. Stoll | UC Berkeley Masters Thesis | December 2006 | Speech | [PDF]
|
| Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech | S. Cuendet, D. Hakkani-Tur, and E. Shriberg | Proceedings of Fourth International Conference on Machine Learning and Multimodal Interaction, Brno, Czech Republic, pp. 144-155 | June 2007 | Speech | [PDF]
|
| The Blame Game: Performance Analysis of Speaker Diarization System Components | M. Huijbregts and C. Wooters | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 1857-1860 | August 2007 | Speech | |
| Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections | M. Huijbregts, C. Wooters, and R. Ordelman | Proceedings of 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2925-2928 | August 2007 | Speech | |
| Acoustic Beamforming for Speaker Diarization of Meetings | X. Anguera, C. Wooters, and J. Hernando | IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, California, pp. 2011-2022 | September 2007 | Speech | |
| Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information | J. M. Pardo, X. Anguera, and C. Wooters | IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, California, pp. 1212-1224 | September 2007 | Speech | [PDF]
|
| Improving Speech Translation with Automatic Boundary Prediction | E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2449-2452 | August 2007 | Speech | [PDF]
|
| A New Algorithm for High Speed Speech and Audio Coding | U. Guz, H. Gurkan, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain | August 2007 | Speech | |
| An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings | S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür | Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59 | July 2007 | Speech | |
| Cross-Genre Feature Comparisons for Spoken Sentence Segmentation | S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, and B. Favre | Proceedings of International Conference on Semantic Computing, IEEE Computer Society, pp. 265-274, Irvine, California. Also published in International Journal of Semantic Computing, Volume 1, Issue 3, World Scientific, USA, pp. 335-346 | September 2007 | Speech | [PDF]
|
| Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future | G. Friedland | IEEE International Conference on Semantic Computing, Irvine, California | September 2007 | Speech | |
| Educational Multimedia Systems: The Past, the Present, and a Glimpse into the Future | G. Friedland, W. Huerst, and L. Knipping | Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 1-4 | September 2007 | Speech | |
| A Low-Cost Mobile Pointing and Drawing Device | K. Jantz, G. Friedland, L. Knipping, and R. Rojas | Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, Augsburg, Germany, pp. 121-122 | September 2007 | Speech | |
| Using Audio and Video Features to Classify the Most Dominant Person in Meetings | H. Hung, D. Jayagopi, C. Yeo, G. Friedland, S. Ba, J-M. Odobez, K. Ramchandran, N. Mirghafori, and D. Gatica-Perez | Proceedings of ACM Multimedia 2007, Augsburg, Germany, pp. 835-838 | September 2007 | Speech | |
| EEG Signal Compression Based on Classified Signature and Envelope Vector Sets | H. Gurkan, U. Guz, and B.S. Yarman | Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain, pp. 420-423 | August 2007 | Speech | |
| Selecting On-topic Sentences from Natural Language Corpora | M. Levit, E. Boschee, and M. Freedman | Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, pp. 2793-2796 | August 2007 | Speech | |
| Interpretation of Spatial Language in a Map Navigation Task | M. Levit and D. Roy | IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679 | June 2007 | Speech | |
| A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization | Y. Huang, O. Vinyals, G. Friedland, C. Müller, N. Mirghafori, and C. Wooters | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 693-698 | December 2007 | Speech | [PDF]
|
| Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition | M. Holmberg, D. Gelbart, and W. Hemmert | Speech Communication, Vol. 49, Issue 12, pp. 917-932 | December 2007 | Speech | |
| Corrected Tandem Features for Acoustic Model Training | A. Faria and N. Morgan | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740 | April 2008 | Speech | [PDF]
|
| When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem Features | A. Faria and N. Morgan | Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577 | March 2008 | Speech | [PDF]
|
| Building a Highly Accurate Mandarin Speech Recognizer | M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel | Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495 | December 2007 | Speech | [PDF]
|