Publication Search Results

TitleAuthorBibliographicDateGroupsort ascendingLinks
Parallelizing Speaker-Attributed Speech Recognition for Meeting BrowsingG. Friedland, J. Chong, and A. JaninProceedings of the 2010 IEEE International Symposium on Multimedia (ISM2010), Taiwan, pp. 121-128December 2010Speech[PDF]

Dialocalizaton: Acoustic Speaker Diarization and Visual Localization as Joint Optimization ProblemG. Friedland, C. Yeo, and H. HungACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 6, No. 4, Article 27November 2010Speech[PDF]

Review of C. Mueller-Tomfelder, "Tabletops - Horizontal Interactive Displays"G. FriedlandACM Computing Reviews, CR138453October 2010Speech[PDF]

The 2010 ICSI Video Location Estimation SystemJ. Choi, A. Janin, and G. FriedlandProceedings of the MediaEval 2010 Workshop, Pisa ItalyOctober 2010Speech[PDF]

Structured Approaches to Data Selection for Speaker RecognitionH. LeiUC Berkeley dissertationDecember 2010Speech[PDF]

Introduction to Multimedia ComputingG. Friedland and R. JainCambridge University Press 2011Speech
Special Section on New Frontiers in Rich TranscriptionG. Friedland, J. Fiscus, T. Hain, and S. Furui (eds)IEEE Transactions in Audio, Speech, and Language Processing, Vol. 20, No. 2February 2012Speech
Speaker Diarization: A Review of Recent ResearchX. Anguera, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. VinyalsIEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 356-370February 2012Speech[PDF]

Automated Information Extraction in ProductionR. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. SanoSpecial issue in Multimedia Tools and Applications, Springer 2011Speech
Computationally Efficient Clustering of Audio-Visual Meeting DataH. Hung, G. Friedland, and C. YeoIn Multimedia Interaction and Intelligent User Interfaces: Principles, Methods, and Applications, M. Etho, J. Luo, and L. Shao, eds., pp. 25-59 2010Speech
CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model ApplicationsH. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. FoxProceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, CaliforniaMay 2011Speech[PDF]

User Verification: Matching the Uploaders of Videos Across AccountsH. Lei, J. Choi, A. Janin, and G. FriedlandProceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407May 2011Speech[PDF]

Automatic Tagging and Geo-Tagging in Video Collections and CommunitiesM. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. JonesProceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy, April 2011April 2011Speech[PDF]

The SRI NIST 2010 Speaker Recognition Evaluation SystemN. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295May 2011Speech[PDF]

Speech and Audio Signal Processing: Processing and Perception of Speech and Music, 2nd EditionB. Gold, N. Morgan, and D. EllisWileyNovember 2011Speech
Deep and Wide: Multiple Layers in Automatic Speech RecognitionN. MorganIEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning 2011Speech[PDF]

The IBM 2009 GALE Arabic Speech Transcription SystemB. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675May 2011Speech[PDF]

Language-Independent Constrained Cepstral Features for Speaker RecognitionE. Shriberg and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299May 2011Speech[PDF]

Bird Species Recognition Combining Acoustic and Sequence ModelingM. Graciarena, M. Delplanche, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344May 2011Speech[PDF]

Making the Most from Multiple Microphones in Meeting RecognitionA. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995May 2011Speech[PDF]

Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation TransformsA. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, and E. ShribergProceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 256-262June 2010Speech[PDF]

The Automatic Recognition of Emotions in SpeechA. Batliner, B. Schuller, D. Seppi, S. Steidl, L. Devillers, L. Vidrascu, T. Vogt, V. Aharonson, and N. AmirArticle in P. Petta, Paolo, C. Pelachaud, R. Cowie, eds., Emotion-Oriented Systems: The Humaine Handbook Cognitive Technologies, pp. 71-99, Springer 2011Speech
Exploiting User Feedback for Language Model Adaptation in Meeting RecognitionD. Vergyri, A. Stolcke, and G. TurProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740April 2009Speech[PDF]

Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot InteractionA. Batliner, S. Steidl, and E. NöthProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27May 2011Speech[PDF]

Comparing Multilayer Perceptron to Deep Belief Network Tandem Features for Robust ASRO. Vinyals and S. RavuriProceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), Prague, Czech RepublicMay 2011Speech[PDF]

On the Use of Spectro-Temporal Features in Noise-Additive SpeechS. RavuriUC Berkeley Master's thesis, Spring 2011 2011Speech[PDF]

Improved Overlapped Speech Handling for Speaker DiarizationK. Boakye, O. Vinyals, and G. FriedlandProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944August 2011Speech
How Good Is the Crowd at "Real" WSD?J. Hong and C. F. BakerProceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, OregonJune 2011Speech[PDF]

Data Selection with Kurtosis and Nasality features for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756August 2011Speech[PDF]

Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme DynamicsK. Chang, H. Lei, and J. CannyProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88August 2011Speech[PDF]

Effective Arabic Dialect Classification Using Diverse Phonotactic ModelsM. Akbacak, D. Vergyri, A. Stolcke, N. Scheffer, and A. MandalProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 737-740August 2011Speech[PDF]

Constrained Cepstral Speaker Recognition Using Matched UBM and JFA TrainingM. H. Sanchez, L. Ferrer, E. Shriberg, and A. StolckeProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 141-144August 2011Speech[PDF]

Speaker DiarizationG. FriedlandIn Speech and Audio Signal Processing, 2nd edition, B. Gold, N. Morgan, D. Ellis, eds., Wiley 2011Speech
Video2GPS: A Demo of Multimodal Location Estimation on Flickr VideosG. Friedland, J. Choi, and A. JaninProceedings of the ACM Multimedia Conference (MM'11), Scottsdale, ArizonaNovember 2011Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single MicrophoneH. Hung, Y. Huang, G. Friedland, and D. Gatica-PerezIEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860May 2011Speech
Java Visual Speech Components for Rapid Application Development of GUI based Speech Processing ApplicationsS. Steidl, K. Riedhammer, T. Bocklet, F. Hoenig, and E. NoethProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 3257-3260August 2011Speech
Comparing Different Flavors of Spectro-Temporal Features for ASRB. T. Meyer, S. V. Ravuri, M. R. Schaedler, and N. MorganProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 1269-1272August 2011Speech[PDF]

The ICSI RT-09 Speaker Diarization SystemG. Friedland, A. Janin, D. Imseng, X. Anguera, L. Gottlieb, M. Huijbregts, M. Knox, and O. VinyalsIEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, Issue 2, pp. 371-381February 2012Speech[PDF]

Speaker DiarizationG. Friedland and F. ValenteIn Multimodal Signal Processing: Human Interactions in Meetings, S. Reynals, H. Bourlard, J. Carletta, and A. Popescu-Belis, eds., Cambridge University PressJune 2012Speech
Narrative Theme Navigation for Sitcoms Supported by Fan-Generated ScriptsG. Friedland, A. Janin, and L. GottliebTo appear in Multimedia Tools and Applications, Springer 2012Speech[PDF]

Fast Speaker Diarization Using a High-Level Scripting LanguageE. Gonina, G. Friedland, H. Cook, and K. KeutzerProceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), Big Island, HawaiiDecember 2011Speech[PDF]

On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia RetrievalR. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. DivakaranProceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451December 2011Speech[PDF]

Acoustic Super Models for Large Scale Video Event DetectionR. Mertens, H. Lei, L. Gottlieb, G. Friedland, and A. DivakaranProceedings of the ACM International Workshop on Events in Multimedia (EiMM11), Scottsdale, ArizonaNovember 2011Speech[PDF]

Multimodal Location Estimation on Flickr VideosG. Friedland, J. Choi, H. Lei, and A. JaninProceedings of the ACM International Workshop on Social Media (WSM11), Scottsdale, ArizonaNovember 2011Speech[PDF]

The 2011 ICSI Video Location Estimation SystemJ. Choi, H. Lei, and G. FriedlandProceedings of the MediaEval 2011 Workshop, Pisa, ItalySeptember 2011Speech[PDF]

Review of A. Rahman, et al., "Spatial-Geometric Approach to Physical Mobile Interaction Based on Accelerometer and IR Sensory Data Fusion"G. FriedlandACM Computing Reviews, CR139264July 2011Speech
Review of J. Ajmera, et al., "Two-Stream Indexing for Spoken Web Search"G. FriedlandACM Computing Reviews, CR139192June 2011Speech
Review of C. Simon, et al., "Visual Event Recognition Using Decision Trees"G. FriedlandACM Computing Reviews, CR138638January 2011Speech
Improving Automatic Speech Recognition by Learning from Human ErrorsB. T. MeyerProceedings of the 162nd Meeting of the Acoustical Society of America, San Diego, CaliforniaOctober 2011Speech

Pages