Publication Search Results

TitleAuthorBibliographicDatesort descendingGroupLinks
Precise Indoor Localization Using Smart PhonesE. Martin, O. Vinyals, G. Friedland, and R. BajcsyProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 787-790October 2010Speech[PDF]

Joke-O-Mat HD: Browsing Sitcoms with Human Derived TranscriptsA. Janin, L. Gottlieb, and G. FriedlandProceedings of the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 1591-1594October 2010Speech[PDF]

Narrative-Theme Navigation for Sitcoms Supported by Fan-Generated ScriptsG. Friedland, L. Gottlieb, and A. JaninProceedings of the Third International Workshop on Automated Information Extraction in Media Production (AIEMPro '10) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 3-8October 2010Speech[PDF]

Long Story Short - Global Unsupervised Models for Keyphrase Based Meeting SummarizationK. Riedhammer, B. Favre, and D. Hakkani-TurSpeech Communication, Vol. 52, Issue 10, pp. 801-815. DOI:10.1016/j.specom.2010.06.002October 2010Speech
A Parallel Meeting DiaristG. Friedland, J. Chong, and A. JaninProceedings of the Workshop on Searching Spontaneous Conversational Speech (SSCS) at the ACM International Conference on Multimedia (ACM Multimedia 2010), Florence, Italy, pp. 57-60October 2010Speech[PDF]

Review of C. Mueller-Tomfelder, "Tabletops - Horizontal Interactive Displays"G. FriedlandACM Computing Reviews, CR138453October 2010Speech[PDF]

The 2010 ICSI Video Location Estimation SystemJ. Choi, A. Janin, and G. FriedlandProceedings of the MediaEval 2010 Workshop, Pisa ItalyOctober 2010Speech[PDF]

Tuning-Robust Initialization Methods for Speaker DiarizationD. Imseng and G. FriedlandIEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, Issue 8, pp. 2028-2037November 2010Speech[PDF]

Selected Papers from the 11th IEEE International Symposium on Multimedia (ISM2009)G. Friedland and M.-L. Shyu, eds.International Journal on Semantic Computing, Vol. 4, No. 2November 2010Speech
Dialocalizaton: Acoustic Speaker Diarization and Visual Localization as Joint Optimization ProblemG. Friedland, C. Yeo, and H. HungACM Transactions on Multimedia Computing, Communications, and Applications, Vol. 6, No. 4, Article 27November 2010Speech[PDF]

Parallelizing Speaker-Attributed Speech Recognition for Meeting BrowsingG. Friedland, J. Chong, and A. JaninProceedings of the 2010 IEEE International Symposium on Multimedia (ISM2010), Taiwan, pp. 121-128December 2010Speech[PDF]

Structured Approaches to Data Selection for Speaker RecognitionH. LeiUC Berkeley dissertationDecember 2010Speech[PDF]

Introduction to Multimedia ComputingG. Friedland and R. JainCambridge University Press 2011Speech
Automated Information Extraction in ProductionR. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. SanoSpecial issue in Multimedia Tools and Applications, Springer 2011Speech
Deep and Wide: Multiple Layers in Automatic Speech RecognitionN. MorganIEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning 2011Speech[PDF]

The Automatic Recognition of Emotions in SpeechA. Batliner, B. Schuller, D. Seppi, S. Steidl, L. Devillers, L. Vidrascu, T. Vogt, V. Aharonson, and N. AmirArticle in P. Petta, Paolo, C. Pelachaud, R. Cowie, eds., Emotion-Oriented Systems: The Humaine Handbook Cognitive Technologies, pp. 71-99, Springer 2011Speech
On the Use of Spectro-Temporal Features in Noise-Additive SpeechS. RavuriUC Berkeley Master's thesis, Spring 2011 2011Speech[PDF]

Speaker DiarizationG. FriedlandIn Speech and Audio Signal Processing, 2nd edition, B. Gold, N. Morgan, D. Ellis, eds., Wiley 2011Speech
Review of C. Simon, et al., "Visual Event Recognition Using Decision Trees"G. FriedlandACM Computing Reviews, CR138638January 2011Speech
Semantic Computing and Privacy: A Case Study Using Inferred Geo-TaggingG. Friedland and J. ChoiInternational Journal of Semantic Computing, Vol. 5, No. 1, pp. 79-93. Also Best Poster in the Electrical and Computer Science and Engineering Track at the Korean Student Technical and Leadership Conference, Chicago, Illinois, March 2012. DOI: 10.1142/S1793351X11001171March 2011Speech[PDF]

Automatic Tagging and Geo-Tagging in Video Collections and CommunitiesM. Larson, M. Soleymani, P. Serdyukov, S. Rudinac, C. Wartena, V. Murdock, G. Friedland, R. Ordelman, and G. J. F. JonesProceedings of the ACM International Conference on Multimedia Retrieval (ICMR 2011), Trento, Italy, April 2011April 2011Speech[PDF]

CUDA-Level Performance with Python-Level Productivity for Gaussian Mixture Model ApplicationsH. Cook, E. Gonina, S. Kamil, G. Friedland, D. Patterson, and A. FoxProceedings of the Third USENIX Workshop on Hot Topics in Parallelism (HotPar ’11), Berkeley, CaliforniaMay 2011Speech[PDF]

User Verification: Matching the Uploaders of Videos Across AccountsH. Lei, J. Choi, A. Janin, and G. FriedlandProceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 2404-2407May 2011Speech[PDF]

The SRI NIST 2010 Speaker Recognition Evaluation SystemN. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295May 2011Speech[PDF]

The IBM 2009 GALE Arabic Speech Transcription SystemB. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675May 2011Speech[PDF]

Language-Independent Constrained Cepstral Features for Speaker RecognitionE. Shriberg and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299May 2011Speech[PDF]

Bird Species Recognition Combining Acoustic and Sequence ModelingM. Graciarena, M. Delplanche, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344May 2011Speech[PDF]

Making the Most from Multiple Microphones in Meeting RecognitionA. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995May 2011Speech[PDF]

Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot InteractionA. Batliner, S. Steidl, and E. NöthProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27May 2011Speech[PDF]

Comparing Multilayer Perceptron to Deep Belief Network Tandem Features for Robust ASRO. Vinyals and S. RavuriProceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), Prague, Czech RepublicMay 2011Speech[PDF]

Estimating Dominance in Multi-Party Meetings Using Speaker Diarization from a Single MicrophoneH. Hung, Y. Huang, G. Friedland, and D. Gatica-PerezIEEE Transactions on Audio, Speech and Language Processing, Vol. 19, No. 4, pp. 847–860May 2011Speech
How Good Is the Crowd at "Real" WSD?J. Hong and C. F. BakerProceedings of the Fifth Linguistic Annotation Workshop (LAW-V), Portland, OregonJune 2011Speech[PDF]

Review of J. Ajmera, et al., "Two-Stream Indexing for Spoken Web Search"G. FriedlandACM Computing Reviews, CR139192June 2011Speech
Gappy Phrasal Alignment by AgreementM. Bansal, C. Quirk, and R. C. MooreProceedings of the 49th annual Meeting of the Association for Computational Linguistics, pp. 1308-1317 Portland, OregonJune 2011Speech[PDF]

The Surprising Variance in Shortest-Derivation ParsingM. Bansal and D. KleinProceedings of the 49th annual Meeting of the Association for Computational Linguistics, Portland, OregonJune 2011Speech[PDF]

Web-Scale Features for Full-Scale ParsingM. Bansal and D. KleinProceedings of the 49th annual Meeting of the Association for Computational Linguistics, pp. 693-702, Portland, OregonJune 2011Speech[PDF]

Review of A. Rahman, et al., "Spatial-Geometric Approach to Physical Mobile Interaction Based on Accelerometer and IR Sensory Data Fusion"G. FriedlandACM Computing Reviews, CR139264July 2011Speech
Improved Overlapped Speech Handling for Speaker DiarizationK. Boakye, O. Vinyals, and G. FriedlandProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 941-944August 2011Speech
Data Selection with Kurtosis and Nasality features for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 2753-2756August 2011Speech[PDF]

Improved Classification of Speaking Styles for Mental Health Monitoring using Phoneme DynamicsK. Chang, H. Lei, and J. CannyProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 85-88August 2011Speech[PDF]

Effective Arabic Dialect Classification Using Diverse Phonotactic ModelsM. Akbacak, D. Vergyri, A. Stolcke, N. Scheffer, and A. MandalProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 737-740August 2011Speech[PDF]

Constrained Cepstral Speaker Recognition Using Matched UBM and JFA TrainingM. H. Sanchez, L. Ferrer, E. Shriberg, and A. StolckeProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 141-144August 2011Speech[PDF]

Java Visual Speech Components for Rapid Application Development of GUI based Speech Processing ApplicationsS. Steidl, K. Riedhammer, T. Bocklet, F. Hoenig, and E. NoethProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 3257-3260August 2011Speech
Comparing Different Flavors of Spectro-Temporal Features for ASRB. T. Meyer, S. V. Ravuri, M. R. Schaedler, and N. MorganProceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence, Italy, pp. 1269-1272August 2011Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

The 2011 ICSI Video Location Estimation SystemJ. Choi, H. Lei, and G. FriedlandProceedings of the MediaEval 2011 Workshop, Pisa, ItalySeptember 2011Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the Fifth IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

Improving Automatic Speech Recognition by Learning from Human ErrorsB. T. MeyerProceedings of the 162nd Meeting of the Acoustical Society of America, San Diego, CaliforniaOctober 2011Speech
Speech and Audio Signal Processing: Processing and Perception of Speech and Music, 2nd EditionB. Gold, N. Morgan, and D. EllisWileyNovember 2011Speech
Video2GPS: A Demo of Multimodal Location Estimation on Flickr VideosG. Friedland, J. Choi, and A. JaninProceedings of the ACM Multimedia Conference (MM'11), Scottsdale, ArizonaNovember 2011Speech[PDF]

Pages