
I am interested in machine learning and its applications in speech and natural language proceesing. My research advisor is Prof. Nelson Morgan.
I am currently working as an SDE Speech Scientist with Microsoft.
[1] "An Auditory-Based Frequency Modulation Feature and Feature Combination for Robust Speaker Identification," in Submission to the IEEE Trans of audio, speech and language processing, 2010 (with Q. Li).
[2] "An Auditory-Based Feature and Its Application to Robust Speaker Identification," Submitted to the IEEE Transaction of audio, speech and language processing, 2009 (with Q. Li).
[3] "An Auditory-Based Feature and Its Application to Robust Speaker Identification," International Conference on Speech and Signal Processing, Dallas, 2010 (with Q. Li).
[4] "Fusing short term and long term features for improved speaker diarization," IEEE Transaction of audio, speech and language processing, 2009 (with G. Friedland, O. Vinyals, and C. Mueller).
[5] "Estimating Dominance In Multi-Party Conversations Using Automatically Generated Audio Cues," Submitted to the IEEE Transaction of audio, speech and language processing, 2009 (with H.Hung, G. Friedland, and D. Gatica-Perez)
[6] "Correlating audio-visual cues in a dominance estimation framework," CVPR workshop on human behavior, 2009 (with H.Hung, G. Friedland, and D. Gatica-Perez).
[7] "Fusing short term and long term features for improved speaker diarization," International Conference on Speech and Signal Processing, Taipei, 2009 (with G. Friedland, O. Vinyals, and C. Mueller).
[8] "Estimating The Dominant Person In Multi-Party Conversations Using Speaker Diarization Strategies," International Conference on Speech and Signal Processing, Las Vegas, 2008 (with H.Hung, G. Friedland, and D. Gatica-Perez).
[9] "Optimization of Latent Semantic Analysis Based Language Model Interpolation for Meeting Recognition", Fifth Slovenian and First International Language Technologies Conference, Slovenia, 2006 (with Michael Pucher, ?g? ?tin).
[10] "Vocabulary and Language Model Adaptation using Information Retrieval", International Conference on Spoken Language Processing, Jeju Island, Korea, 2004 (with Brigitte Bigi, Renato De Mori).
[11] "A Novel Model TD-PSPTP for Speech Synthesis", 6th European Conference on Speech Communication and Technology, Budapest, Hungary, 1999 (with Bo Xu).
[12] "Neural Learning Approach for Duration Parameter Generation in Mandarin Speech Synthesis", 1th International Symposium on Chinese Spoken Language Processing, Singapore, 1998 (with Taiyi Huang).
Working
Experience
International Computer Science Institute,
Berkeley, CA (Research Assistant)
Panasonic Speech Technologies Laboratory, Santa Barbara, CA (Research
Engineer)
Center of Language and Speech Processing, the Johns Hopkins University,
Batimore, MD (Researach Assistant)
Bell-labs, Lucent Technologies (Summer Internship)
National Laboratory of Pattern Recognition, Institute of Automation,
Chinese Academy of Sciences, Beijing, China (Research Assistant)
Misc
I am a huge fan of New
York City Ballet Company. I also enjoy spending sometime on the
barre in the studio when I have time.