08/2000-12/2004 Purdue University, West Lafayette, IN
Ph.D. in Electrical and Computer Engineering
· Dissertation Title: Structural Event Detection for Rich Transcription of Speech (done at Purdue University and ICSI)
· Advisor: Mary Harper
· Major: Signal Processing and Communication
· Minor: Computer Engineering
09/1997-06/2000 Tsinghua University, Beijing, China
· Master's Thesis Title: Robust Language Model in Speech Recognition
· Major: Information and Signal Processing
09/1992-07/1997 Tsinghua University, Beijing, China
· Bachelor's Thesis Title: Automatic Segmentation of Chinese Corpus for Speech Recognition
· Major: Information and Signal Processing
Current Funded Activities
12/2004-present International Computer Science Institute, Berkeley, CA
· Parsing speech: investigation of the impact of structural events on parsing speech and the effectiveness of incorporating syntactic knowledge for detecting metadata events. This proposal was written together with Mary Harper and accepted by the NSF/DARPA-supported 2005 JHU summer workshop.
12/2004-present International Computer Science Institute, Berkeley, CA
· EARS MDE research, for English and Chinese.
· Dialog Act detection in Meeting corpus.
· Active learning and unsupervised learning for DA modeling and MDE.
07/2002-12/2004 International Computer Science Institute, Berkeley, CA
· In charge of the ICSI/SRI/UW metadata extraction system for the NIST Rich Transcription evaluations. Our system achieved the best overall performance in both evaluations (RT-03 and RT-04).
· Proposed using maximum entropy (Maxent) and conditional random field (CRF) approaches for metadata event detection. Investigated a variety of features (e.g., from a POS tagger, chunk parser, and prosody model) and feature selection methods in these approaches.
· Collaborated on generalizing structural event detection algorithms to the multiparty meeting corpus for dialog act segmentation and classification, as well as to a multimodal corpus, using both a prosody model and a gesture model for sentence boundary detection.
· Investigated machine learning techniques for dealing with the imbalanced data problem in classification.
· Collaborated on using metadata information (sentence boundary hypotheses) to improve speech recognition accuracy by re-segmenting the speech and re-recognizing it.
· Investigated using voice quality features for word fragment detection and a speech synthesis model (the Fujisaki model) for phrase boundary detection.
08/2000-07/2002 Purdue University, West Lafayette, IN
· Investigated using a compact graphical representation as the interface between speech recognition and subsequent speech understanding modules.
· Investigated the effectiveness of the combination of different language models for rescoring word lattices or N-best lists.
· Evaluated bottom-up (KL divergence) and top-down (decision tree) approaches for effective triphone acoustic model training.
09/1996-06/2000 Tsinghua University, Beijing, China
· Investigated different smoothing methods for language models, search algorithms in speech recognition, class-based language models, and language model adaptation.
· Primary project member of the Chinese government-supported speech recognition project. Our team achieved the best system performance for three years in a row.
· Participated in the development of a real-time large vocabulary continuous Chinese speech recognition system, working on the lexicon-tree design, Chinese word segmentation algorithm, semantic-based language model, multi-pass search algorithm for real-time speech recognition, and command-control module.
10/1999-11/1999 Philips Speech Processing, Vienna, Austria
· Main developer of a real-time large vocabulary Mandarin continuous speech recognition system, working on the lexicon tree, multi-pass search algorithm, and language models.
My Ph.D. thesis “Structural Event Detection for Rich Transcription of Speech” [pdf]
[1] Yang Liu, Nitesh Chawla, Mary Harper, Elizabeth Shriberg, and Andreas Stolcke, "A Study in Machine Learning from Imbalanced Data for Sentence Boundary Detection in Speech", To appear in Computer Speech and Language. [pdf]
[2] Yang Liu, Mary Harper, Michael Johnson, and Leah Jamieson, "The Effect of Pruning and Compression on Graphical Representations of the Output of a Speech Recognizer", Computer Speech and Language, Vol. 17, No. 4, pp. 329-356, October 2003. [pdf]
[1] Yang Liu, Mary Harper, Elizabeth Shriberg and Andreas Stolcke, "Using Conditional Random Fields For Sentence Boundary Detection in Speech", To appear in ACL 2005.
[2] Jeremy Ang, Yang Liu, and Elizabeth Shriberg, “Automatic Dialog Act Segmentation and Classification in Multiparty Meetings”, ICASSP 2005. [pdf]
[3] Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Barbara Peskin, Jeremy Ang, Dustin Hillard, Mari Ostendorf, Marcus Tomalin, Phil Woodland, and Mary Harper, “Structural Metadata Research in the EARS Program”, Invited paper. ICASSP 2005. [pdf]
[4] Yang Liu, Andreas Stolcke, Mary Harper, and Elizabeth Shriberg, "Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech", EMNLP 2004. [pdf]
[5] Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Dustin Hillard, Mari Ostendorf, Barbara Peskin, and Mary Harper, "The ICSI-SRI-UW Metadata Extraction System", ICSLP 2004. [pdf]
[6] Yang Liu, Elizabeth Shriberg, Andreas Stolcke, and Mary Harper, "Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection", ICSLP 2004. [pdf]
[7] Lei Chen, Yang Liu, Mary Harper, and Elizabeth Shriberg, "Multimodal Model Integration for Sentence Unit Detection", ICMI 2004. [pdf]
[8] Lei Chen, Yang Liu, Eduardo Maia, and Mary Harper, "Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus", LREC 2004. [pdf]
[9] Dustin Hillard, Mari Ostendorf, Andreas Stolcke, Yang Liu, and Elizabeth Shriberg, "Improving Automatic Sentence Boundary Detection with Confusion Networks", HLT/NAACL 2004. [pdf]
[10] Yang Liu, Elizabeth Shriberg, and Andreas Stolcke, "Automatic Disfluency Identification in Conversational Speech Using Multiple Knowledge Sources", EuroSpeech 2003. [pdf]
[11] Yang Liu, "Word Fragment Identification Using Acoustic-Prosodic Features in Conversational Speech", HLT/NAACL 2003 Student Workshop, 2003. [pdf]
[12] Wen Wang, Yang Liu, and Mary Harper, "Rescoring Effectiveness of Language Models Using Different Levels of Knowledge and Their Integration", ICASSP 2002. [pdf]
[13] Yang Liu, Jiasong Sun, and Zuoying Wang, "Comparisons of Smoothing Methods in Chinese Language Model", ISCSLP 2000. [pdf]
[14] Jiasong Sun, Xiaodong Cui, Zuoying Wang, and Yang Liu, "A Language Model Adaptation Approach Based on Text Classification", ICSLP 2000. [pdf]
[15] Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Barbara Peskin, and Mary Harper, “The ICSI/SRI/UW RT04 Structural Metadata Extraction System”, EARS RT-04 Workshop, New York, November 2004. [pdf]
[16] Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Barbara Peskin, Jeremy Ang, and Mary Harper, “ICSI-SRI-UW Structural MDE: Modeling, Analysis, and Issues”, EARS RT-04 Workshop presentation, New York, November 2004. [ppt]
[17] Yang Liu, Elizabeth Shriberg, Dustin Hillard et al., "ICSI/SRI/UW RT03F MDE System and Research", EARS RT-03F Workshop presentation, Washington D.C., November 2003. [ppt]
[18] Elizabeth Shriberg, Yang Liu, Mari Ostendorf et al., "Metadata Extraction Research at SRI/ICSI/UW", EARS RT-03S Workshop presentation, Boston, May 2003. [ppt]