The ICSI/SRI/UW RT04 Structural Metadata Extraction System

Title: The ICSI/SRI/UW RT04 Structural Metadata Extraction System
Publication Type: Conference Paper
Year of Publication: 2004
Authors: Liu, Y., Shriberg, E., Stolcke, A., Peskin, B., & Harper, M. P.
Published in: RT-04 EARS Workshop
Other Numbers: 8

Both human and automatic processing of speech require recognizing more than just the words. We describe the ICSI-SRI-UW metadata detection system for both broadcast news and spontaneous telephone conversations, developed as part of the DARPA EARS Rich Transcription program. System tasks include sentence boundary detection, filler word detection, and detection and correction of disfluencies. To achieve the best performance, we combine information from different types of textual knowledge sources (based on words, part-of-speech classes, and automatically induced classes) with information from a prosodic classifier. The prosodic classifier employs bagging and ensemble approaches to better estimate posterior probabilities. In addition to our previous hidden Markov model (HMM) approach, we investigate maximum entropy (Maxent) and conditional random field (CRF) approaches for various tasks. Results using these techniques are presented for the 2004 NIST Rich Transcription metadata tasks.

Bibliographic Notes

RT-04 EARS Workshop

Abbreviated Authors

Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, and M. Harper

ICSI Research Group


ICSI Publication Type

Article in conference proceedings