International Computer Science Institute Talks Talks at the International Computer Science Institute
"A Whole Sentence Maximum Entropy Language Model" (and other statistical language modeling projects at Carnegie Mellon University)

Roni Rosenfeld
Carnegie Melon University
Roni_Rosenfeld alf11.speech.cs.cmu.edu

Friday, December 19, 1997 2:00 - 4:00 p.m.

Starting with a brief introduction to statistical language modeling, I will proceed with a short summary of several ongoing language modeling projects at my lab.

The bulk of my talk will describe a new kind of language model, which models whole sentences or utterances directly using the Maximum Entropy (ME) paradigm. The new model is conceptually simpler, and more naturally suited to modeling whole-sentence phenomena, than the conditional ME models proposed to date. By avoiding the chain rule, the model treats each sentence or utterance as a ``bag of features'', where features are arbitrary computable properties of the sentence. The model is unnormalizable, but this does not interfere with training (done via sampling) or with use. Using the model is computationally straightforward. The main computational cost of training the model is in generating sample sentences from a Gibbs distribution. Interestingly, this cost has different dependencies, and is potentially lower, than in the comparable conditional ME model.

This talk will be held in the Main Lecture Hall at ICSI,
1947 Center Street, Sixth Floor, Berkeley, CA 94704-1198
(on Center between Milvia and Martin Luther King Jr. Way).
Click here for a map.