Publication
EMNLP 1998
Conference paper

A comparison of criteria for maximum entropy/minimum divergence feature selection

Abstract

In this paper we study the gain, a naturally-arising statistic from the theory of MEMD modeling [2], as a figure of merit for selecting features for an MBMD language model. We compare the gain with two popular alternatives-empirical activation and mutual information-and argue that the gain is the preferred statistic, on the grounds that it directly measures a feature's contribution to improving upon the base model.

Date

Publication

EMNLP 1998

Authors

Topics

Share