Focused named entity recognition using machine learning

Li Zhang; Yue Pan; Tong Zhang

doi:10.1145/1008992.1009042

SIGIR 2004

Conference paper

25 Jul 2004

Focused named entity recognition using machine learning

View publication

Abstract

In this paper we study the problem of finding most topical named entities among all entities in a document, which we refer to as focused named entity recognition. We show that these focused named entities are useful for many natural language processing applications, such as document summarization, search result ranking, and entity detection and tracking. We propose a statistical model for focused named entity recognition by converting it into a classification problem. We then study the impact of various linguistic features and compare a number of classification algorithms. From experiments on an annotated Chinese news corpus, we demonstrate that the proposed method can achieve near human-level accuracy.

Conference paper