Fine Grained Classification of Personal Data Entities with Language Models

Abhinav Nagpal; Riddhiman Dasgupta; Balaji Ganesan

doi:10.1145/3493700.3493707

CODS-COMAD 2022

Short paper

06 Jan 2022

Fine Grained Classification of Personal Data Entities with Language Models

View publication

Abstract

Fine grained entity classification is the task of assigning context-specific, fine grained labels to entities extracted in an NLP Pipeline. Before the advent of language models, several artificial neural network models were proposed for this task. We revisit these models and compare them with BERT-based models for the specific task of classifying Personal Data Entities (PDE). We observe that using side information from rule-based annotators improves neural model performance on this task and can complement language models.

Conference paper