Massimiliano Pronesti, Joao Bettencourt-Silva, et al.
ACL 2025
Wikification of large corpora is beneficial for various NLP applications. Existing methods focus on quality performance rather than run-time, and are therefore non-feasible for large data. Here, we introduce RedW, a run-time oriented Wikification solution, based on Wikipedia redirects, that can Wikify massive corpora with competitive performance. We further propose an efficient method for estimating RedW confidence, opening the door for applying more demanding methods only on top of RedW lower-confidence results. Our experimental results support the validity of the proposed approach.
Massimiliano Pronesti, Joao Bettencourt-Silva, et al.
ACL 2025
Jatin Ganhotra, HAGGAI Roitman, et al.
EMNLP 2020
Arvind Agarwal, Laura Chiticariu, et al.
NAACL 2021
Viviane T. Silva, Rodrigo Neumann Barros Ferreira, et al.
ACS Fall 2024