MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures. Lucas Morin, Valery Weber, et al. 2025. CVPR 2025.
ChemQuery: A Natural Language Query-driven Service for Comprehensive Exploration of Chemistry Patent Literature. Shubham Gupta, Rafael Teixeira de Lima, et al. 2025. Appl. AI Lett.
Foundation models for materials discovery – current state and future directions. Edward Pyzer-Knapp, Matteo Manica, et al. 2025. npj Computational Materials.
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion. Nikos Livathinos, Christoph Auer, et al. 2025. AAAI 2025.
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems. Rafael Teixeira de Lima, Shubham Gupta, et al. 2025. COLING 2025.
INDUS: Effective and Efficient Language Models for Scientific Applications. Bhatta Bhattacharjee, Aashka Trivedi, et al. 2024. EMNLP 2024.
Wealth Over Woe: Global Biases in Hydro-Hazard Research. Lina Stein, S. Karthik Mukkavilli, et al. 2024. Earth's Futur.
KVP10k: A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents. Oshri Naparstek, Roi Pony, et al. 2024. ICDAR 2024.
Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs. Lokesh Mishra, Sohayl Dhibi, et al. 2024. ACL 2024.