Docling: An Efficient Open-Source Toolkit for AI-driven Document ConversionNikos LivathinosChristoph Aueret al.2025AAAI 2025
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG SystemsRafael Teixeira de LimaShubham Guptaet al.2025COLING 2025
INDUS: Effective and Efficient Language Models for Scientific ApplicationsBhatta BhattacharjeeAashka Trivediet al.2024EMNLP 2024
Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIsLokesh MishraSohayl Dhibiet al.2024ACL 2024
ESG Accountability Made Easy: DocQA at Your ServiceLokesh MishraCesar Berrospi Ramiset al.2024AAAI 2024
FETA: Towards Specializing Foundational Models for Expert Task ApplicationsAmit AlfassyAssaf Arbelleet al.2022NeurIPS 2022
DocLayNet: A Large Human-Annotated Dataset for Document-Layout SegmentationBirgit PfitzmannChristoph Aueret al.2022KDD 2022
Delivering Document Conversion as a Cloud Service with High Throughput and ResponsivenessChristoph AuerMichele Dolfiet al.2022CLOUD 2022