FETA: Towards Specializing Foundational Models for Expert Task ApplicationsAmit AlfassyAssaf Arbelleet al.2022NeurIPS 2022
Delivering Document Conversion as a Cloud Service with High Throughput and ResponsivenessChristoph AuerMichele Dolfiet al.2022CLOUD 2022
ESG Accountability Made Easy: DocQA at Your ServiceLokesh MishraCesar Berrospi Ramiset al.2024AAAI 2024
Docling: An Efficient Open-Source Toolkit for AI-driven Document ConversionNikos LivathinosChristoph Aueret al.2025AAAI 2025
Optimized Table Tokenization for Table Structure RecognitionMaxim LysakAhmed Nassaret al.2023ICDAR 2023
DocLayNet: A Large Human-Annotated Dataset for Document-Layout SegmentationBirgit PfitzmannChristoph Aueret al.2022KDD 2022
KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business DocumentsOshri NaparstekRoi Ponyet al.2024ICDAR 2024
Corpus conversion service: A machine learning platform to ingest documents at scalePeter StaarM. Dolfiet al.2018KDD 2018