Pavel Klavík, A. Cristiano I. Malossi, et al.
Philos. Trans. R. Soc. A
This technical report introduces Docling, an easy to use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware in a small resource budget. The code interface allows for easy extensibility and addition of new features and models.
Pavel Klavík, A. Cristiano I. Malossi, et al.
Philos. Trans. R. Soc. A
Ismail Akhalwaya, Shashanka Ubaru, et al.
ICLR 2024
Ken C.L. Wong, Satyananda Kashyap, et al.
Pattern Recognition Letters
Saurabh Paul, Christos Boutsidis, et al.
JMLR