A text extraction library supporting PDFs, images, office documents and more.
Kreuzberg is a Python library for text extraction from documents. It provides a unified async interface for extracting text from PDFs, images, office documents, and more.