kreuzberg
A Python library that supports extracting text from various formats, including PDFs, images, and office documents.
CommonProductProgrammingText extractionPDF processing
Kreuzberg is a modern Python library focused on extracting text from various documents. It provides an efficient text extraction solution through a concise API and local processing capabilities. The library supports multiple file formats, including PDF, images, and office documents, without complex configurations or external API calls. It uses an asynchronous interface design, which improves processing efficiency while maintaining a lightweight resource footprint. Kreuzberg is suitable for scenarios requiring localized text extraction, such as RAG applications. Its main advantages are ease of use, resource efficiency, and powerful functionality.
kreuzberg Visit Over Time
Monthly Visits
502571820
Bounce Rate
37.10%
Page per Visit
5.9
Visit Duration
00:06:29