pdfdeal
Python packaging of Doc2X API, enhancing PDF processing.
CommonProductProgrammingPDF processingOCR
Pdfdeal is a Python tool that packages the Doc2X API, providing local PDF processing capabilities to enhance PDF recall in RAG (Retrieval Augmented Generation). It supports various output formats, including text, Markdown, and PDF, and allows customization of OCR language and utilizes GPU acceleration. It also integrates with Doc2X, a service with a daily free usage quota of 500 pages, which excels in recognizing tables and formulas.
pdfdeal Visit Over Time
Monthly Visits
515580771
Bounce Rate
37.20%
Page per Visit
5.8
Visit Duration
00:06:42