gmft
A lightweight and high-performance deep PDF table extraction tool.
CommonProductProgrammingPDF processingTable extraction
gmft is a toolkit designed to convert tables in PDFs into various formats. It is lightweight, modular, and delivers exceptional performance. gmft relies on Microsoft's Table Transformers, recognized as one of the best-performing and most reliable solutions among many alternatives. It operates without the need for a GPU, offering high throughput and easy installation, requiring only a single line of code. It utilizes PyPDFium2, favored for its high throughput and permissive licensing. The training model TATR used by gmft is trained on the diverse dataset PubTables-1M, ensuring high reliability.
gmft Visit Over Time
Monthly Visits
515580771
Bounce Rate
37.20%
Page per Visit
5.8
Visit Duration
00:06:42