gmft

A lightweight and high-performance deep PDF table extraction tool.

gmft is a toolkit for converting tables in PDFs into various formats. It is lightweight and modular. gmft relies on Microsoft's Table Transformer (TATR), described as one of the best-performing and most reliable of the many available alternatives. It runs without a GPU, offers high throughput, and installs with a single command. For PDF parsing it uses PyPDFium2, chosen for its high throughput and permissive license. The TATR model that gmft uses is trained on PubTables-1M, a large and diverse dataset, which supports its reliability.
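The workflow described above can be sketched in Python roughly as follows. This is a minimal sketch, not the project's official example: the import paths and method names (AutoTableDetector, AutoTableFormatter, PyPDFium2Document, extract, df) are assumed from gmft's README for version 0.2 and later and may differ in other releases, while extract_tables and save_as_csv are hypothetical helper names introduced here for illustration. It assumes gmft has been installed with pip install gmft.

```python
from pathlib import Path


def extract_tables(pdf_path):
    """Return one pandas DataFrame per table detected in the PDF at pdf_path."""
    # Imported inside the function so this file loads even without gmft.
    # Assumption: these import paths match gmft's README (v0.2+).
    from gmft.auto import AutoTableDetector, AutoTableFormatter
    from gmft.pdf_bindings import PyPDFium2Document

    detector = AutoTableDetector()    # locates table regions on each page
    formatter = AutoTableFormatter()  # recovers rows and columns with TATR

    doc = PyPDFium2Document(str(pdf_path))
    try:
        frames = []
        for page in doc:
            for cropped in detector.extract(page):
                # Build the DataFrame while the document is still open.
                frames.append(formatter.extract(cropped).df())
        return frames
    finally:
        doc.close()  # release the underlying PDFium handle


def save_as_csv(frames, out_dir):
    """Write each table to out_dir/table_<n>.csv and return the paths written."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, frame in enumerate(frames):
        path = out_dir / f"table_{i}.csv"
        frame.to_csv(path, index=False)
        paths.append(path)
    return paths
```

A call such as save_as_csv(extract_tables("report.pdf"), "tables") would then write one CSV file per detected table; "report.pdf" and "tables" are placeholder names. CSV is only one target, since the same DataFrames can be exported to other formats through pandas.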

gmft Visit Over Time

Monthly Visits: 503747431
Bounce Rate: 37.31%
Pages per Visit: 5.7
Visit Duration: 00:06:44
