AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

Baidu AI Open-Sources PP-TableMagic, a Table Recognition Model

AIbase基地

Published inAI News · 5 min read · Mar 12, 2025

On March 11th, Baidu AI announced the open-sourcing of its latest table recognition solution, PP-TableMagic, marking a significant breakthrough in structured information extraction from tables. PP-TableMagic aims to overcome the limitations of traditional table recognition technologies in complex scenarios. Through an innovative multi-model network architecture, it achieves highly accurate end-to-end table recognition and supports highly customizable model fine-tuning for all scenarios.

In today's digital age, a large amount of important tabular data remains in unstructured formats, such as statistical table images in scanned documents and financial reports in PDF files. This data cannot be directly processed automatically, making table recognition technology crucial for intelligent document understanding and data analysis. However, traditional general-purpose table recognition models often perform poorly when faced with complex table formats, failing to meet the needs of various applications. To address this, the Baidu PaddlePaddle team introduced PP-TableMagic, employing a multi-model serial network solution of "table classification + table structure recognition + cell detection," significantly improving the accuracy and adaptability of table recognition.

微信截图_20250312082522.png

The core advantage of PP-TableMagic lies in its innovative architecture. This solution uses a dual-stream architecture, classifying tables into structured and unstructured types. The end-to-end table recognition task is then broken down into two sub-tasks: cell detection and table structure recognition. Finally, a self-optimized result fusion algorithm generates the complete HTML table prediction result. Among them, PP-LCNet_x1_0_table_cls, a lightweight table classification model independently developed by the PaddlePaddle team, can accurately classify structured and unstructured tables; RT-DETR-L_table_cell_det, the industry's first open-source table cell detection model, achieves precise localization of various types of table cells; and the latest table structure recognition model, SLANeXt, excels in HTML structure parsing. Compared to its predecessors, SLANet and SLANet_plus, SLANeXt uses Vary-ViT-B, a visual encoder with stronger feature representation capabilities, further improving the accuracy of table structure recognition.

In practical applications, PP-TableMagic can directly process tables and, through customized model fine-tuning, meet the needs of different scenarios. Compared to fine-tuning traditional end-to-end table recognition models, PP-TableMagic's multi-model network architecture allows users to fine-tune only key models, avoiding performance trade-offs and reducing the workload of data annotation. Furthermore, for experienced developers, PP-TableMagic's architecture supports branch-level adjustments, allowing optimization for specific types of tabular data and further improving overall recognition capabilities.

To help users get started quickly, PP-TableMagic provides detailed installation guides and tutorials. Users can easily call the model using the Python API provided by PaddleX for table recognition and result export. Additionally, PP-TableMagic supports high-performance inference, service deployment, and edge deployment to meet the needs of various users. The Baidu PaddlePaddle team also plans to hold an online course on March 13th to provide an in-depth analysis of the technical details of PP-TableMagic and to host an industry scenario workshop, guiding users through the complete development process from data preparation to model deployment.

Open-source address: https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-rc/docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition_v2.md

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team