PaddleOCR Releases v2.8.0 with New Table Recognition Algorithms

AIbase

Published inAI News · 3 min read · Jul 12, 2024

642

PaddleOCR v2.8.0, as a milestone update of the text recognition development kit under the PaddlePaddle deep learning open-source framework, introduces cutting-edge OCR technology. This version includes the winning solutions from the PaddleOCR Algorithm Model Challenge, such as the Scene Text Recognition algorithm SVTRv2 and the Table Recognition algorithm SLANet-LCNetV2, setting new standards for the OCR field.

At the same time, the project structure has been deeply optimized, with non-core modules migrated to new repositories, allowing the project to focus more on OCR core technology. In addition, historical difficult problems such as the model not running after updating Backbone, numpy version dependency conflicts, and slow performance on the Mac system have been resolved, enhancing user experience.

WeChat Screenshot_20240712084427.png

The new version also includes fixes for issues such as the loss of OCR results in layout analysis, the introduction of pyproject.toml to comply with PEP518 standards, and optimization improvements such as the sliding window operation for large image inference, enhancing the software's stability, compatibility, and performance. The support and contributions from the open-source community are crucial for every progress of PaddleOCR v2.8.0, and the efforts of PMC members and contributors are particularly thanked.

PaddleOCR is building a dedicated documentation tutorial site that will provide keyword search functionality and an elegant and comfortable interface.

Project Address: https://github.com/PaddlePaddle/PaddleOCR

1. OCR技术2. PaddleOCR算法模型挑战赛3. 场景文本识别算法SVTRv24. 表格识别算法SLANet-LCNetV2

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

1 Billion Investment! Zhipu AI Receives Support from Pudong Zhangjiang, GLM-4.1V Makes a Major Open Source Release, AGI Development Speeds Up

At the recent Zhipu Open Platform Industrial Ecosystem Conference held in Shanghai, a major development emerged in the field of artificial intelligence: Pudong Venture Capital Group and Zhangjiang Group jointly announced a strategic investment of up to 1 billion yuan in Zhipu, with the first installment already completed. This significant investment will provide strong support for Zhipu in building a trusted artificial intelligence infrastructure and accelerate its layout in the field of General Artificial Intelligence (AGI). In his keynote speech at the conference, Zhipu CEO Zhang Peng elaborated on two latest achievements in the company's efforts to move toward AGI in collaboration with ecosystem partners.

Jul 2, 2025

100

Chai-2 Makes a Shocking Debut: AI-Powered Zero-Shot Antibody Design, Accelerating Drug Development by Hundreds of Times

Artificial intelligence once again stirs up the field of drug development! Chai Discovery recently launched a new AI model called Chai-2, which has drawn widespread attention with its breakthrough technology in molecular design. Chai-2 achieves zero-shot antibody design with a success rate of 16%-20%, hundreds of times higher than traditional methods, shortening the drug development cycle from months or even years to just two weeks. Zero-shot antibody design breaks through traditional bottlenecks. Chai-2 is a multi-modal generative AI model developed by Chai Discovery, specifically designed for...

Jul 1, 2025

460

Chai Discovery Launches Chai-2 Model: Zero-shot Antibody Design Achieves 16-20% Hit Rate

Jul 1, 2025

730

New Open Source AI System OmniGen 2: Integrates Image and Text Generation Like GPT-4o

Jun 30, 2025

250

Alibaba Ovis-U1 Launches with a Bang: A Multi-Modal AI All-in-One, Open Source Empowers Global Developers

On June 29, 2025, the Alibaba International AI Team officially released the new multi-modal large model **Ovis-U1**, marking another major breakthrough in the field of multi-modal artificial intelligence. As the latest masterpiece of the Ovis series, Ovis-U1 integrates multi-modal understanding, image generation, and image editing functions, demonstrating powerful cross-modal processing capabilities, providing new possibilities for developers, researchers, and industry applications. This is a detailed report on Ovis-U1 by AIbase. Ovis-U1

Jun 30, 2025

1.1k

OpenAI announces that the 2025 Developer Conference will be held in San Francisco, expected to attract more than 1,500 developers

OpenAI has officially announced the date and location of its next developer conference (DevDay), which will be held on October 6, 2025, in San Francisco. This conference is expected to attract more than 1,500 developers and is anticipated to be the largest developer event to date. The agenda for this DevDay will be rich and diverse, featuring multiple important sessions. The conference will include live-streamed keynote speeches, during which OpenAI will share its latest developments and future vision in the field of artificial intelligence. In addition, participants will also be able to

Jun 27, 2025

310

OpenAI Releases New Model for Deep Research API: o3/o4-mini-deep research

Jun 27, 2025

1.1k

ElevenLabs Launches Voice Design v3 - Generate Any Sound You Want with Just One Sentence

Jun 27, 2025

230

Breaking News! Google Opensources Gemma3n Multimodal Model, AI Performance Can Run on Phones as if it Were in the Cloud

Jun 27, 2025

320

Black Forest Shocks Open Source FLUX.1 Kontext [dev]: Image Editing Comparable to GPT-4o

Black Forest Labs officially announced that its new image editing model FLUX.1Kontext [dev] is now open source, drawing widespread attention from the AI community. As the latest member of the FLUX.1 series, this model is praised as an open-source alternative comparable to GPT-4o, thanks to its powerful image editing capabilities and efficient performance. FLUX.1Kontext [dev] is based on a 1.2 billion parameter flow matching transformer architecture, specifically designed for image editing tasks, and supports consumer-grade hardware.

Jun 27, 2025

190

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

PaddleOCR Releases v2.8.0 with New Table Recognition Algorithms

AIbase

This article is from AIbase Daily

AI News Recommendations

1 Billion Investment! Zhipu AI Receives Support from Pudong Zhangjiang, GLM-4.1V Makes a Major Open Source Release, AGI Development Speeds Up

Chai-2 Makes a Shocking Debut: AI-Powered Zero-Shot Antibody Design, Accelerating Drug Development by Hundreds of Times

Chai Discovery Launches Chai-2 Model: Zero-shot Antibody Design Achieves 16-20% Hit Rate

New Open Source AI System OmniGen 2: Integrates Image and Text Generation Like GPT-4o

Alibaba Ovis-U1 Launches with a Bang: A Multi-Modal AI All-in-One, Open Source Empowers Global Developers

OpenAI announces that the 2025 Developer Conference will be held in San Francisco, expected to attract more than 1,500 developers

OpenAI Releases New Model for Deep Research API: o3/o4-mini-deep research

ElevenLabs Launches Voice Design v3 - Generate Any Sound You Want with Just One Sentence

Breaking News! Google Opensources Gemma3n Multimodal Model, AI Performance Can Run on Phones as if it Were in the Cloud

Black Forest Shocks Open Source FLUX.1 Kontext [dev]: Image Editing Comparable to GPT-4o