In modern relational databases, cardinality estimation (CE) plays a critical role. Simply put, CE predicts how many rows a query, or an intermediate step of a query, will return. This prediction strongly influences the query optimizer's choice of execution plan: the join order, whether to use an index, and which join algorithm to pick. If the estimate is inaccurate, the optimizer may settle on a poor plan, leading to slow queries and severely degraded overall database performance.

However, existing cardinality estimation methods have clear limitations. Traditional CE techniques rely on simplifying assumptions, such as independence between columns and uniform value distributions, and therefore often mispredict the cardinality of complex queries, especially those involving multiple tables and predicates. Learning-based CE models can be more accurate, but their adoption has been held back by long training times, the need for large training datasets, and the lack of a systematic benchmark for evaluation.
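To see why these simplifying assumptions hurt, here is a minimal sketch of the textbook independence-assumption estimate (the table, predicates, and selectivities are invented for illustration, not taken from the paper):

```python
# Classic independence-assumption estimate (textbook approach,
# not CardBench code); all numbers are illustrative.

table_rows = 1_000_000   # |orders|
sel_country = 0.10       # selectivity of: country = 'DE'
sel_currency = 0.10      # selectivity of: currency = 'EUR'

# Traditional optimizers multiply per-predicate selectivities,
# assuming the columns are independent.
estimated = table_rows * sel_country * sel_currency   # 10,000 rows

# If the predicates are strongly correlated (German orders are almost
# always paid in EUR), the true count is closer to a single predicate's:
actual = table_rows * sel_country                     # ~100,000 rows

print(f"estimated={estimated:,.0f}  actual~{actual:,.0f}  off by {actual / estimated:.0f}x")
```

Correlated predicates like these are exactly where the multiplication breaks down, and where learned models promise to do better.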

To fill this gap, Google's research team has introduced CardBench, a new benchmark. CardBench covers 20 real-world databases and thousands of queries, far surpassing previous benchmarks, and allows researchers to systematically evaluate and compare learning-based CE models under a wide range of conditions. The benchmark supports three main settings, instance-based models, zero-shot models, and fine-tuned models, covering different training regimes.

CardBench also ships a set of tools that compute the required data statistics, generate realistic SQL queries, and create annotated query graphs for training CE models.
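The paper describes these graphs at a high level; as a rough sketch, an annotated query graph bundles the query's structure with data statistics and the true cardinality as the training label. All field names below are hypothetical, not CardBench's actual schema:

```python
from dataclasses import dataclass

@dataclass
class ColumnStats:
    """Per-column statistics attached to graph nodes (hypothetical fields)."""
    num_distinct: int     # number of distinct values
    null_fraction: float  # fraction of NULL rows

@dataclass
class QueryGraph:
    """Simplified annotated query graph; one training example for a CE model."""
    tables: list[str]                        # tables referenced by the query
    joins: list[tuple[str, str]]             # equi-join column pairs
    predicates: list[tuple[str, str, str]]   # (column, operator, literal)
    stats: dict[str, ColumnStats]            # statistics per referenced column
    cardinality: int                         # label: true result row count

example = QueryGraph(
    tables=["sales", "product"],
    joins=[("sales.product_id", "product.id")],
    predicates=[("product.category", "=", "'Books'")],
    stats={"product.category": ColumnStats(num_distinct=23, null_fraction=0.01)},
    cardinality=57_312,  # illustrative label obtained by executing the query
)
```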

The benchmark provides two sets of training data: one for single-table queries with multiple filter predicates, and one for binary join queries over two tables. In total it includes 9,125 single-table queries and 8,454 binary join queries, a robust and challenging environment for model evaluation. Generating the training labels on Google BigQuery took 7 CPU-years of query execution time, which highlights the computational investment behind the benchmark. By providing these datasets and tools, CardBench lowers the barrier for researchers developing and testing new CE models.
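As a hedged illustration of the two workload shapes (the schema and literals below are invented, not drawn from the benchmark's databases):

```python
# Invented examples of the two query shapes in the training data;
# table and column names are not from CardBench's actual databases.

single_table_query = """
SELECT COUNT(*)
FROM sales
WHERE quantity > 10
  AND unit_price < 50.0
  AND store_id = 7
"""

binary_join_query = """
SELECT COUNT(*)
FROM sales AS s
JOIN product AS p ON s.product_id = p.id
WHERE p.category = 'Books'
"""
```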

In performance evaluations using CardBench, fine-tuned models stand out. While zero-shot models struggle to stay accurate on unseen datasets, especially for complex queries involving joins, fine-tuned models achieve accuracy comparable to instance-based methods with far less training data. For example, a fine-tuned graph neural network (GNN) model achieved a median q-error of 1.32 and a 95th-percentile q-error of 120 on binary join queries, significantly outperforming zero-shot models. The results also show that fine-tuning a pre-trained model on as few as 500 queries already improves its performance substantially, making fine-tuned models attractive for practical settings where training data is scarce.
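For readers unfamiliar with the metric: q-error is the standard accuracy measure in CE research, the multiplicative factor by which an estimate deviates from the true cardinality in either direction. A minimal sketch:

```python
def q_error(estimated: float, actual: float) -> float:
    """Q-error: how far off the estimate is as a multiplicative factor
    (always >= 1; 1.0 means a perfect estimate)."""
    est = max(estimated, 1.0)  # common convention: clamp to avoid division by zero
    act = max(actual, 1.0)
    return max(est / act, act / est)

# A median q-error of 1.32 means half the estimates were within
# a factor of 1.32 of the true cardinality.
print(q_error(1_320, 1_000))  # 1.32
```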

The introduction of CardBench brings new hope to the field of learning-based cardinality estimation, enabling researchers to more effectively evaluate and improve models, thereby advancing this important area.

Paper link: https://arxiv.org/abs/2408.16170

Key points:

- 📊 CardBench is a new benchmark that includes 20 real-world databases and thousands of queries, supporting systematic evaluation of learning-based cardinality estimation models.

- 🛠️ The benchmark provides tools for calculating data statistics, generating SQL queries, and creating query graphs, lowering the development barrier for researchers.

- 🚀 Fine-tuned models performed exceptionally well in the evaluations, achieving accuracy comparable to instance-based models with far less training data, demonstrating their potential for practical applications.