In artificial intelligence, post-training techniques are becoming an important means of enhancing model performance. Recently, the Allen Institute for Artificial Intelligence (AI2) released the Tülu 3 series, a family of fully open-source advanced language models that compete with closed-source models such as GPT-4o-mini. The Tülu 3 release includes not only model weights, training data, code, and training recipes but also an evaluation framework, with the aim of advancing post-training techniques for open-source models.
Models that have only undergone pre-training often fail to meet real-world application needs: they can generate toxic or harmful content and struggle to follow human instructions. Post-training phases such as instruction fine-tuning and learning from human feedback are therefore essential. Optimizing the post-training process remains a technical challenge, however, especially because improving one capability of the model can degrade others.
To tackle this challenge, major labs have made their post-training methods increasingly complex, attempting multiple rounds of training and combining human and synthetic data, but most of these methods remain closed. In contrast, the release of the Tülu 3 series narrows the performance gap between open-source and closed-source models and introduces a new training approach.
The training process of Tülu 3 is divided into four stages: data construction, supervised fine-tuning, preference tuning, and reinforcement learning with verifiable rewards (RLVR).
First, the researchers identify a set of core skills for the model and construct training data for them from a mix of human-written and synthetic sources.
Second, supervised fine-tuning brings the model's performance on those skills at least to the level of other advanced models.
Third, Direct Preference Optimization (DPO) further improves the model's overall performance. Finally, an innovative stage, reinforcement learning with verifiable rewards, trains the model on tasks whose outcomes can be checked automatically, such as math problems with known answers.
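To make the preference-tuning stage concrete, here is a minimal sketch of the DPO objective for a single preference pair. The function name, argument layout, and the default β value are illustrative assumptions for exposition, not taken from the Tülu 3 recipe; real implementations compute these log-probabilities over whole sequences in batches.

```python
import math

def dpo_loss(logp_chosen: float, logp_rejected: float,
             ref_logp_chosen: float, ref_logp_rejected: float,
             beta: float = 0.1) -> float:
    """DPO loss for one (chosen, rejected) preference pair.

    The policy is pushed to increase the log-probability margin between
    the chosen and rejected responses, measured relative to a frozen
    reference model so the policy does not drift too far from it.
    """
    chosen_ratio = logp_chosen - ref_logp_chosen
    rejected_ratio = logp_rejected - ref_logp_rejected
    margin = beta * (chosen_ratio - rejected_ratio)
    # Loss is -log(sigmoid(margin)); softplus(-margin) is the
    # numerically stable way to write the same quantity.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
```

When the policy matches the reference model, both ratios are zero and the loss is log 2; widening the margin in favor of the chosen response drives the loss toward zero.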
Tülu 3 is built on Llama 3.1 and performs strongly in areas such as reasoning, mathematics, programming, and instruction following. Compared with other open-source and closed-source models, it shows strong overall capability across multiple benchmarks, marking a significant advance in open-source post-training technology.
Paper link: https://allenai.org/papers/tulu-3-report.pdf
Demo: https://playground.allenai.org/
Key Points:
🌟 Tülu 3 is an open-source language model from AI2 that performs comparably to closed-source models like GPT-4o-mini.
🔧 Post-training techniques are crucial for making models effective in real-world applications.
📊 Tülu 3's training pipeline is innovative, divided into four stages: data construction, supervised fine-tuning, preference tuning, and reinforcement learning with verifiable rewards.