Large language models (LLMs) such as GPT and Llama have revolutionized artificial intelligence, but efficiently training these massive models while aligning them with human values remains a challenge.

Reinforcement Learning from Human Feedback (RLHF) has become a widely adopted method for training and aligning LLMs in recent years, but traditional RLHF frameworks face limitations in flexibility, efficiency, and scalability.


To address these issues, ByteDance's Doubao Large Model team has open-sourced a new RLHF framework called HybridFlow, which introduces new possibilities for LLM training.

RLHF typically involves three stages:

First, the actor model generates text based on the input prompt; then, the critic model, reference model, and reward model evaluate the generated text and compute the corresponding value estimates, reference probabilities, and reward scores;


Finally, these evaluation results are used to train the actor model to produce text that aligns more closely with human preferences. Traditional RLHF frameworks often use a single controller to manage the entire data flow, which is inefficient for LLMs requiring distributed computing.
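
To make this data flow concrete, here is a minimal, framework-agnostic Python sketch of one PPO-style RLHF step. The model classes and scalar scores below are toy stand-ins, not HybridFlow code; a real system operates on batched token sequences distributed across many devices.

```python
import random

# Toy stand-ins for the four RLHF roles; real systems use large neural networks
# and batched, distributed computation. All names here are illustrative only.
class ActorModel:
    def generate(self, prompt: str) -> str:
        return prompt + " -> generated response"   # stand-in for autoregressive sampling

    def log_prob(self, prompt: str, response: str) -> float:
        return random.uniform(-2.0, 0.0)           # log-prob under the current policy

    def update(self, prompt: str, response: str, advantage: float) -> None:
        # Stand-in for a policy-gradient (e.g., PPO) update step.
        print(f"updating actor with advantage {advantage:+.3f}")

class CriticModel:
    def value(self, prompt: str, response: str) -> float:
        return random.uniform(0.0, 1.0)            # estimated value (baseline)

class ReferenceModel:
    def log_prob(self, prompt: str, response: str) -> float:
        return random.uniform(-2.0, 0.0)           # log-prob under the frozen reference policy

class RewardModel:
    def score(self, prompt: str, response: str) -> float:
        return random.uniform(0.0, 1.0)            # human-preference reward

def rlhf_step(prompt, actor, critic, reference, reward_model, kl_coef=0.1):
    # Stage 1: the actor generates a response for the prompt.
    response = actor.generate(prompt)

    # Stage 2: critic, reference, and reward models evaluate the response.
    value = critic.value(prompt, response)
    kl_penalty = kl_coef * (actor.log_prob(prompt, response)
                            - reference.log_prob(prompt, response))
    reward = reward_model.score(prompt, response)

    # Stage 3: the evaluation results are combined into an advantage signal
    # (KL-penalized reward minus the critic's baseline) that trains the actor.
    advantage = (reward - kl_penalty) - value
    actor.update(prompt, response, advantage)

if __name__ == "__main__":
    rlhf_step("Explain RLHF in one sentence.",
              ActorModel(), CriticModel(), ReferenceModel(), RewardModel())
```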

The HybridFlow framework innovatively combines single-controller and multi-controller modes and decouples complex computation and data dependencies through a hierarchical API design, enabling flexible representation and efficient execution of the RLHF data flow.
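
This hybrid programming model can be pictured roughly as follows: a single-controller script expresses the high-level data flow, while each logical call fans out to a group of workers that execute the same operation in SPMD fashion, as a multi-controller engine would. The `Worker` and `WorkerGroup` names in this sketch are hypothetical illustrations, not the framework's actual API.

```python
class Worker:
    """One SPMD worker; in a real system it holds a shard of a model on one GPU."""
    def __init__(self, rank: int):
        self.rank = rank

    def run(self, op: str, shard):
        # Every worker in a group runs the same operation on its own data shard.
        return f"[rank {self.rank}] {op}({shard})"

class WorkerGroup:
    """Single-controller handle over a set of SPMD workers for one model role."""
    def __init__(self, role: str, world_size: int):
        self.role = role
        self.workers = [Worker(r) for r in range(world_size)]

    def call(self, op: str, batch):
        # Scatter the batch, run the op on every worker, gather the results.
        n = len(self.workers)
        shards = [batch[r::n] for r in range(n)]
        return [w.run(op, shard) for w, shard in zip(self.workers, shards)]

if __name__ == "__main__":
    # The controller script reads like single-process code, but each line
    # dispatches a distributed computation to one model's worker group.
    actor = WorkerGroup("actor", world_size=2)
    critic = WorkerGroup("critic", world_size=2)

    prompts = ["p0", "p1", "p2", "p3"]
    responses = actor.call("generate", prompts)
    values = critic.call("compute_values", responses)
    print(values)
```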


The advantages of HybridFlow are primarily reflected in the following three aspects:

Flexible support for various RLHF algorithms and models: HybridFlow offers modular APIs, allowing users to easily implement and extend various RLHF algorithms, such as PPO, ReMax, and Safe-RLHF (a sketch of this idea follows the list below).

Efficient model weight reorganization: The 3D-HybridEngine component efficiently reshards the actor model's weights between the training and generation phases, minimizing memory redundancy and communication overhead (also sketched after this list).

Automated model deployment and parallel strategy selection: The Auto Mapping component automatically maps models to different devices based on model load and data dependencies, and selects the optimal parallel strategy, thereby simplifying the model deployment process and enhancing training efficiency.
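
As an illustration of the first point, the sketch below shows how different RLHF algorithms could reuse the same generation and update primitives while differing only in how the advantage is computed: a PPO-style step uses a learned critic as the baseline, whereas a ReMax-style step uses the reward of a greedy rollout instead and needs no critic. The helper functions are hypothetical stand-ins, not HybridFlow's actual API.

```python
import random

# Hypothetical stand-ins for distributed model calls; in the real framework
# these would be dispatched to worker groups as in the earlier sketch.
def generate(prompt, greedy=False):
    return prompt + (" greedy response" if greedy else " sampled response")

def reward(response):        return random.uniform(0.0, 1.0)
def critic_value(response):  return random.uniform(0.0, 1.0)
def update_actor(response, advantage):
    print(f"advantage {advantage:+.3f} for {response!r}")

def ppo_step(prompt):
    # PPO-style: a learned critic provides the baseline for the advantage.
    response = generate(prompt)
    advantage = reward(response) - critic_value(response)
    update_actor(response, advantage)

def remax_step(prompt):
    # ReMax-style: the reward of a greedy rollout replaces the critic baseline,
    # so no critic model needs to be trained or deployed.
    response = generate(prompt)
    baseline = reward(generate(prompt, greedy=True))
    advantage = reward(response) - baseline
    update_actor(response, advantage)

if __name__ == "__main__":
    ppo_step("Explain RLHF briefly.")
    remax_step("Explain RLHF briefly.")
```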
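
For the second point, the following toy NumPy example illustrates the general idea of resharding weights between a training layout and a generation layout by regrouping existing shards rather than materializing a full copy per device. It is a simplified illustration only; the actual 3D-HybridEngine reshards across full 3D parallel configurations with communication-minimizing collectives.

```python
import numpy as np

# Toy illustration (not the 3D-HybridEngine implementation): move a weight
# matrix from a 4-way model-parallel training layout to a 2-way layout used
# for generation, by regrouping shards instead of gathering the full weight.
full_weight = np.arange(8 * 4, dtype=np.float32).reshape(8, 4)

# Training phase: weight rows are split across 4 model-parallel ranks.
train_shards = np.split(full_weight, 4, axis=0)   # 4 shards of shape (2, 4)

# Generation phase: only 2-way parallelism is used, so each generation rank
# owns the concatenation of two training shards.
gen_shards = [np.concatenate(train_shards[2 * r: 2 * r + 2], axis=0)
              for r in range(2)]                   # 2 shards of shape (4, 4)

# The regrouped shards still reconstruct the original weight exactly.
assert np.array_equal(np.concatenate(gen_shards, axis=0), full_weight)
print([s.shape for s in gen_shards])
```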

Experimental results show that HybridFlow significantly improves throughput across various RLHF algorithms, by up to 20.57 times compared with baseline frameworks. The open-source release of HybridFlow provides a powerful tool for RLHF research and development, driving further advances in LLM technology.

Paper link: https://arxiv.org/pdf/2409.19256