In the rapidly evolving field of generative AI, the Nous Research team is running a unique experiment: pre-training a 1.5-billion-parameter large language model (LLM) on machines distributed around the globe, avoiding the centralized training runs typically conducted in expensive, power-hungry data centers or superclusters.
Nous Research is also live-streaming the pre-training run on its dedicated website, distro.nousresearch.com, showing the model's performance on various evaluation benchmarks in real time alongside a map of the participating hardware, which spans multiple sites across the United States and Europe. As of this article's publication, approximately 57 hours (roughly 2.4 days) of pre-training remain, with over 75% of the run already completed.
Pre-training is the first and most fundamental step in building an LLM: the model is trained on a vast amount of text to learn the statistical properties and structure of language. During this phase, it captures patterns, grammar, and contextual relationships between words by processing a wide-ranging text dataset. This equips the model with a broad understanding of language, enabling it to generate coherent text and perform various language-related tasks. After pre-training, the model still needs to be fine-tuned for specific tasks or domains.
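At its core, this phase is next-token prediction: the model repeatedly predicts each token from the tokens before it and is penalized with a cross-entropy loss. The snippet below is a minimal, illustrative sketch of one such training step in PyTorch; the tiny model, random token IDs, and hyperparameters are assumptions for demonstration only, not Nous Research's actual setup.

```python
# Illustrative sketch of one pre-training step (next-token prediction).
# The model, data, and sizes are toy stand-ins, not Nous Research's stack.
import torch
import torch.nn as nn

vocab_size, seq_len, batch_size, d_model = 1000, 32, 4, 64

class TinyLM(nn.Module):
    """A tiny causal language model: embedding -> one transformer layer -> vocab logits."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.block = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):
        # Causal mask so each position can only attend to earlier positions.
        length = tokens.size(1)
        causal_mask = torch.triu(torch.full((length, length), float("-inf")), diagonal=1)
        hidden = self.block(self.embed(tokens), src_mask=causal_mask)
        return self.head(hidden)

model = TinyLM()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

# Random token IDs stand in for a text corpus; the target is the input shifted by one.
tokens = torch.randint(0, vocab_size, (batch_size, seq_len))
logits = model(tokens[:, :-1])
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), tokens[:, 1:].reshape(-1)
)

optimizer.zero_grad()
loss.backward()
optimizer.step()
```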
If the plan succeeds, Nous Research will have demonstrated that cutting-edge LLMs can be trained without expensive superclusters or low-latency interconnects, marking a new era in distributed AI training. This open-source approach to training could reshape the power dynamics of generative AI, making small teams and non-corporate actors more competitive in the field.
The new technology Nous is using is called Nous DisTrO (Distributed Training Over-the-Internet), designed to reduce the communication bandwidth required between GPUs during pre-training. According to Nous Research's latest release, DisTrO can cut communication requirements by up to 10,000 times while maintaining competitive convergence rates and loss curves, even over slower, more affordable internet connections.
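For a sense of scale, consider what a 10,000-fold reduction would mean for a model of this size. The back-of-the-envelope arithmetic below assumes full fp32 gradients and one complete synchronization per step; both are simplifying assumptions of ours, not figures published by Nous Research.

```python
# Back-of-the-envelope estimate (assumptions: fp32 gradients, one full sync per step).
params = 1.5e9                       # 1.5 billion parameters
naive_bytes = params * 4             # ~6 GB exchanged per step with a full fp32 gradient sync
distro_bytes = naive_bytes / 10_000  # ~0.6 MB per step at the claimed 10,000x reduction

print(f"full gradient sync: {naive_bytes / 1e9:.1f} GB per step")
print(f"at 10,000x less:    {distro_bytes / 1e6:.1f} MB per step")
```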
The core breakthrough of DisTrO lies in compressing the data exchanged between GPUs without compromising the model's performance. The technology builds on Nous's earlier Decoupled Momentum Optimization (DeMo) work, which likewise aimed to significantly reduce inter-GPU communication while maintaining training performance.
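One generic way to shrink inter-GPU traffic is to transmit only a small, carefully chosen fraction of each update, as in the top-k sparsification sketch below. This illustrates the general compression idea only; it is not DeMo's or DisTrO's actual algorithm, and the tensor size and 0.1% keep-ratio are arbitrary assumptions.

```python
# Generic top-k sparsification sketch: send only the largest-magnitude entries.
# Illustrative only -- not the DeMo/DisTrO algorithm.
import math
import torch

def compress_topk(tensor: torch.Tensor, ratio: float = 1e-3):
    """Keep only the largest-magnitude entries; return their indices and values."""
    flat = tensor.flatten()
    k = max(1, int(flat.numel() * ratio))
    _, indices = torch.topk(flat.abs(), k)
    return indices, flat[indices]

def decompress_topk(indices, values, shape):
    """Rebuild a dense tensor from the sparse (indices, values) payload."""
    flat = torch.zeros(math.prod(shape), dtype=values.dtype)
    flat[indices] = values
    return flat.reshape(shape)

grad = torch.randn(1_000_000)                    # stand-in for one worker's gradient/momentum shard
idx, vals = compress_topk(grad)                  # keep ~0.1% of the entries
approx = decompress_topk(idx, vals, grad.shape)  # what the receiving side would reconstruct

dense_bytes = grad.numel() * grad.element_size()
sparse_bytes = idx.numel() * idx.element_size() + vals.numel() * vals.element_size()
print(f"dense payload: {dense_bytes:,} bytes | sparse payload: {sparse_bytes:,} bytes")
```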
On the hardware side, the pre-training run is supported by several well-known partners, including Oracle, Lambda Labs, Northern Data Group, Crusoe Cloud, and Andromeda Cluster, which collectively provide the heterogeneous hardware needed to test DisTrO's capabilities in a real distributed environment.
Blog entry: https://nousresearch.com/
Highlights:
🌐 Nous Research is running a globally distributed AI training effort to pre-train a 1.5-billion-parameter large language model.
💻 Nous DisTrO technology dramatically reduces inter-GPU communication bandwidth requirements, making low-cost training feasible.
🤝 The project is backed by multiple hardware partners, advancing distributed AI research.