Recently, the research team at Nous Research brought exciting news to the tech community with the introduction of a new optimizer called DisTrO (Distributed Training Over-the-Internet). The arrival of this technology signals that powerful AI models are no longer the exclusive domain of large companies; ordinary individuals now have a realistic path to training them efficiently from home, on their own computers.

The magic of DisTrO lies in its ability to dramatically reduce the amount of information that must be transferred between graphics processing units (GPUs) during AI model training. This innovation allows powerful AI models to be trained under ordinary network conditions, letting individuals and institutions around the world collaborate on developing AI.


According to Nous Research's technical paper, DisTrO's efficiency gains are striking: it cuts inter-GPU communication by a factor of 857 compared with the common All-Reduce approach, reducing the data transferred in each training step from 74.4 GB to 86.8 MB. This improvement not only makes training faster and more affordable but also means more people have the opportunity to participate in the field.
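As a quick sanity check, the 857x figure follows directly from those two per-step numbers:

```python
# Back-of-the-envelope check of the reduction factor reported in the paper:
# 74.4 GB per step with All-Reduce vs. 86.8 MB per step with DisTrO.
all_reduce_bytes = 74.4e9   # ~74.4 GB transferred per training step
distro_bytes = 86.8e6       # ~86.8 MB transferred per training step

reduction = all_reduce_bytes / distro_bytes
print(f"Communication reduced by roughly {reduction:.0f}x")  # ~857x
```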

Nous Research stated on social media that with DisTrO, researchers and institutions no longer need to rely on a single company to manage and control the training process, giving them more freedom to innovate and experiment. This more open, competitive environment helps drive technological progress, ultimately benefiting society as a whole.

The hardware demands of AI training often deter would-be participants. High-performance Nvidia GPUs in particular have become increasingly scarce and expensive, leaving only well-funded companies able to shoulder the cost. Nous Research takes the opposite approach: it is committed to opening AI model training to the public at a far lower cost, so that more people can take part.

DisTrO works by reducing the need for full gradient synchronization between GPUs, cutting communication overhead by four to five orders of magnitude. As a result, AI models can be trained over much slower internet connections of the kind many households already have, around 100 Mbps download and 10 Mbps upload.
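Nous Research has not published the full details of how DisTrO shrinks these updates, but a rough sketch shows why avoiding full gradient exchange matters at household bandwidths. The model size, reduction factor, and link speed below are illustrative assumptions, not figures from the paper:

```python
# Illustrative only: why full gradient synchronization is impractical over a
# home connection, and what a 4-5 order-of-magnitude reduction buys you.
PARAMS = 1.2e9              # parameters in a hypothetical ~1.2B model
BYTES_PER_GRAD = 4          # fp32 gradient per parameter
UPLOAD_BPS = 10e6           # assumed 10 Mbps household uplink

full_sync_bytes = PARAMS * BYTES_PER_GRAD   # ~4.8 GB of gradients per step
reduced_bytes = full_sync_bytes / 1e4       # assume a 10,000x smaller update

def upload_seconds(num_bytes: float) -> float:
    """Time to push this many bytes over the uplink (bits / bits-per-second)."""
    return num_bytes * 8 / UPLOAD_BPS

print(f"Full gradient sync: {upload_seconds(full_sync_bytes)/60:.0f} min per step")
print(f"Reduced update:     {upload_seconds(reduced_bytes):.1f} s per step")
```

Under these assumptions, a full gradient exchange would take about an hour per step over a home uplink, while a ten-thousand-fold smaller update takes well under a second, which is what makes internet-scale collaboration plausible.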

In preliminary tests on Meta's Llama 2 large language model, DisTrO delivered training performance comparable to traditional methods while sharply reducing the required communication. The researchers note that their tests so far have used relatively small models, and they speculate that as model sizes grow, the reduction in communication could become even larger, possibly reaching 1,000 to 3,000 times.

It is worth noting that while DisTrO makes training more flexible, it still relies on GPUs; they simply no longer need to sit in the same location and can instead be dispersed worldwide, collaborating over ordinary internet connections. In rigorous tests with 32 H100 GPUs, DisTrO matched the convergence speed of the traditional AdamW + All-Reduce method while dramatically reducing communication.
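For context, the AdamW + All-Reduce setup used as the baseline is standard data-parallel training, in which every GPU exchanges its full gradients on every step. A minimal PyTorch-style sketch of that baseline (the toy model, dimensions, and launch details are placeholders, not anything from the paper) looks like this:

```python
# Minimal sketch of the conventional baseline the paper compares against:
# data-parallel training with AdamW, where full gradients are all-reduced
# across workers every step. Assumes MASTER_ADDR/MASTER_PORT are set,
# e.g. when launched via torchrun.
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def train_step(rank: int, world_size: int):
    dist.init_process_group("nccl", rank=rank, world_size=world_size)
    model = torch.nn.Linear(4096, 4096).cuda(rank)           # toy placeholder model
    ddp_model = DDP(model, device_ids=[rank])                 # all-reduces grads in backward()
    opt = torch.optim.AdamW(ddp_model.parameters(), lr=1e-4)

    x = torch.randn(8, 4096, device=f"cuda:{rank}")           # dummy batch
    loss = ddp_model(x).pow(2).mean()
    loss.backward()        # <- full gradient exchange (All-Reduce) happens here
    opt.step()
    opt.zero_grad()
    dist.destroy_process_group()
```

DisTrO's contribution, per the report, is precisely to avoid that per-step full-gradient exchange rather than to change the optimizer's convergence behaviour.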

DisTrO is not only suitable for large language models but also has the potential to be used for training other types of AI, such as image generation models, with future applications looking promising. Additionally, by enhancing training efficiency, DisTrO could reduce the environmental impact of AI training by optimizing the use of existing infrastructure and reducing the need for large data centers.

Through DisTrO, Nous Research is not only advancing the technological progress of AI training but also fostering a more open and flexible research ecosystem, opening up infinite possibilities for future AI development.

Reference: https://venturebeat.com/ai/this-could-change-everything-nous-research-unveils-new-tool-to-train-powerful-ai-models-with-10000x-efficiency/