Nvidia has recently open-sourced two new models: Nemotron-4-Minitron-4B and Nemotron-4-Minitron-8B. The release is more than a technical milestone; it points to a shift toward far more efficient training of large AI models.

Training large AI models traditionally requires vast amounts of data and compute. Nvidia has sharply reduced that demand with efficient training methods, namely structured pruning and knowledge distillation. Compared to training from scratch, the new models require up to 40 times fewer training tokens and cut compute costs by a factor of 1.8. The result builds on Nvidia's deep optimization of the existing Llama 3.1 8B model.


Structured pruning is a neural network compression technique that shrinks a model by removing its least important components. Unlike unstructured pruning, which zeroes out individual weights, structured pruning removes entire neurons, attention heads, or layers, so the remaining weight matrices stay dense and regular and the pruned model runs efficiently on hardware such as GPUs and TPUs.
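To make the idea concrete, here is a minimal sketch of structured pruning on a single feed-forward block, using a simple activation-magnitude importance score. The `MLP` class, the importance criterion, and the calibration batch are illustrative assumptions, not Nvidia's actual pruning recipe.

```python
import torch
import torch.nn as nn

class MLP(nn.Module):
    """A toy feed-forward block whose hidden width we will prune."""
    def __init__(self, d_model=512, d_ff=2048):
        super().__init__()
        self.up = nn.Linear(d_model, d_ff)
        self.down = nn.Linear(d_ff, d_model)

    def forward(self, x):
        return self.down(torch.relu(self.up(x)))

def prune_mlp_neurons(mlp: MLP, calib_batch: torch.Tensor, keep: int) -> MLP:
    """Remove whole hidden neurons (rows of `up`, matching columns of `down`),
    keeping the `keep` neurons with the largest mean activation on a calibration batch.
    (The importance score here is an assumption for illustration.)"""
    with torch.no_grad():
        acts = torch.relu(mlp.up(calib_batch))            # (batch, d_ff)
        importance = acts.abs().mean(dim=0)               # one score per hidden neuron
        idx = importance.topk(keep).indices.sort().values # neurons to keep, in order

        pruned = MLP(mlp.up.in_features, keep)
        pruned.up.weight.copy_(mlp.up.weight[idx])        # keep selected rows
        pruned.up.bias.copy_(mlp.up.bias[idx])
        pruned.down.weight.copy_(mlp.down.weight[:, idx]) # keep matching columns
        pruned.down.bias.copy_(mlp.down.bias)
    return pruned

# Usage: shrink the hidden width from 2048 to 1024 with a small calibration batch.
mlp = MLP()
calib = torch.randn(64, 512)
smaller = prune_mlp_neurons(mlp, calib, keep=1024)
```

Because entire rows and columns are removed rather than scattered weights, the pruned layer is just a smaller dense matrix multiply, which is why this style of pruning maps well onto GPU and TPU kernels.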

Knowledge distillation trains a smaller student model to mimic a larger teacher model. In Nvidia's practice, logit-based distillation has the student match the teacher's output distribution rather than only the hard training labels, which lets the student retain much of the teacher's capability even with far less training data.
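The following is a minimal sketch of logit-based distillation: the student is trained to minimize the KL divergence between its temperature-softened logits and the teacher's. The training-step wrapper and temperature value are illustrative assumptions, not Nvidia's actual distillation pipeline.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between temperature-softened teacher and student distributions."""
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # Scale by T^2 so gradient magnitudes stay comparable across temperatures.
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2

def distill_step(student, teacher, batch, optimizer):
    """One illustrative training step: teacher frozen, student updated."""
    with torch.no_grad():
        teacher_logits = teacher(batch)
    student_logits = student(batch)
    loss = distillation_loss(student_logits, teacher_logits)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In this setup the teacher's full probability distribution carries richer supervision than one-hot labels, which is one reason the student can reach strong performance on a much smaller token budget.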

Trained with structured pruning and knowledge distillation, the Minitron-4B and Minitron-8B models achieve MMLU scores up to 16% higher than comparable models trained from scratch, putting them on par with well-known models such as Mistral 7B, Gemma 7B, and Llama-3 8B. This result validates Nvidia's approach and opens new possibilities for training and deploying large AI models.

Nvidia's open-source initiative not only showcases its leadership in AI technology but also brings valuable resources to the AI community. As AI technology continues to advance, we look forward to seeing more innovative methods that drive AI towards greater efficiency and intelligence.

Model links:

https://huggingface.co/nvidia/Nemotron-4-Minitron-4B-Base

https://huggingface.co/nvidia/Nemotron-4-Minitron-8B-Base