Taotian Group, in collaboration with Aicheng Technology, has open-sourced Megatron-LLaMA, a framework for training large language models. The project aims to improve training performance, reduce training costs, and maintain compatibility with the LLaMA community. The framework achieves a 176% speedup when training on 32 GPUs and shows high tolerance for network instability. Going forward, Megatron-LLaMA will focus on adaptive selection of optimal configurations, support for modified model structures, and delivering top-performing training solutions across a range of hardware environments.
Taotian Group Collaborates with Aicheng Technology to Open Source the Megatron-LLaMA Large Model Training Framework

机器之心