360 Announces Full Integration of 360 Smart Brain Large Model into the 360 Ecosystem, Officially Open to the Public

ByteDance's Doubao large model team recently announced a breakthrough in addressing key bottlenecks in Mixture-of-Experts (MoE) architecture, open-sourcing a significant optimization technology called COMET. This technology dramatically improves large model training efficiency, achieving a remarkable 1.7x speedup and a 40% reduction in training costs. Image Note: Image generated by AI, image licensing provider Midjourney. COMET has been deployed in ByteDance's multi-thousand-GPU cluster training, resulting in millions of GPU hours saved.
On March 10th, Zhiyuan Robotics officially launched its first general-purpose embodied base large model – Genie Operator-1 (GO-1). This announcement has garnered significant attention, particularly regarding its potential in home service robots and the new hope it offers for future household management. According to Zhiyuan Robotics' official introduction, the GO-1 large model, trained on a vast amount of human video data, demonstrates excellent performance in executing various household tasks such as delivering cups, preparing meals, and greeting guests. In terms of technical performance,
Recently, the Modelers community officially launched Step-Video and Step-Audio, two open-source multimodal large models developed by Step-Star. These models are designed for video generation and voice interaction, respectively, aiming to provide developers and enterprise users with more powerful AI tools. Step-Video, formally known as Step-Video-T2V, is a 30-billion parameter model, making it the world's largest open-source video generation model. This model can directly generate 20...
Reports indicate that the National Supercomputing Internet Platform has integrated Alibaba's Qwen large language model, officially providing the QwQ-32B API service. Users can access up to 1 million tokens free of charge, offering a valuable opportunity for developers and researchers. QwQ-32B is the latest open-source inference model from Alibaba's Tongyi team, demonstrating excellent performance. According to multiple authoritative benchmark tests, QwQ-32B's capabilities rival those of a fully-fledged 671B model.