Researchers at the University of California, Berkeley have open-sourced the Large World Model (LWM), which can process contexts of up to 1 million tokens at once and can generate videos and images from text. The model uses Ring Attention to overcome the memory bottleneck of long-sequence attention computation, enabling efficient processing of multimodal information. It was trained in two stages: language-model pre-training followed by multimodal pre-training.
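The core idea behind Ring Attention is to split the sequence into blocks held on different devices and rotate key/value blocks around a ring, so each device only ever materializes one block of the attention matrix at a time. The sketch below is a hypothetical single-process simulation of that blockwise scheme (it is not the LWM codebase): partial results from each key/value block are combined with an online softmax, and the result matches ordinary full attention.

```python
import numpy as np

def full_attention(q, k, v):
    """Standard softmax attention, for reference."""
    s = q @ k.T / np.sqrt(q.shape[-1])
    p = np.exp(s - s.max(axis=-1, keepdims=True))
    p /= p.sum(axis=-1, keepdims=True)
    return p @ v

def ring_attention(q, k, v, n_blocks):
    """Blockwise attention: each query block consumes key/value
    blocks one at a time (as if passed around a ring) and merges
    partial results with an online softmax, so only one score
    block exists at any moment."""
    d = q.shape[-1]
    k_blocks = np.split(k, n_blocks)
    v_blocks = np.split(v, n_blocks)
    outs = []
    for qb in np.split(q, n_blocks):
        m = np.full(qb.shape[0], -np.inf)  # running row max
        l = np.zeros(qb.shape[0])          # running normalizer
        acc = np.zeros_like(qb)            # running weighted sum
        for kb, vb in zip(k_blocks, v_blocks):  # one ring rotation
            s = qb @ kb.T / np.sqrt(d)
            m_new = np.maximum(m, s.max(axis=-1))
            scale = np.exp(m - m_new)      # rescale old partials
            p = np.exp(s - m_new[:, None])
            l = l * scale + p.sum(axis=-1)
            acc = acc * scale[:, None] + p @ vb
            m = m_new
        outs.append(acc / l[:, None])
    return np.concatenate(outs)

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((8, 4)) for _ in range(3))
assert np.allclose(ring_attention(q, k, v, 4), full_attention(q, k, v))
```

Because each device only holds one block of scores at a time, peak memory no longer grows with the square of the full sequence length, which is what makes million-token contexts feasible.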