Recently, researchers from UC Berkeley introduced the Large World Model (LWM), which is on par with Google's Gemini 1.5 Pro in handling long videos and language sequences. LWM is trained using RingAttention technology, capable of processing extremely long texts and videos with excellent performance. Despite the buzz around models like Gemini 1.5 and Sora, there are still limitations that require further research and exploration.