STAR is an innovative video super-resolution technology that addresses the issue of over-smoothing found in traditional GAN methods by combining text-to-video diffusion models with video super-resolution. This technology not only recovers video details but also maintains temporal and spatial consistency, making it suitable for various real-world video scenarios. STAR was jointly developed by Nanjing University and ByteDance, boasting high academic value and application prospects.