ACT 1 (Advanced Cinematic Transformer) is a text-to-video synthesis system developed by Hotshot Research. It generates high-definition, watermark-free videos in a range of aspect ratios. The system is trained on a large, high-resolution text-video dataset, with a focus on high-fidelity spatial alignment, temporal alignment, and aesthetic quality.