Tencent AI Lab and the University of Sydney recently introduced GPT4Video, a framework that addresses the shortcomings of multimodal language models in video generation. It combines a video comprehension module, an LLM backbone, and a video generation module, and it uses safety-oriented fine-tuning to make the generated video safer. The team has also released a dataset intended to advance research on multimodal LLMs.