InstructVideo

A method for fine-tuning text-to-video diffusion models with human feedback.

Chinese Selection, Video, Text-to-video, Diffusion models
InstructVideo is a method for fine-tuning text-to-video diffusion models with rewards guided by human feedback. It uses an editing-based approach to reward fine-tuning, which lowers fine-tuning cost and improves efficiency. It reuses established image reward models, obtaining reward signals for a video through segment-wise sparse sampling of frames and temporally decaying rewards. This improves the visual quality of generated videos while preserving the model's generalization capabilities. For more information, please visit the official website.
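As a rough illustration of how segment-wise sparse sampling and temporally decaying rewards could combine, here is a minimal Python sketch. It is not the official implementation: the function names, the choice of the middle frame of each segment, the exponential decay around the clip's centre, and the `image_reward` callable (a stand-in for any image reward model that returns a score per frame) are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence


def sample_segment_indices(num_frames: int, num_segments: int) -> List[int]:
    """Split the clip into contiguous segments and pick the middle frame of each.

    Assumption: one frame per segment is enough to score the video sparsely.
    """
    if not 1 <= num_segments <= num_frames:
        raise ValueError("num_segments must be between 1 and num_frames")
    # Integer segment boundaries; every segment holds at least one frame.
    bounds = [i * num_frames // num_segments for i in range(num_segments + 1)]
    return [(bounds[i] + bounds[i + 1] - 1) // 2 for i in range(num_segments)]


def temporal_decay_weights(
    indices: Sequence[int], num_frames: int, decay: float = 0.9
) -> List[float]:
    """Weight each sampled frame by its distance from the clip's centre.

    Assumption: weights decay exponentially with temporal distance and are
    normalised to sum to 1. The real decay schedule may differ.
    """
    centre = (num_frames - 1) / 2
    raw = [decay ** abs(i - centre) for i in indices]
    total = sum(raw)
    return [w / total for w in raw]


def video_reward(
    frames: Sequence[object],
    image_reward: Callable[[object], float],
    num_segments: int = 4,
    decay: float = 0.9,
) -> float:
    """Score a video as a decay-weighted average of per-frame image rewards."""
    indices = sample_segment_indices(len(frames), num_segments)
    weights = temporal_decay_weights(indices, len(frames), decay)
    return sum(w * image_reward(frames[i]) for w, i in zip(weights, indices))
```

In this sketch only `num_segments` frames are passed to the image reward model, which is where the cost saving of sparse sampling would come from; the decay factor controls how strongly frames near the centre dominate the final score.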

InstructVideo Alternatives