Emu Edit
Precise image editing, one-stop shop for multi-task needs
CommonProductImageImage EditingMulti-task Learning
Emu Edit is a multi-task image editing model that performs precise image editing by recognizing and generating tasks. It has made the latest technological breakthroughs in this field. Emu Edit's architecture is optimized for multi-task learning and trained on numerous tasks, including region-based editing, free-form editing, and computer vision tasks such as detection and segmentation. In addition, to more effectively handle these various tasks, we have introduced the concept of learned task embeddings to guide the generation process for accurately executing editing instructions. Our model, through multi-task training and the use of learned task embeddings, can significantly improve its ability to accurately execute editing instructions.
Emu Edit also supports rapid adaptation to unseen tasks through task inversion for few-shot learning. In this process, we keep the model weights unchanged and only update the task embeddings to adapt to new tasks. Our experiments demonstrate that Emu Edit can quickly adapt to new tasks such as super-resolution and contour detection. This makes Emu Edit particularly advantageous for task inversion when labeled samples are limited or computational budgets are restricted.
To support the strict and well-founded evaluation of instruction-based image editing models, we have also collected and publicly released a new benchmark dataset containing seven different image editing tasks: background modification, global image change, style modification, object removal, object addition, local modification, and color/texture modification. In addition, to allow for a fair comparison with Emu Edit, we also share Emu Edit's generation results on the dataset.
Emu Edit 2023 Meta retains all copyrights
Emu Edit Visit Over Time
Monthly Visits
1048
Bounce Rate
43.29%
Page per Visit
1.9
Visit Duration
00:00:49