Generative Powers of Ten
Generates videos with multi-scale continuous zoom based on text descriptions.
CommonProductDesignGenerative ModelMulti-scale
Generative Powers of Ten is a method for generating multi-scale consistent content using text-to-image models. It enables extreme semantic zoom of a scene, ranging from a wide-angle landscape view of a forest to a macro shot of an insect on a branch. This representation allows us to render continuous zoom videos or interactively explore different scales of a scene. We achieve this through a joint multi-scale diffusion sampling method that encourages consistency across different scales while preserving the integrity of each individual sampling process. Since each generated scale is guided by different text prompts, our method can achieve a deeper level of zoom than traditional super-resolution methods, which may struggle to create new contextual structures at completely different scales. We conducted qualitative comparisons of our method against image super-resolution and external sketching techniques and demonstrated that our method is most effective at generating consistent multi-scale content.
Generative Powers of Ten Visit Over Time
Monthly Visits
881
Bounce Rate
57.98%
Page per Visit
1.0
Visit Duration
00:00:00