Generative Powers of Ten

Generates videos with multi-scale continuous zoom based on text descriptions.

CommonProductDesignGenerative ModelMulti-scale
Generative Powers of Ten is a method for generating multi-scale consistent content using text-to-image models. It enables extreme semantic zoom of a scene, ranging from a wide-angle landscape view of a forest to a macro shot of an insect on a branch. This representation allows us to render continuous zoom videos or interactively explore different scales of a scene. We achieve this through a joint multi-scale diffusion sampling method that encourages consistency across different scales while preserving the integrity of each individual sampling process. Since each generated scale is guided by different text prompts, our method can achieve a deeper level of zoom than traditional super-resolution methods, which may struggle to create new contextual structures at completely different scales. We conducted qualitative comparisons of our method against image super-resolution and external sketching techniques and demonstrated that our method is most effective at generating consistent multi-scale content.
Visit

Generative Powers of Ten Visit Over Time

Monthly Visits

659

Bounce Rate

51.50%

Page per Visit

1.0

Visit Duration

00:00:00

Generative Powers of Ten Visit Trend

Generative Powers of Ten Visit Geography

Generative Powers of Ten Traffic Sources

Generative Powers of Ten Alternatives