FouriScale

Frequency-based method for training free high-resolution image synthesis.

CommonProductImageHigh-resolution imageFrequency analysis
FouriScale explores high-resolution image generation from pre-trained diffusion models from a frequency analysis perspective. Through an innovative, no-training method, it replaces the original convolutional layers in pre-trained diffusion models with a combination of dilation techniques and low-pass operations, further enhanced by a fill-and-crop strategy. This allows for flexible handling of various aspect ratios in text-to-image generation. Guided by FouriScale, this method successfully balances the structural integrity and fidelity of generated images, achieving remarkable capabilities for arbitrary-sized, high-resolution, and high-quality generation. With its simplicity and compatibility, this method provides valuable insights for future explorations in ultra-high-resolution image synthesis.
Visit

FouriScale Visit Over Time

Monthly Visits

490881889

Bounce Rate

37.92%

Page per Visit

5.6

Visit Duration

00:06:18

FouriScale Visit Trend

FouriScale Visit Geography

FouriScale Traffic Sources