FiT: A Novel Transformer Architecture for Image Generation Without Resolution and Aspect Ratio Constraints

站长之家

Published in AI News · 1 min read · Feb 21, 2024

Flexible Image Transformer (FiT) is a Transformer architecture for image generation models, designed to create images without limitations on resolution or aspect ratio. FiT treats an image as a sequence of image patches (tokens) whose length varies with the image size, which makes it more adaptable to different resolutions. Through its network design and training-free techniques, FiT shows considerable flexibility in extrapolating to resolutions beyond those it was trained on. It offers a new approach to generating images unconstrained by resolution and aspect ratio. The article also covers recent advances in other related large-scale models and generative model frameworks.
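To make the idea of treating an image as a variable-length token sequence concrete, here is a minimal sketch in Python with NumPy. It is an illustration of the general technique only, not FiT's actual implementation: the function name `patchify`, the patch size, the `max_tokens` limit, and the zero-padding with a validity mask are all assumptions introduced for this example.

```python
import numpy as np

def patchify(image: np.ndarray, patch: int = 2, max_tokens: int = 16):
    """Split an (H, W, C) array into patch tokens, padded to a fixed length.

    Hypothetical helper: returns (tokens, mask, positions), where mask marks
    the real tokens and positions holds each token's (row, col) grid index.
    """
    h, w, c = image.shape
    if h % patch or w % patch:
        raise ValueError("height and width must be multiples of the patch size")
    gh, gw = h // patch, w // patch          # patch grid size
    n = gh * gw                              # token count depends on image size
    if n > max_tokens:
        raise ValueError("image yields more tokens than max_tokens")

    # (gh, patch, gw, patch, c) -> (gh, gw, patch, patch, c) -> (n, patch*patch*c)
    tokens = (
        image.reshape(gh, patch, gw, patch, c)
        .transpose(0, 2, 1, 3, 4)
        .reshape(n, patch * patch * c)
    )

    # Assumption: pad every sequence to max_tokens so that images with
    # different shapes can share one batch; the mask flags the real tokens.
    padded = np.zeros((max_tokens, tokens.shape[1]), dtype=image.dtype)
    padded[:n] = tokens
    mask = np.zeros(max_tokens, dtype=bool)
    mask[:n] = True

    # Keep 2D grid coordinates so the aspect ratio is not lost after flattening.
    rows, cols = np.meshgrid(np.arange(gh), np.arange(gw), indexing="ij")
    positions = np.zeros((max_tokens, 2), dtype=np.int64)
    positions[:n] = np.stack([rows.ravel(), cols.ravel()], axis=1)
    return padded, mask, positions
```

Under these assumptions, a 4×8 image and an 8×4 image both produce eight real tokens of the same width, and only the stored grid positions distinguish the wide layout from the tall one. That is the sense in which a token-sequence view avoids fixing a single resolution or aspect ratio.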

Transformer · Image Generation · Resolution

This article is from AIbase Daily

