AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

Al Hardware

Lists all AI hardware products.

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

Translated Kuaishou's Open-Source Image Generation Model Kolors Enables Text Integration into Imagery

AIbase

Published inAI News · 5 min read · Jul 8, 2024

354

Quick Hands released a big move today by opening its in-house image generation model——“Kolors”. This is not an ordinary model; it has been trained on tens of billions of text-image pairs, equipped with a General Language Model (GLM) as a text encoder, supporting bilingual Chinese and English prompts, and can handle contexts up to 256 tokens.

Key Features of Kolors:

Bilingual Support:Utilizes the General Language Model (GLM) as a text encoder, enabling the model to not only master English but also perfectly understand and apply Chinese prompts.
Long Text Processing:Supports a context length of up to 256 tokens, allowing creators to detail their thoughts, whether complex scenes or rich stories.
Massive Data Training:Trained on tens of billions of text-image pairs, the model has a vast knowledge base, capable of generating diverse and accurate images.
Optimization for Chinese Cultural Elements:Especially optimized for Chinese cultural elements, the generated images are more in line with Chinese cultural characteristics, meeting localized needs.
Chinese Text Generation:“Kolors”not only understands Chinese but can also embed Chinese text into the generated images, adding more expressiveness to the images.

An AIbase test found that currently, Kolors performs better in inserting Chinese into images, with most outputs being correct, but with English, there is a tendency to have missing or incorrect characters.

QQ截图20240708112714.jpg

QQ截图20240708111705.jpg

As can be seen, the above-generated "lieping" (lying down) cat has no problem with Chinese characters, but when I change it to "AIbase", there are missing or omitted characters. In terms of Chinese output, Kolors performs well, but note that the text should not be too long; too long and it's prone to errors.

QQ截图20240708112728.jpg

This model is not just a simple tool; it is backed by the powerful technology of Quick Hands. Trained on massive data, it has special optimization for Chinese cultural elements, making the generated images more Chinese in flavor. This is not just a technical breakthrough but also a cultural inheritance.

The open-source plan also includes support for CN (ControlNet), LoRa (Low-Rank Adaptation), IPA (Image Prompt Adaptation), and direct support for ComfyUI, all of which are designed to make your creative process more smooth and personalized.

Technical Details:

"Kolors" is based on the SDXL model architecture and integrates the ChatGLM256 technology to enhance bilingual understanding and text generation capabilities.
It is worth noting that running this model requires a large amount of video memory, about 19GB, which may have certain requirements for hardware devices.

Quick Hands' open-source of "Kolors" is not only a contribution to the technical community but also a bold push for creative freedom. This demonstrates Quick Hands' determination and strength in AI technology, and shows the endless possibilities of AI in artistic creation.

Official Kolors Website: https://top.aibase.com/tool/kuaishouketudamoxingkolors

Project Address: https://top.aibase.com/tool/kolors

"Ketu AI Headlines"

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team