JUMPSTAR Releases Image Generation Model Step-1X-Medium with New Features such as Image-to-Image Generation

AIbase基地

Published inAI News · 5 min read · Dec 26, 2024

322

Shanghai Jumpspace Intelligent Technology Co., Ltd. recently announced a major upgrade to its image generation model, the Step-1X series, with the launch of the more powerful Step-1X-Medium version. This upgraded version has achieved significant improvements in several areas: based on the MMDit architecture, the generation speed has increased by over 30%; after targeted training, the new version has enhanced understanding capabilities and better consistency between images and text, resulting in more natural detail and texture in the generated images.

The Step-1X-Medium introduces a "Picture-to-Picture" feature, allowing users to simply upload an image and provide basic instructions to enhance details, apply style transfer, or make local modifications to the original image. Additionally, the new version has upgraded its ability to create "Chinese-style" content, better capturing the essence of Eastern facial features and presenting a more advanced and refined image texture. Furthermore, Step-1X-Medium supports adding English text in prompts, enabling the generated images to display English copy.

The upgraded Step-1X-Medium aims to be a powerful assistant for creators, deeply understanding the input ideas and providing more accurate and perfect output results. Currently, the new capabilities of Step-1X-Medium are available to users through API calls in the "Experience Center" of the Jumpspace open platform.

WeChat Screenshot_20241226081214.png

The new Step-1X-Medium has reached a new level in image generation quality, capable of producing more diverse scenes with stronger consistency between images and text. It can also deeply optimize Eastern character imagery, easily capturing the essence of Chinese style, generating consistently styled comic pages for fans of Chinese, Japanese, and American comics. For brand designers, Step-1X-Medium can generate advertising, product packaging, and marketing materials that align with brand tone, better showcasing the cultural core of the brand.

The "Base Image" feature launched with Step-1X-Medium allows creators to upload a base image, enabling the model to quickly understand the structure and style of the image and enhance details, transform styles, or refine specific areas based on the original creative concept. Additionally, Step-1X-Medium supports the SRef (Style Reference) generation function, providing style reference images from which the model extracts aesthetic styles and atmospheric features to incorporate into the composition of the generated images.

The advancements in AI technology allow Step-1X-Medium to add short English text in prompts, enhancing the visual artwork. This upgrade not only improves the quality and efficiency of image generation but also offers creators more creative space and possibilities.

Experience Link: https://platform.stepfun.com/

Hume AI Open Sources TADA: 5x Speed, Zero Hallucination TTS That Can Run 700-Second Audio on Mobile Devices

Hume AI opensources the TADA speech generation model, which uses a text-acoustic dual alignment architecture, significantly improving the efficiency and reliability of TTS systems. By achieving 1:1 strict synchronization between text tokens and acoustic representations, it effectively solves the content hallucination problem in traditional LLM-based TTS. The model has been validated through more than a thousand samples and shows excellent performance.

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

The OpenRouter platform has added two new models, Hunter Alpha and Healer Alpha. Hunter Alpha has up to 1 trillion parameters, supports a 1 million token context, and multimodal input, designed for agent scenarios, excelling in complex reasoning and multi-step tasks. Healer Alpha has a context window of 262K tokens. Both models have attracted community attention.

The Turing Award Winner Was Shocked! Claude 1 Hour Solved a Mathematical Puzzle That Had Baffled Donald Knuth for 30 Years

Donald Knuth was amazed by AI solving a mathematical problem he had been working on for weeks in just one hour. The computer science giant revealed in a short essay that Claude Opus 4.6 solved a mathematical problem he could trace back to 30 years ago in just one hour, demonstrating the amazing potential of artificial intelligence in the field of logical reasoning.

Qwen AI Glasses are temporarily sold out across all channels. They will be available again at 10:00 AM tomorrow.

The Qwen AI Glasses G1 Series was released on March 8th. It is equipped with dual flagship chips and 64GB of storage, with an official price of 2899 yuan. The lowest price after discounts is 1997 yuan. The product has been highly popular in the market, ranking first on the smart glasses bestseller list within 3 hours. It is temporarily sold out across all channels and will be available again at 10:00 AM on March 9th.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

JUMPSTAR Releases Image Generation Model Step-1X-Medium with New Features such as Image-to-Image Generation

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Enjoy Semi-Solid Battery from the Start! SAIC MG 4X Makes a Stunning Debut: Equipped with Illuminated Logo, the Intelligent Driving Game Model is Also Here

AI-Driven Revenue Record: Adobe's Q1 Fiscal Year 2026 Revenue Reaches $6.4 Billion, CEO Announces Resignation

Hume AI Open Sources TADA: 5x Speed, Zero Hallucination TTS That Can Run 700-Second Audio on Mobile Devices

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

Shenzhen Longgang to Jointly Host a 1,000-Person Lobster Festival with Kimi to Support OpenClaw Deployment

1-Minute Connection to WeCom! Tencent's AI Smart Agent WorkBuddy Launches, Entering the Desktop Era of Intelligent Agent Battle

The Turing Award Winner Was Shocked! Claude 1 Hour Solved a Mathematical Puzzle That Had Baffled Donald Knuth for 30 Years

Qwen AI Glasses are temporarily sold out across all channels. They will be available again at 10:00 AM tomorrow.

6 Months to Gain 1 Million Fans! Instant AI Completely Disrupts News Reading: Rejecting Useless Feeds

1Gbps Peak Record Broken! Huawei Collaborates with China Telecom to Launch 5G-A Ultra Uplink Technology: Piloted on Beijing-Shanghai High-Speed Railway, Satisfaction Exceeds 98%

AI News Recommendations

Enjoy Semi-Solid Battery from the Start! SAIC MG 4X Makes a Stunning Debut: Equipped with Illuminated Logo, the Intelligent Driving Game Model is Also Here

AI-Driven Revenue Record: Adobe's Q1 Fiscal Year 2026 Revenue Reaches $6.4 Billion, CEO Announces Resignation

Hume AI Open Sources TADA: 5x Speed, Zero Hallucination TTS That Can Run 700-Second Audio on Mobile Devices

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

Shenzhen Longgang to Jointly Host a 1,000-Person Lobster Festival with Kimi to Support OpenClaw Deployment

1-Minute Connection to WeCom! Tencent's AI Smart Agent WorkBuddy Launches, Entering the Desktop Era of Intelligent Agent Battle

The Turing Award Winner Was Shocked! Claude 1 Hour Solved a Mathematical Puzzle That Had Baffled Donald Knuth for 30 Years

Qwen AI Glasses are temporarily sold out across all channels. They will be available again at 10:00 AM tomorrow.

6 Months to Gain 1 Million Fans! Instant AI Completely Disrupts News Reading: Rejecting Useless Feeds

1Gbps Peak Record Broken! Huawei Collaborates with China Telecom to Launch 5G-A Ultra Uplink Technology: Piloted on Beijing-Shanghai High-Speed Railway, Satisfaction Exceeds 98%

GEO Services