As competition to develop large language models (LLMs) intensifies, major AI companies face mounting challenges, spurring growing interest in alternatives to the Transformer architecture. Since its introduction by Google researchers in 2017, the Transformer has served as the foundation of today's generative AI. Against this backdrop, Liquid AI, a startup incubated at MIT, has launched an innovative framework called STAR (Synthesis of Tailored Architectures).

The STAR framework uses evolutionary algorithms and a numerical encoding system to automate the generation and optimization of AI model architectures. According to Liquid AI's research team, STAR departs from traditional architecture design by employing a hierarchical encoding, known as the "STAR genome," to explore a broad design space of potential architectures. By recombining and mutating these genomes, STAR can synthesize and optimize architectures that meet specific performance and hardware requirements, as sketched below.
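To make the search procedure concrete, here is a minimal, purely illustrative sketch of an evolutionary loop over numerically encoded architectures. Liquid AI has not published STAR's genome format, variation operators, or evaluation pipeline at this level of detail, so every name, parameter, and the toy fitness function below is a hypothetical stand-in for the real system.

```python
import random

GENOME_LENGTH = 8        # hypothetical: integer "genes" per candidate architecture
GENE_CHOICES = range(4)  # hypothetical: each gene picks one of 4 operator types


def random_genome():
    """Sample a random architecture encoding."""
    return [random.choice(GENE_CHOICES) for _ in range(GENOME_LENGTH)]


def fitness(genome):
    """Toy stand-in for scoring an architecture on quality and cache size.

    A real system would decode the genome into a model and evaluate it;
    here we simply reward closeness to an arbitrary target pattern.
    """
    target = [1, 2, 0, 3, 1, 2, 0, 3]
    return -sum(abs(g - t) for g, t in zip(genome, target))


def crossover(parent_a, parent_b):
    """Recombine two genomes at a random cut point."""
    cut = random.randrange(1, GENOME_LENGTH)
    return parent_a[:cut] + parent_b[cut:]


def mutate(genome, rate=0.1):
    """Resample each gene with probability `rate`."""
    return [random.choice(GENE_CHOICES) if random.random() < rate else g
            for g in genome]


def evolve(pop_size=32, generations=50):
    """Run a basic elitist genetic-algorithm loop."""
    population = [random_genome() for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        elite = population[: pop_size // 4]  # keep the best quarter
        children = [
            mutate(crossover(random.choice(elite), random.choice(elite)))
            for _ in range(pop_size - len(elite))
        ]
        population = elite + children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = evolve()
    print("best genome:", best, "fitness:", fitness(best))
```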

In tests on autoregressive language modeling, STAR outperformed highly optimized Transformer++ and hybrid baselines. When optimizing jointly for quality and cache size, STAR-evolved architectures reduced cache size by up to 37% compared to hybrid models and by up to 90% compared to traditional Transformers. This efficiency comes without sacrificing predictive performance; in some cases the evolved models even surpass their competitors.

The research also indicates that STAR's architectures scale well: a STAR-evolved model scaled from 125 million to 1 billion parameters performed on par with or better than existing Transformer++ and hybrid models on standard benchmarks, while significantly reducing inference cache requirements.

Liquid AI states that STAR's design principles draw on concepts from dynamical systems, signal processing, and numerical linear algebra, yielding a flexible search space of computational units. A notable feature of STAR is its modular design, which allows it to encode and optimize architectures at multiple levels of granularity, giving researchers insight into which combinations of architectural components work well together; the sketch below illustrates one way such a hierarchy might be encoded.
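The following sketch illustrates the general idea of a hierarchical, modular architecture encoding. The class names, fields, and operator vocabulary are illustrative assumptions, not Liquid AI's actual genome schema; the point is only that each level (operator, block, backbone) can be encoded and varied independently.

```python
from dataclasses import dataclass, field

@dataclass
class Operator:
    kind: str   # hypothetical operator types, e.g. "attention", "conv", "ssm", "gate"
    width: int  # hypothetical per-operator hyperparameter

@dataclass
class Block:
    operators: list[Operator] = field(default_factory=list)

@dataclass
class Backbone:
    blocks: list[Block] = field(default_factory=list)

    def encode(self) -> list[int]:
        """Flatten the hierarchy into a numeric genome a search can mutate."""
        vocab = {"attention": 0, "conv": 1, "ssm": 2, "gate": 3}
        genes = []
        for block in self.blocks:
            for op in block.operators:
                genes.extend([vocab[op.kind], op.width])
        return genes

# Example: a two-block backbone mixing attention- and convolution-style units.
net = Backbone(blocks=[
    Block(operators=[Operator("attention", 64), Operator("gate", 64)]),
    Block(operators=[Operator("ssm", 128), Operator("conv", 128)]),
])
print(net.encode())
```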

Liquid AI believes that STAR's efficient architecture synthesis will be applicable across a wide range of fields, especially in scenarios that require balancing quality against computational efficiency. While the company has yet to announce specific commercial deployments or pricing, its research results mark a significant advance in automated architecture design. As the AI field continues to evolve, frameworks like STAR may play a crucial role in shaping the next generation of intelligent systems.

Official blog: https://www.liquid.ai/research/automated-architecture-synthesis-via-targeted-evolution

Key Points:

🌟 The STAR framework launched by Liquid AI automatically generates and optimizes AI model architectures through evolutionary algorithms.

📉 STAR-evolved models reduce cache size by up to 90% compared to traditional Transformers while matching or surpassing their predictive performance.

🔍 The modular design of STAR can be applied across various fields, driving further development in AI system optimization.