Researchers at Sakana AI have recently made notable progress in artificial intelligence, using vision-language foundation models (FMs) for the first time to automate the search for Artificial Life (ALife) simulations. The new method, called ASAL (Automated Search for Artificial Life), changes how artificial life research can be conducted and is expected to accelerate progress in the field.

Traditional artificial life research relies primarily on manual design and trial and error; ASAL changes this. At its core, the method uses foundation models to evaluate the videos produced by simulations and then automatically searches for interesting ALife simulations. ASAL discovers interesting simulations through three mechanisms:

1. Supervised target search: finding simulations that produce a specific phenomenon described by a text prompt. For example, researchers can set targets such as "one cell" or "two cells" and let the system automatically identify simulations that match these descriptions (a minimal sketch of this scoring loop appears below).
2. Open-endedness search: finding simulations that keep generating novelty over time, which helps surface simulations that remain interesting to human observers.
3. Illumination (diversity search): finding a diverse set of interesting simulations, mapping out a space of "alien worlds."
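To make the supervised-target mechanism concrete, here is a minimal sketch of how a text prompt can score candidate simulations with CLIP. It is an illustrative approximation rather than the authors' implementation (which is released in the project repository linked below); `sample_params` and `run_simulation` are hypothetical stand-ins for a concrete ALife substrate such as Lenia or Particle Life.

```python
import torch
import open_clip

# Load a CLIP backbone; the approach is model-agnostic, so any contrastive
# vision-language model with image and text encoders could be used instead.
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k")
tokenizer = open_clip.get_tokenizer("ViT-B-32")
model.eval()

def clip_score(frame, prompt):
    """Cosine similarity between a rendered frame (PIL image) and a text prompt."""
    image = preprocess(frame).unsqueeze(0)
    text = tokenizer([prompt])
    with torch.no_grad():
        img_emb = model.encode_image(image)
        txt_emb = model.encode_text(text)
    img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
    txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
    return (img_emb @ txt_emb.T).item()

def supervised_target_search(sample_params, run_simulation, prompt, n_trials=256):
    """Random search: keep the parameters whose final frame best matches `prompt`.

    `sample_params` and `run_simulation` are supplied by the caller and are
    hypothetical stand-ins for a concrete ALife substrate; `run_simulation`
    is assumed to return the final state rendered as a PIL image.
    """
    best_params, best_score = None, -float("inf")
    for _ in range(n_trials):
        params = sample_params()
        final_frame = run_simulation(params)
        score = clip_score(final_frame, prompt)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score
```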

ASAL's versatility enables it to be effectively applied to various ALife substrates, including Boids, Particle Life, Game of Life, Lenia, and Neural Cellular Automata. Researchers have discovered unprecedented life forms within these substrates, such as unusual clustering patterns in Boids, new self-organizing cells in Lenia, and open-ended cellular automata similar to Conway's Game of Life.

Moreover, ASAL supports quantitative analysis of phenomena that previously could only be assessed qualitatively. Because foundation models learn representations that align well with human perception, ASAL can measure complexity in a way that matches human judgment. For instance, researchers can quantify the plateau phase in Lenia simulations by measuring the rate of change of CLIP vectors over the course of the simulation.
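As a rough illustration of that idea (the exact metric used in the paper may differ), the per-step change of CLIP embeddings along a rollout can be computed as one minus the cosine similarity between consecutive frames; a sustained near-zero rate then marks a plateau:

```python
import numpy as np

def embedding_change_rate(frame_embeddings):
    """Per-step change of CLIP embeddings along a simulation rollout.

    frame_embeddings: array of shape (T, D), one CLIP vector per frame.
    Returns an array of length T-1 holding 1 - cosine similarity between
    consecutive frames; values near zero indicate the simulation has
    stopped changing in representation space.
    """
    e = frame_embeddings / np.linalg.norm(frame_embeddings, axis=1, keepdims=True)
    cos = np.sum(e[:-1] * e[1:], axis=1)
    return 1.0 - cos

def plateau_mask(frame_embeddings, threshold=1e-3):
    """Boolean mask of steps where the change rate falls below a threshold."""
    return embedding_change_rate(frame_embeddings) < threshold
```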

The innovation of this research lies in its use of pre-trained foundation models, particularly the CLIP (Contrastive Language-Image Pre-training) model, to evaluate the videos of simulations. The CLIP model aligns the representations of images and text through contrastive learning, enabling it to understand human concepts of complexity. The ASAL approach is not limited to specific foundation models or simulation substrates, meaning it can be compatible with future models and substrates.
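Because the approach is model-agnostic, the search logic only needs some encoder that maps rendered frames to feature vectors. The sketch below illustrates this with a minimal interface and a trivial pixel baseline; it is an illustrative abstraction, not the API of the released code.

```python
from typing import Protocol
import numpy as np

class FrameEncoder(Protocol):
    """Anything that maps rendered simulation frames to feature vectors."""
    def embed_frames(self, frames: np.ndarray) -> np.ndarray:
        """frames: (T, H, W, 3) uint8 array -> embeddings: (T, D) float array."""
        ...

class PixelEncoder:
    """Trivial baseline: downsampled, flattened pixels as the representation."""
    def embed_frames(self, frames: np.ndarray) -> np.ndarray:
        small = frames[:, ::8, ::8, :].astype(np.float32) / 255.0
        return small.reshape(len(frames), -1)

def score_rollout(frames: np.ndarray, encoder: FrameEncoder) -> np.ndarray:
    # The search objectives only ever see embeddings, so a CLIP-backed or
    # DINOv2-backed encoder can replace PixelEncoder without other changes.
    return encoder.embed_frames(frames)
```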

Researchers have also validated ASAL experimentally, testing it with different foundation models (such as CLIP and DINOv2) across several ALife substrates. The results indicate that CLIP slightly outperforms DINOv2 at measuring diversity in a way that aligns with human judgment, but both substantially surpass low-level pixel representations. This highlights the importance of deep foundation-model representations for capturing human notions of diversity.
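One simple way to see why the representation matters is to score the same set of simulations for diversity under different feature spaces. The paper's illumination objective is more involved, but a mean pairwise distance, as sketched below, conveys the idea: the same renderings can spread out very differently in CLIP space than in raw pixel space.

```python
import numpy as np

def mean_pairwise_distance(features):
    """Average Euclidean distance between all pairs of feature vectors.

    `features` has shape (N, D): one row per simulation, where D is either
    the flattened pixel dimension or the foundation-model embedding size.
    A higher value means the set of simulations is more spread out in that
    representation space.
    """
    diffs = features[:, None, :] - features[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    n = len(features)
    return dists.sum() / (n * (n - 1))

# Comparing the same simulations under two representations (hypothetical
# arrays; in practice they would come from encoding rendered frames with
# CLIP versus flattening the raw pixels):
# diversity_clip  = mean_pairwise_distance(clip_embeddings)
# diversity_pixel = mean_pairwise_distance(pixel_vectors)
```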

This research opens new avenues in artificial life, allowing researchers to focus on higher-level questions, such as how best to describe the phenomena they wish to observe, and then let an automated process search for those outcomes. ASAL not only helps scientists discover new life forms but also enables quantitative analysis of complexity and open-endedness in life simulations. Ultimately, this technology could help us understand the nature of life and the full range of life forms that might exist in the universe.

Project Code: https://github.com/SakanaAI/asal/

Paper Link: https://arxiv.org/pdf/2412.17799