In the realm of artificial intelligence, language models are often likened to a black box: we feed in text, and text comes out, but what exactly happens in between? Google DeepMind's latest release, Gemma Scope, sheds light on this question.

Language model activations are commonly hypothesized to be sparse linear combinations of feature vectors, but what those features actually represent remains elusive. Sparse Autoencoders (SAEs), an unsupervised method for recovering such features, are a promising tool for this problem. However, the technique is still in its infancy: training SAEs is expensive, which has slowed research progress.

Google DeepMind has trained and released Gemma Scope, a suite of Sparse Autoencoders trained on the activations of the Gemma 2 models. Each SAE decomposes an activation into latent features with an encoder and reconstructs it with a decoder, with the aim of surfacing meaningful, interpretable features.
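
As a rough sketch of that encode/decode step (dimensions, initialization, and names here are illustrative assumptions, not DeepMind's released code):

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """Minimal SAE: encode an activation x into sparse latents f,
    then reconstruct x_hat as a linear combination of decoder rows.
    Sizes are illustrative, not Gemma Scope's actual dimensions."""

    def __init__(self, d_model: int = 2304, d_sae: int = 16384):
        super().__init__()
        self.W_enc = nn.Parameter(torch.randn(d_model, d_sae) * 0.01)
        self.b_enc = nn.Parameter(torch.zeros(d_sae))
        self.W_dec = nn.Parameter(torch.randn(d_sae, d_model) * 0.01)
        self.b_dec = nn.Parameter(torch.zeros(d_model))

    def forward(self, x: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        # Encode: project into a much wider basis; the nonlinearity keeps
        # only a few latents active per input (JumpReLU variant below).
        f = torch.relu(x @ self.W_enc + self.b_enc)
        # Decode: reconstruct x as a sparse linear combination of
        # decoder directions, one per latent feature.
        x_hat = f @ self.W_dec + self.b_dec
        return f, x_hat
```

Training then balances the reconstruction error ||x − x̂||² against a sparsity penalty on f; Gemma Scope's particular choice of penalty is tied to the JumpReLU design described next.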

Gemma Scope employs JumpReLU SAEs, whose activation function uses a shifted Heaviside step function as a gate: each latent has a learned threshold, and pre-activations below it are zeroed. This design optimizes reconstruction loss while directly penalizing the number of active latent features (an L0 sparsity penalty).
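
A minimal sketch of that gating mechanism, assuming per-latent thresholds (the straight-through estimators the paper uses to train the thresholds through the step function are omitted, and the names are mine):

```python
import torch

def jumprelu(pre_acts: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    """JumpReLU activation: a shifted Heaviside step gates each latent.
    pre_acts: (..., d_sae) encoder pre-activations.
    theta:    (d_sae,) learned positive per-latent thresholds."""
    gate = (pre_acts > theta).to(pre_acts.dtype)  # Heaviside step H(z - theta)
    return pre_acts * gate                         # keep values above threshold

def sparsity_penalty(pre_acts: torch.Tensor, theta: torch.Tensor) -> torch.Tensor:
    # Directly counts active latents per input (an L0 norm); the paper
    # makes this trainable with straight-through gradients, omitted here.
    return (pre_acts > theta).to(pre_acts.dtype).sum(dim=-1).mean()
```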

Gemma Scope's SAEs were trained on the activations of the Gemma 2 models. During training, activation vectors are normalized to a fixed scale, and separate SAEs are trained at different layers and sites, including attention head outputs, MLP outputs, and the post-MLP residual stream.
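
A hedged sketch of that setup: the unit-mean-squared-norm normalization reflects my reading of the report, and the site labels and `get_acts` helper are hypothetical, not the repo's actual hook names:

```python
import torch

def fit_norm_scale(sample_acts: torch.Tensor) -> float:
    """Estimate a fixed scalar s so normalized activations have unit
    mean squared norm: E[||x / s||^2] = 1 over a sample batch."""
    return sample_acts.pow(2).sum(dim=-1).mean().sqrt().item()

# One SAE is trained per (layer, site) pair; labels are illustrative.
SITES = ("attn_out", "mlp_out", "resid_post")

def normalized_batches(get_acts, layer: int, site: str, scale: float):
    """Yield normalized activation batches for SAE training. `get_acts`
    is a hypothetical callable returning (batch, d_model) activations
    captured at the given layer and site."""
    while True:
        yield get_acts(layer, site) / scale
```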

Gemma Scope's performance has been evaluated from multiple perspectives. The experiments show that residual-stream SAEs generally incur higher delta loss than SAEs at other sites, and that sequence length significantly affects SAE performance. Performance also varies across dataset subsets, with Gemma Scope performing best on the DeepMind mathematics subset.
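
Delta loss here is the increase in next-token cross-entropy when the SAE's reconstruction is spliced into the forward pass in place of the original activation. A sketch under that reading, assuming an HF-style `model(tokens, labels=tokens)` interface and a site whose module output is a plain tensor:

```python
import torch

@torch.no_grad()
def delta_loss(model, sae, tokens, target_module) -> float:
    """Delta loss = CE(with SAE reconstruction spliced in) - CE(original).
    `target_module` is the submodule whose output the SAE was trained on."""
    ce_clean = model(tokens, labels=tokens).loss.item()

    def splice(module, inputs, output):
        _, x_hat = sae(output)   # replace the activation by its reconstruction
        return x_hat             # returned value overrides the module output

    handle = target_module.register_forward_hook(splice)
    try:
        ce_spliced = model(tokens, labels=tokens).loss.item()
    finally:
        handle.remove()
    return ce_spliced - ce_clean
```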

The release of Gemma Scope offers a path toward a range of open questions: deepening our understanding of SAEs themselves, improving performance on practical tasks, and red-teaming SAEs to test whether they have truly identified "real" concepts in the model.

With Gemma Scope in hand, we are poised to make significant strides in AI interpretability and safety. It will help us better understand the internal workings of language models, enhancing their transparency and reliability.

Paper link: https://storage.googleapis.com/gemma-scope/gemma-scope-report.pdf

Try it online: https://www.neuronpedia.org/gemma-scope#main