Generative language models face numerous challenges in the transition from training to practical deployment. A central one is how to get the best possible performance out of a model at inference time.
Current alignment strategies, such as reinforcement learning from human feedback (RLHF), focus mainly on improving the model's win rate while largely ignoring the decoding strategies used at inference time, such as Best-of-N sampling and controlled decoding. This gap between the training objective and actual usage can lead to inefficiencies and degrade the quality and reliability of outputs.
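To make the inference-time strategies concrete, here is a minimal sketch of Best-of-N sampling and its Worst-of-N counterpart. The `generate` and `reward_model` callables are hypothetical placeholders, not an API from the paper or any specific library.

```python
# Minimal sketch of Best-of-N / Worst-of-N sampling.
# `generate` and `reward_model` are assumed, user-supplied callables:
#   generate(prompt) -> str          (samples one response from the model)
#   reward_model(prompt, response) -> float  (scores a response)

def best_of_n(prompt, generate, reward_model, n=8):
    """Draw n candidate responses and return the one the reward model scores highest."""
    candidates = [generate(prompt) for _ in range(n)]
    scores = [reward_model(prompt, c) for c in candidates]
    best_index = max(range(n), key=lambda i: scores[i])
    return candidates[best_index]

def worst_of_n(prompt, generate, reward_model, n=8):
    """Adversarial counterpart used in safety evaluations: return the lowest-scoring response."""
    candidates = [generate(prompt) for _ in range(n)]
    scores = [reward_model(prompt, c) for c in candidates]
    worst_index = min(range(n), key=lambda i: scores[i])
    return candidates[worst_index]
```

A model aligned without regard to these procedures may look good on single-sample win rate yet underperform once Best-of-N selection or Worst-of-N stress tests are applied at deployment.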
To address this, Google DeepMind and Google Research developed InfAlign, a machine learning framework that folds inference-time strategies into the alignment process, aiming to bridge the gap between training and deployment. Through a calibrated reinforcement learning approach, it adjusts the reward function to match the specific inference strategy. InfAlign is particularly effective with techniques such as Best-of-N sampling (generating multiple responses and selecting the best one) and Worst-of-N (commonly used in safety evaluations), ensuring that aligned models perform well in both controlled settings and real-world scenarios.
At the core of InfAlign is the calibrate-and-transform reinforcement learning (CTRL) algorithm, which follows three steps: calibrating reward scores, transforming them according to the chosen inference strategy, and solving a KL-regularized optimization problem. By tailoring the reward transformation to the deployment scenario, InfAlign aligns the training objective with inference-time needs. This not only improves the inference-time win rate but also keeps training computationally efficient. InfAlign also makes models more robust, enabling them to handle a variety of decoding strategies and produce consistently high-quality outputs.
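The sketch below illustrates the three CTRL steps under stated assumptions: calibration is approximated with an empirical CDF over reward scores sampled from the reference (pre-alignment) policy, and the exponential transforms shown are one plausible choice for the Best-of-N and Worst-of-N settings rather than the paper's exact formulas. The function names are illustrative.

```python
import math
import bisect

# Sketch of the CTRL recipe:
#   (1) calibrate the raw reward to its quantile under the reference policy,
#   (2) transform the calibrated reward for the target inference strategy,
#   (3) optimize the transformed reward with a standard KL-regularized RL objective.
# The specific transforms below are assumptions for illustration only.

def fit_calibrator(reference_scores):
    """Build an empirical CDF from reward scores of responses sampled from the reference policy."""
    sorted_scores = sorted(reference_scores)

    def calibrate(score):
        # Fraction of reference responses with reward <= score, a value in [0, 1].
        rank = bisect.bisect_right(sorted_scores, score)
        return rank / len(sorted_scores)

    return calibrate

def transform_for_best_of_n(calibrated, temperature=4.0):
    """A plausible monotone transform that emphasizes the upper tail (Best-of-N deployment)."""
    return math.exp(temperature * calibrated)

def transform_for_worst_of_n(calibrated, temperature=4.0):
    """A mirror transform that penalizes the lower tail (Worst-of-N safety setting)."""
    return -math.exp(temperature * (1.0 - calibrated))

def ctrl_reward(prompt, response, reward_model, calibrate, transform):
    """Reward actually fed to the KL-regularized RL step."""
    raw = reward_model(prompt, response)
    return transform(calibrate(raw))

# Step (3) is the usual KL-regularized objective,
#   maximize  E[ transformed_reward(x, y) ]  -  beta * KL(pi || pi_ref),
# which any standard RLHF trainer can optimize once the reward is swapped out.
```

Because calibration maps every reward onto a common [0, 1] quantile scale, the same transformation can be reused across prompts even when the raw reward model is miscalibrated.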
Experiments on Anthropic's helpfulness and harmlessness datasets validated InfAlign's effectiveness. Compared with existing methods, InfAlign improved the inference-time win rate by 8%-12% for Best-of-N sampling and by 4%-9% for Worst-of-N safety evaluations. These gains stem from its calibrated reward transformations, which address the miscalibration of reward models and ensure consistent performance across different inference scenarios.
InfAlign represents a significant advance in aligning generative language models. By incorporating inference-aware strategies, it addresses the critical mismatch between how models are trained and how they are deployed. Its solid theoretical foundation and empirical results highlight its potential to improve the alignment of AI systems across the board.
Link: https://arxiv.org/abs/2412.19792
Key Points:
🌟 InfAlign is a new framework from Google DeepMind and Google Research aimed at improving the performance of language models at inference time.
📈 The framework aligns training objectives with inference-time needs by adjusting the reward function for the target inference strategy through a calibrated reinforcement learning method.
✅ Experimental results show that InfAlign significantly improves models' inference-time win rates across multiple tasks, demonstrating strong adaptability and reliability.