In the era of AI, training a large language model (LLM) is like a martial arts master spending years in seclusion honing their skills: it demands enormous computational power and data. Releasing a model as open source is like the master publicly sharing their manual, but under a license (such as Apache 2.0 or the Llama 2 Community License) that protects the owner's intellectual property (IP).
However, the world is fraught with dangers, and incidents involving "shell models" keep appearing. Some developers claim to have trained new LLMs when, in reality, they have merely repackaged or fine-tuned existing foundation models (such as Llama 2 and MiniCPM-V). This is like secretly learning another master's techniques and then claiming them as one's own. To prevent this, model owners and third parties urgently need a way to identify "shell" models.
Current model fingerprinting methods mainly fall into two categories:
Injected Fingerprints: This is like secretly marking a manual, for example through watermarking. Such methods artificially insert "triggers" during training or fine-tuning so that the model generates specific content under certain conditions, thereby revealing its origin. However, this increases training costs, can degrade model performance, and the triggers can be removed; moreover, it cannot be applied to models that have already been released.
Intrinsic Fingerprints: This is like determining the source from the content and style of the manual itself. These methods use the model's inherent properties, including its weights and feature representations, for identification. Weight-based fingerprinting identifies models by computing the similarity of their weights, but it is easily defeated by weight changes such as permutation, pruning, and fine-tuning. Semantic analysis methods instead identify models through statistical analysis of the text they generate. Both approaches suffer from insufficient robustness.
So, is there a method that can effectively identify "shell" models without impacting model performance, while resisting all these "fancy" modifications?
Researchers from the Shanghai Artificial Intelligence Laboratory and other institutions have proposed a new model fingerprinting method—REEF.
The working principle of REEF is as follows:
REEF is a fingerprinting method based on feature representations. It does not depend on the representations of any single layer; instead, it exploits the strong representation-modeling capability of LLMs and can extract identifying features from various layers.
It compares the centered kernel alignment (CKA) similarity of the two models' feature representations on the same samples. CKA is a similarity metric based on the Hilbert-Schmidt independence criterion (HSIC), which measures the statistical dependence between two sets of random variables.
If the similarity is high, the suspect model is likely derived from the victim model; if it is low, such derivation is unlikely.
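To make this concrete, here is a minimal sketch of the linear form of CKA applied to toy feature matrices. The `linear_cka` helper and the stand-in data are illustrative assumptions, not REEF's actual implementation; in practice the two matrices would be hidden states extracted from a chosen layer of the victim and suspect models on the same input samples, and the paper may use a kernelized variant of CKA.

```python
import numpy as np

def linear_cka(X, Y):
    """Linear CKA between two feature matrices of shape (n_samples, dim).

    Columns are mean-centered first; the score lies in [0, 1], and values
    close to 1 indicate highly similar representation geometry.
    """
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)
    # Linear-kernel CKA: ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    return (np.linalg.norm(Y.T @ X, "fro") ** 2
            / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")))

# Toy stand-ins (small sizes) for hidden states of one layer,
# computed on the SAME samples for both models.
rng = np.random.default_rng(0)
victim = rng.standard_normal((1024, 128))                     # n_samples x hidden_dim
suspect = victim + 0.1 * rng.standard_normal((1024, 128))     # lightly perturbed copy
unrelated = rng.standard_normal((1024, 128))                  # an independent model

print("derived model CKA:  ", linear_cka(victim, suspect))    # close to 1
print("unrelated model CKA:", linear_cka(victim, unrelated))  # noticeably lower
```

In a real comparison, the decision would be based on how the suspect's CKA score compares with the scores of known independent models, not on a fixed universal threshold.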
What are the advantages of REEF?
No training required: REEF does not modify the model, so it neither affects model performance nor incurs additional training costs.
Strong robustness: REEF is robust to various downstream modifications such as model pruning, fine-tuning, merging, permutation, and scaling transformations. Even if the suspect model has undergone extensive fine-tuning (on up to 700B tokens), REEF can still reliably determine whether it originated from the victim model.
Theoretical guarantees: The researchers prove theoretically that CKA is invariant to weight permutations and scaling transformations (the short numerical check below illustrates this).
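As a quick sanity check of these invariances at the representation level, the snippet below (reusing the same hypothetical `linear_cka` helper) shows that permuting the feature dimensions or rescaling the features leaves the linear CKA score unchanged; this only mirrors, and does not reproduce, the paper's formal proof.

```python
import numpy as np

def linear_cka(X, Y):
    # Same illustrative helper as in the sketch above.
    X = X - X.mean(axis=0, keepdims=True)
    Y = Y - Y.mean(axis=0, keepdims=True)
    return (np.linalg.norm(Y.T @ X, "fro") ** 2
            / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")))

rng = np.random.default_rng(0)
X = rng.standard_normal((1024, 128))   # features of the original model

perm = rng.permutation(128)            # permute hidden dimensions
print(linear_cka(X, X[:, perm]))       # -> 1.0 (permutation-invariant)

print(linear_cka(X, 3.7 * X))          # -> 1.0 (scale-invariant)
```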
Experimental results show that REEF performs exceptionally well in identifying "shell" models, outperforming existing weight-based and semantic analysis methods.
The emergence of REEF provides a new tool for protecting the intellectual property of LLMs and helps combat unethical or illegal activities such as unauthorized use or replication of models.
Paper link: https://arxiv.org/pdf/2410.14273