Peking University Joins Forces to Revolutionize Image Retrieval: Accurately Match Sketches, Artworks, and Low-Resolution Images!

AIbase基地

Published inAI News · 5 min read · Aug 22, 2024

235

In the digital era, we interact with vast amounts of images daily. But have you ever imagined how magical it would be if we could quickly find the images we want just by using a sketch, an artwork, or even a blurry photo? Researchers from Peking University's Yuan Li课题组, along with colleagues from Nanyang Technological University and Tsinghua University's Institute of Automation, have brought us such a surprise—a groundbreaking image retrieval technology that can handle diverse query styles, whether sketches, artworks, or low-resolution images, with precise matching.

The core of this technology is their proposed "Universal Style Retrieval" method. Unlike traditional text-based image retrieval, this new method can process various query styles, including combined queries such as sketches with text, artworks with text, etc. This not only enhances the flexibility of retrieval but also greatly improves the accuracy.

To achieve this goal, the research team constructed two unique datasets: DSR (Diverse-Style Retrieval Dataset) and ImageNet-X. DSR includes 10,000 natural images and corresponding texts for four retrieval styles, while ImageNet-X contains 1 million natural images with various style annotations. The establishment of these two datasets provides rich training and testing resources for the new method.

Even more exciting, the research team proposed a framework named FreestyleRet. This framework effectively solves the problem of existing models being unable to accommodate different types of retrieval vectors by extracting image styles and injecting them into the retrieval model. FreestyleRet consists of three main modules: the style extraction module, the style space construction module, and the style-inspired prompt tuning module. These modules work together to enable the retrieval model to understand and process various style query vectors.

In experiments, the FreestyleRet framework demonstrated outstanding performance. It not only achieved significant improvements in Recall@1 and Recall@5 on the DSR and ImageNet-X datasets but also showed good generalization and scalability in handling various style query vectors.

The results of this research have been published and can be found in detail on arXiv. Additionally, the related code and datasets have been open-sourced for interested researchers and developers to further explore and apply.

This is not just a technological leap in the field of image retrieval but also a significant convenience in our daily lives. Imagine, in the future, whether seeking inspiration, conducting academic research, or daily entertainment, we will be able to find the necessary image resources more quickly and accurately. This is the power of technology, making everything possible.

Paper link: https://arxiv.org/pdf/2312.02428

Image Retrieval Technology Universal Style Retrieval Peking University Nanyang Technological University

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Shengshu Technology Secures Several Billion Yuan in Funding, Driving New Trends in AI Commercialization through Video Generation

Recently, Shengshu Technology, a leading company in the field of multimodal AI, announced the successful completion of an A-round funding round worth several billion yuan. This round was led by Bohua Capital, with existing investors such as Baidu's strategic investment division and the Beijing Artificial Intelligence Industry Investment Fund continuing to participate, demonstrating strong market recognition of Shengshu Technology. The company plans to use the funds to further advance model R&D and technological innovation, explore the potential of multimodal large models, and accelerate product expansion and user services. Multimodal technology, especially in the field of video generation, is currently experiencing rapid development.

Sep 19, 2025

100

Musk's AI Company Faces Intensifying Power Struggle, Multiple Executives Leave Due to Discontent with Management Style

Recently, Elon Musk's AI company xAI has experienced a management crisis, with multiple executives leaving due to dissatisfaction with the company's management style and financial situation. Currently, the daily operations of xAI are handled by two close advisors of Musk, Jared Bunch and John Hering, and all major decisions still require Musk's approval. Image source note: The image was generated by AI, provided by the licensing service Midjourney. A source revealed that some executives of xAI expressed dissatisfaction with Bunch and Hering in internal meetings.

Sep 19, 2025

120

Meta Launches Horizon Hyperscape Capture Tool, Quest3 Users Can Create Photo-Realistic VR Scenes

Meta Corporation officially launched its VR scanning tool Meta Horizon Hyperscape Capture (Beta) today, allowing users of the Quest3 headset to scan the real world and recreate these scenes in virtual reality with photo-realistic quality. The release of this tool fulfills Meta's commitment made at the Connect 2024 conference. From Demo to Reality: Application of Gaussian Splatting Technology for Users. Last year at the Connect 2024 conference, Meta had previously introduced...

Sep 19, 2025

Shengshu Technology Completes a New Round of A-Round Funding Worth Billions of Yuan

On September 19, 2025, Shengshu Technology announced that it has completed a new round of A-round funding worth billions of yuan. This round of financing was led by Bohua Capital, with continued participation from existing shareholders such as Baidu Strategic Investment, Beijing Artificial Intelligence Industry Investment Fund, Qiming Venture Partners, Datayi Capital, and BV Baidu Ventures. Additionally, industrial partners such as Jianfa Emerging Investment also increased their investment. Since its establishment in 2023, Shengshu Technology has been driven by a strong core team composed of technical talents from world-renowned universities such as Tsinghua University, Peking University, Imperial College London, and Carnegie Mellon University.

Sep 19, 2025

AI Daily: Keling AI Launches New Digital Human Features; Tencent Hunyuan New Technology Removes Oil from Large Models; Douyin Launches AI Truth-Seeking Function

Welcome to the [AI Daily] column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technical trends and innovative AI product applications. Fresh AI products click to learn more: https://app.aibase.com/zh1, Keling AI Launches New Digital Human Features: Generate a high-definition video in 1 minute from a single image. Keling AI's digital human feature has achieved a breakthrough from static images to dynamic videos. Users just need to provide a character image

Sep 18, 2025

110

He Gained 500,000 Followers on Douyin in One Month by Making AI Pet Stand-up Comedy

Monetization Idea: Use AI technology to pair audio from human stand-up comedy or podcasts with videos or pictures of pets (cats, dogs, etc.), creating interesting short videos where the pet appears to be speaking. Monetize through traffic and fans. Suitable for: People who love pets, are interested in short video content creation, know basic video editing software operations, and want to attract fans with fun content. Difficulty Level: Moderate. Operation Process: Step 1: Determine Content and Materials Find an audio clip that you think is interesting, such as human stand-up comedy, podcast, or dialogue.

Sep 18, 2025

Kling AI Launches New Digital Human Feature: Generate a 1-Minute HD Video from a Single Image

Chinese AI platform Keling AI launched a new digital human feature, enabling static images to dynamic videos. Users can generate 1-minute 1080p videos with 48FPS by uploading a photo with text/audio. It excels in lip sync and emotional expression via multimodal AI.....

Sep 18, 2025

140

Tencent Hunyuan New Technology Makes Large Models 'Less Oily' to Make AI-Generated Images More Realistic!

Recently, the Tencent Hunyuan team released their latest research findings on their official WeChat account —— SRPO (Semantic Relative Preference Optimization), aimed at improving the realism of AI-generated images, especially addressing the 'oily' issue in the skin texture of the open-source text-to-image model Flux. This innovative technology is expected to bring about revolutionary changes in the image generation field. In today's era where digital art is becoming increasingly popular, the quality of AI-generated images has become particularly important. The Flux model, as a popular foundation model in the open-source text-to-image community, is often criticized for its

Sep 18, 2025

110

Has Hollywood in Your Pocket Caused Legal Trouble? Disney, Universal Studios and Others Sue MiniMax for Copyright Infringement

Disney, Universal, and Warner Bros sued Chinese AI firm MiniMax for allegedly using stolen intellectual property in its 'Hailuo AI' service, featuring copyrighted characters like Darth Vader and Wonder Woman.....

Sep 17, 2025

Freelance Service Market Fiverr Cuts 30% Workforce to Become an AI-First Company

Fiverr announced a 30% workforce reduction (250 employees) to embrace AI-driven 'startup mode.' CEO Kaufman emphasized painful restructuring for a leaner, flatter org, citing AI's potential to automate tasks and unlock new capabilities.....

Sep 17, 2025

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Peking University Joins Forces to Revolutionize Image Retrieval: Accurately Match Sketches, Artworks, and Low-Resolution Images!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Shengshu Technology Secures Several Billion Yuan in Funding, Driving New Trends in AI Commercialization through Video Generation

Musk's AI Company Faces Intensifying Power Struggle, Multiple Executives Leave Due to Discontent with Management Style

Meta Launches Horizon Hyperscape Capture Tool, Quest3 Users Can Create Photo-Realistic VR Scenes

Shengshu Technology Completes a New Round of A-Round Funding Worth Billions of Yuan

AI Daily: Keling AI Launches New Digital Human Features; Tencent Hunyuan New Technology Removes Oil from Large Models; Douyin Launches AI Truth-Seeking Function

He Gained 500,000 Followers on Douyin in One Month by Making AI Pet Stand-up Comedy

Kling AI Launches New Digital Human Feature: Generate a 1-Minute HD Video from a Single Image

Tencent Hunyuan New Technology Makes Large Models 'Less Oily' to Make AI-Generated Images More Realistic!

Has Hollywood in Your Pocket Caused Legal Trouble? Disney, Universal Studios and Others Sue MiniMax for Copyright Infringement

Freelance Service Market Fiverr Cuts 30% Workforce to Become an AI-First Company

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Peking University Joins Forces to Revolutionize Image Retrieval: Accurately Match Sketches, Artworks, and Low-Resolution Images!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Shengshu Technology Secures Several Billion Yuan in Funding, Driving New Trends in AI Commercialization through Video Generation

Musk's AI Company Faces Intensifying Power Struggle, Multiple Executives Leave Due to Discontent with Management Style

Meta Launches Horizon Hyperscape Capture Tool, Quest3 Users Can Create Photo-Realistic VR Scenes

Shengshu Technology Completes a New Round of A-Round Funding Worth Billions of Yuan

AI Daily: Keling AI Launches New Digital Human Features; Tencent Hunyuan New Technology Removes Oil from Large Models; Douyin Launches AI Truth-Seeking Function

He Gained 500,000 Followers on Douyin in One Month by Making AI Pet Stand-up Comedy

Kling AI Launches New Digital Human Feature: Generate a 1-Minute HD Video from a Single Image

Tencent Hunyuan New Technology Makes Large Models 'Less Oily' to Make AI-Generated Images More Realistic!

Has Hollywood in Your Pocket Caused Legal Trouble? Disney, Universal Studios and Others Sue MiniMax for Copyright Infringement

Freelance Service Market Fiverr Cuts 30% Workforce to Become an AI-First Company

GEO Services