ByteDance Launches PaSa: An Intelligent Academic Paper Search Agent Based on Large Language Models

AIbase基地

Published in AI News · 6 min read · Jan 25, 2025


In academic research, literature retrieval is a complex and crucial information-gathering task. Researchers need search tools that can handle sophisticated queries within specialized knowledge domains. However, existing academic search platforms, such as Google Scholar, often struggle with such queries. For instance, a specialized query about non-stationary reinforcement learning using the UCB (upper confidence bound) method demands stronger computational and analytical abilities than these platforms offer. As a result, researchers typically spend significant time and effort manually browsing vast academic databases when conducting literature reviews.

Although several studies have explored applying large language models (LLMs) to academic paper retrieval and scientific discovery, traditional search tools still fall short of the complex needs of specialized research. Many studies focus on developing LLM agents through optimization frameworks and prompt engineering. While methods such as the AGILE RL framework have significantly improved the overall capabilities of these agents, a fully autonomous and precise academic paper retrieval solution has not yet emerged, leaving a substantial research gap.

Recently, researchers from ByteDance Research Institute and Peking University jointly proposed PaSa, an innovative LLM-based paper search agent. PaSa can autonomously execute complex search strategies, including tool invocation, paper reading, and reference selection, aiming to generate comprehensive and accurate results for complex academic queries. To optimize PaSa's performance, the research team created AutoScholarQuery, a synthetic dataset containing 35,000 fine-grained academic queries, and established RealScholarQuery as a benchmark to evaluate the agent's actual performance. The system utilizes reinforcement learning techniques to enhance search capabilities, addressing the major limitations of existing academic search methods.

The PaSa system consists of two LLM agents, the Crawler and the Selector, which work together to perform comprehensive academic paper searches. The Crawler first analyzes the user's query, generates multiple refined search queries, retrieves relevant papers, and adds them to a dedicated paper queue. It then processes each queued paper, identifies and explores key citations that may expand the research scope, and dynamically appends newly discovered relevant papers to the queue. Finally, the Selector evaluates whether each paper meets the requirements of the original query.
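The Crawler and Selector workflow described above can be illustrated with a minimal Python sketch. This is not PaSa's actual code: the toy `PAPERS` corpus, the keyword-based `search` tool, and the `generate_queries` and `is_relevant` functions are all hypothetical stand-ins for the real search tool and the two LLM agents. For simplicity the sketch follows every citation, whereas the Crawler as described chooses which citations to explore.

```python
from collections import deque

# Hypothetical toy corpus: paper id -> title and the ids of papers it cites.
PAPERS = {
    "p1": {"title": "UCB bandits in non-stationary environments", "cites": ["p3", "p4"]},
    "p2": {"title": "Non-stationary reinforcement learning with UCB exploration", "cites": []},
    "p3": {"title": "A survey of image segmentation", "cites": []},
    "p4": {"title": "Sliding-window UCB for non-stationary rewards", "cites": []},
}


def search(query):
    """Stub search tool: return papers whose title contains every query term."""
    terms = query.lower().split()
    return [pid for pid, paper in PAPERS.items()
            if all(term in paper["title"].lower() for term in terms)]


def generate_queries(user_query):
    """Stand-in for the Crawler LLM: the second query is a hard-coded example refinement."""
    return [user_query, "UCB bandits"]


def is_relevant(user_query, title):
    """Stand-in for the Selector LLM: accept titles sharing at least two query terms."""
    terms = set(user_query.lower().split())
    return sum(term in title.lower() for term in terms) >= 2


def pasa_like_search(user_query, max_papers=50):
    queue = deque()   # the paper queue the Crawler works through
    seen = set()      # every paper that has ever been queued

    def enqueue(pid):
        if pid not in seen and len(seen) < max_papers:
            seen.add(pid)
            queue.append(pid)

    # Crawler step 1: expand the user query into several searches.
    for query in generate_queries(user_query):
        for pid in search(query):
            enqueue(pid)

    # Crawler step 2: follow citations of queued papers (simplified: follow all).
    while queue:
        pid = queue.popleft()
        for cited in PAPERS[pid]["cites"]:
            enqueue(cited)

    # Selector: keep only the papers judged to satisfy the original query.
    return sorted(pid for pid in seen if is_relevant(user_query, PAPERS[pid]["title"]))


print(pasa_like_search("non-stationary reinforcement learning UCB"))
```

In this toy run, `p4` is reached only through the citation list of `p1`, which shows why citation expansion can surface papers that keyword search alone misses, while the Selector stand-in filters out the unrelated `p3`.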

Experimental results show that PaSa-7b outperforms existing search methods across multiple benchmarks. On the AutoScholarQuery test set, PaSa-7b achieved a 9.64% improvement in recall over PaSa-GPT-4o. Compared with Google-based baselines, the recall improvement for PaSa-7b ranged from 33.80% to 42.64%. In the more challenging RealScholarQuery scenarios, PaSa-7b demonstrated a 30.36% increase in recall and a 4.25% increase in precision.
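For readers less familiar with these metrics, the sketch below computes recall and precision for a single query using the standard set-based definitions. The article does not spell out how the benchmarks compute or average these scores, so the function name and the example paper ids are illustrative assumptions rather than the evaluation code used for PaSa.

```python
def precision_recall(retrieved, relevant):
    """Set-based precision and recall for one query.

    retrieved: paper ids returned by the search system
    relevant:  paper ids that truly answer the query (ground truth)
    """
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)  # relevant papers that were found
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


# Hypothetical example: 4 papers returned, 3 truly relevant, 2 in common.
precision, recall = precision_recall(["p1", "p2", "p3", "p4"], ["p1", "p2", "p5"])
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Recall rewards finding more of the relevant papers, while precision penalizes returning irrelevant ones, which is why the two figures are reported separately.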

Overall, the launch of PaSa marks a significant advancement in academic paper search technology, providing an effective solution for information retrieval in academic research. By combining large language models and reinforcement learning techniques, PaSa greatly reduces the time and effort researchers spend on literature reviews while also providing them with an efficient tool to navigate the increasingly vast and complex academic literature landscape.

Code: https://github.com/bytedance/pasa

Paper: https://arxiv.org/abs/2501.10120

Key Points:

📄 **PaSa is an intelligent academic paper search agent jointly launched by ByteDance and researchers from Peking University.**

🤖 **The system consists of two LLM agents, the Crawler and the Selector, capable of autonomously executing complex search strategies.**

🏆 **Experimental results indicate that PaSa-7b outperforms existing search methods across multiple benchmarks, significantly enhancing the efficiency and accuracy of paper retrieval.**




This article is from AIbase Daily
