FlexPrefill

Public

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

large-language-models natural-language-processing research sparse-attention

Creat：2025-02-18T15:02:28

Update：2025-03-26T16:55:18

https://arxiv.org/abs/2502.20766

Stars

Stars Increase

Related projects

D2l Zh

book

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

68775

1个月前

+1today

Gpt_academic

academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

68280

1个月前

+14today

LLaMA Factory

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

47621

1个月前

+4today

Made With ML

data-engineering

Learn how to design, develop, deploy and iterate on production-grade ML applications.

38450

9个月前

+1today

Flowise

artificial-intelligence

Drag & drop UI to build your customized LLM flow

37658

1个月前

Google Research

35413

1个月前

HanLP

dependency-parser

中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理

34944

1个月前

+7today

Khoj

agent

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

29233

2个月前

+16today

NLP Progress

dialogue

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

22841

1个月前

Gpt Researcher

agent

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

21105

1个月前

+2today

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

FlexPrefill

Related projects

D2l Zh

Gpt_academic

LLaMA Factory

Made With ML

Flowise

Google Research

HanLP

Khoj

NLP Progress

Gpt Researcher