# MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention
An efficient and scalable attention module that reduces memory usage and improves inference speed in large language models. This repo implements Multi-Head Latent Attention (MLA) as a drop-in replacement for traditional multi-head attention (MHA).
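As a rough sketch of the core idea (illustrative names like `d_latent` and `LatentAttention` are assumptions, not necessarily this repo's API, and details such as DeepSeek's decoupled RoPE keys are omitted): keys and values are derived from a small shared latent vector, so the inference-time KV cache stores only that latent instead of full per-head keys and values.

```python
# Minimal MLA-style sketch in PyTorch, assuming hypothetical names/shapes.
from typing import Optional

import torch
import torch.nn as nn
import torch.nn.functional as F


class LatentAttention(nn.Module):
    def __init__(self, d_model: int, n_heads: int, d_latent: int):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.q_proj = nn.Linear(d_model, d_model, bias=False)
        # Down-projection to a shared KV latent; this is what gets cached.
        self.kv_down = nn.Linear(d_model, d_latent, bias=False)
        # Up-projections from the latent to per-head keys and values.
        self.k_up = nn.Linear(d_latent, d_model, bias=False)
        self.v_up = nn.Linear(d_latent, d_model, bias=False)
        self.out_proj = nn.Linear(d_model, d_model, bias=False)

    def forward(self, x: torch.Tensor, latent_cache: Optional[torch.Tensor] = None):
        B, T, _ = x.shape
        latent = self.kv_down(x)  # (B, T, d_latent)
        if latent_cache is not None:
            # Decode step (assumes T == 1): append to cached latents.
            latent = torch.cat([latent_cache, latent], dim=1)
        S = latent.size(1)
        q = self.q_proj(x).view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        k = self.k_up(latent).view(B, S, self.n_heads, self.d_head).transpose(1, 2)
        v = self.v_up(latent).view(B, S, self.n_heads, self.d_head).transpose(1, 2)
        # Causal mask during prefill; during cached decode every past position is visible.
        out = F.scaled_dot_product_attention(q, k, v, is_causal=latent_cache is None)
        out = out.transpose(1, 2).reshape(B, T, -1)
        return self.out_proj(out), latent  # return the latent as the KV cache
```

Under these assumptions the cache per token shrinks from `2 * n_heads * d_head` values (MHA keys plus values) to `d_latent` values, which is where the memory saving and faster inference come from when `d_latent` is much smaller than the full KV width.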