
MoH: Multi-Head Attention as Mixture-of-Head Attention

Created: 2024-10-08T15:52:37
Updated: 2025-03-26T22:51:42
https://arxiv.org/abs/2410.11842
Stars: 233
Stars Increase: 0

Related projects