TokenFormer
Public[ICLR2025 Spotlight?] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
[ICLR2025 Spotlight?] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters