x-transformers
PublicA concise but complete full-attention transformer with a set of promising experimental features from various papers
A concise but complete full-attention transformer with a set of promising experimental features from various papers