AIbase
Product LibraryTool Navigation

Random-MoE-as-Dropout

Public

[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal, Shiwei Liu, Zhangyang Wang

Creat2023-02-19T06:33:11
Update2025-01-07T03:42:27
48
Stars
0
Stars Increase