chipmunk
PublicAccelerate Multimodal AI | 3.7x faster E2E video gen and 1.6x faster E2E image gen | Custom sparse CUDA kernels 9.3x faster than FlashAttention-3 and 2.5x faster than cuBLAS
Accelerate Multimodal AI | 3.7x faster E2E video gen and 1.6x faster E2E image gen | Custom sparse CUDA kernels 9.3x faster than FlashAttention-3 and 2.5x faster than cuBLAS