SparseAttention
PublicPytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"
Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"