AIbase
Product LibraryTool Navigation

Multi-Modal-Transformer

Public

The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.

Creat2021-04-07T14:19:31
Update2025-02-10T06:17:31
225
Stars
-1
Stars Increase