violet
PublicViolet: A Vision-Language model for generating Arabic image captions using a Gemini Decoder and pretrained transformer.
Violet: A Vision-Language model for generating Arabic image captions using a Gemini Decoder and pretrained transformer.