
visual-search-machine

Public

A custom Vision-Language Model (VLM) built from scratch. It uses SigLIP for contrastive learning and a ViT-based encoder to generate meaningful image captions and semantic descriptions.
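
The description does not show how the contrastive objective is implemented in this repository. As a rough illustration only, the pairwise sigmoid loss published with SigLIP can be sketched in plain Python as below. The function names, the toy embeddings, and the fixed `temperature` and `bias` values are assumptions made for this example (in SigLIP both are learnable parameters); none of it is taken from the project's code.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (assumes a non-zero vector)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def sigmoid_contrastive_loss(image_embs, text_embs, temperature=10.0, bias=-10.0):
    """Pairwise sigmoid loss over a batch of image/text embeddings.

    Pair (i, i) is treated as a positive and every (i, j) with i != j as a
    negative. temperature and bias are fixed here for simplicity (assumption).
    """
    n = len(image_embs)
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]

    total = 0.0
    for i in range(n):
        for j in range(n):
            similarity = sum(a * b for a, b in zip(images[i], texts[j]))
            logit = temperature * similarity + bias
            label = 1.0 if i == j else -1.0
            total += log_sigmoid(label * logit)
    # Average over the batch size, with the sign flipped so lower is better.
    return -total / n


# Toy batch: two images and two captions in a 2-D embedding space.
aligned = sigmoid_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                   [[1.0, 0.0], [0.0, 1.0]])
swapped = sigmoid_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                   [[0.0, 1.0], [1.0, 0.0]])
print(aligned < swapped)
```

Unlike a softmax-based contrastive loss, each image/text pair is scored independently, so the loss needs no normalization across the whole batch. In the toy batch, the correctly paired captions yield a lower loss than the swapped ones.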

Created: 2025-03-21T19:38:42
Updated: 2025-04-06T02:29:36
Stars: 0
Stars Increase: 0