clip-synthetic-captions

Public

Tiny-scale experiment showing that CLIP models trained using detailed captions generated by multimodal models (CogVLM and LLaVA 1.5) outperform models trained using the original alt-texts on a range of classification and retrieval tasks.

clip cogvlm llava multimodal synthetic-data vision-language-model

Creat：2024-03-05T19:57:49

Update：2024-03-31T02:25:46

Stars

Stars Increase

Related projects

Ollama

Hot

deepseek

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

136140

1周前

+129today

LLaVA

chatbot

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

22095

1个月前

+6today

GenSim

clip

Generating Robotic Simulation Tasks via Large Language Models

15940

1周前

-1today

Sglang

cuda

SGLang is a fast serving framework for large language models and vision language models.

12872

1周前

+12today

SUPIR

deep-learning

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

4973

1周前

-1today

Xtuner

agent

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

4452

1周前

+1today

Zero_nlp

bert

中文nlp解决方案(大模型、数据、模型、训练、推理)

3348

1周前

+2today

LLamaSharp

chatbot

A C#/.NET library to run LLM (?LLaMA/LLaVA) on your local device efficiently.

3093

1周前

+1today

Clip Interrogator

clip

Image to prompt with BLIP and CLIP

2798

2周前

VLM_survey

clip

Collection of AWESOME vision-language models for vision tasks

2642

1周前

+2today

AI News

AI Daily

AI Timeline

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

clip-synthetic-captions

Related projects

Ollama

LLaVA

GenSim

Sglang

SUPIR

Xtuner

Zero_nlp

LLamaSharp

Clip Interrogator

VLM_survey