vit-gpt2-image-captioning

Public

Fine-tuning an encoder-decoder transformer (ViT-Base-Patch16-224-In21k and DistilGPT2) for image captioning on the COCO dataset

bert coco-dataset distilbert encoder-decoder gpt-2 image-captioning imagenet pre-trained-language-models pytorch torch

Created: 2023-05-11T04:02:32

Updated: 2024-11-29T17:04:38


Related projects

Leedl Tutorial

bert

Hung-yi Lee's Deep Learning Tutorial (recommended by Prof. Hung-yi Lee; the "Apple Book"). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

14957

3 weeks ago

+6 today

Nlp Tutorial

attention

Natural Language Processing Tutorial for Deep Learning Researchers

14543

8 months ago

+2 today

Clip As Service

bert

Scalable embedding, reasoning, ranking for images and sentences with CLIP

12637

3 weeks ago

Transformers Tutorials

bert

This repository contains demos I made with the Transformers library by HuggingFace.

10715

1 month ago

+14 today

Chinese BERT Wwm

bert

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

9923

3 weeks ago



Nlp_chinese_corpus

Bertviz

BERTopic

BERT Pytorch

FasterTransformer