CogVLM

A powerful open-source visual language model

Tags: visual language model, image description
CogVLM is a powerful open-source visual language model. CogVLM-17B has 10 billion vision parameters and 7 billion language parameters. CogVLM-17B achieves state-of-the-art performance on 10 classic cross-modal benchmark datasets, including NoCaps, Flickr30K Captions, RefCOCO, RefCOCO+, RefCOCOg, Visual7W, GQA, ScienceQA, VizWiz VQA, and TDIUC, and ranks second on VQAv2, OKVQA, TextVQA, and COCO Captions, surpassing or matching PaLI-X 55B. CogVLM can also chat with you about images.
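
As a minimal sketch of that image-chat capability, the snippet below loads the publicly released Hugging Face checkpoint THUDM/cogvlm-chat-hf (with the Vicuna-7B tokenizer it is documented to use) and asks a single question about a local image; the image path "example.jpg" is a hypothetical placeholder, and a CUDA GPU with enough memory for the 17B weights in bfloat16 is assumed.

# Minimal sketch: single-turn image chat with the THUDM/cogvlm-chat-hf checkpoint.
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, LlamaTokenizer

# CogVLM-17B uses the Vicuna-7B tokenizer for its language side.
tokenizer = LlamaTokenizer.from_pretrained("lmsys/vicuna-7b-v1.5")
model = AutoModelForCausalLM.from_pretrained(
    "THUDM/cogvlm-chat-hf",
    torch_dtype=torch.bfloat16,
    low_cpu_mem_usage=True,
    trust_remote_code=True,   # the modeling code ships with the checkpoint
).to("cuda").eval()

query = "Describe this image."
image = Image.open("example.jpg").convert("RGB")  # hypothetical local image path

# build_conversation_input_ids packs the prompt, chat history, and image into model inputs.
inputs = model.build_conversation_input_ids(tokenizer, query=query, history=[], images=[image])
inputs = {
    "input_ids": inputs["input_ids"].unsqueeze(0).to("cuda"),
    "token_type_ids": inputs["token_type_ids"].unsqueeze(0).to("cuda"),
    "attention_mask": inputs["attention_mask"].unsqueeze(0).to("cuda"),
    "images": [[inputs["images"][0].to("cuda").to(torch.bfloat16)]],
}

with torch.no_grad():
    out = model.generate(**inputs, max_length=2048, do_sample=False)
    out = out[:, inputs["input_ids"].shape[1]:]  # keep only the newly generated answer tokens
    print(tokenizer.decode(out[0], skip_special_tokens=True))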
CogVLM Visit Over Time

Monthly Visits: 503,747,431
Bounce Rate: 37.31%
Pages per Visit: 5.7
Visit Duration: 00:06:44

CogVLM Visit Trend

CogVLM Visit Geography

CogVLM Traffic Sources

CogVLM Alternatives