R1-VL

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.

3709

1个月前

+9today

Awesome LLM Reasoning

awesome

Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 ?

3011

1个月前

InternLM XComposer

chatgpt

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

2817

1个月前

RPG DiffusionMaster

image-editting

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

1797

1个月前

+1today

Sa2VA

computer-vision

? Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

1066

1个月前

+3today

Awesome System2 Reasoning LLM

benchmark

Latest Advances on System-2 Reasoning

960

1个月前

+4today

Ovis

chatbot

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

897

1个月前

VideoChat

asr

实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.

897

1个月前

+3today

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

R1-VL

Related projects

Unilm

MobileAgent

Star Vector

Awesome LLM Reasoning

InternLM XComposer

RPG DiffusionMaster

Sa2VA

Awesome System2 Reasoning LLM

Ovis

VideoChat