Light-R1-14B-DS

An open-source 14B-parameter mathematical model, trained using reinforcement learning, with excellent performance.

Common Product •Productivity•Reinforcement Learning•Mathematical Model

Light-R1-14B-DS is an open-source mathematical reasoning model developed by Qihoo 360 Technology Co., Ltd. It was trained with reinforcement learning on top of DeepSeek-R1-Distill-Qwen-14B and scores 74.0 on AIME24 and 60.2 on AIME25, two mathematics competition benchmarks, surpassing many 32B-parameter models. It shows that reinforcement learning can be applied successfully, on a lightweight budget, to a model that has already been fine-tuned for long chain-of-thought reasoning. Because the model is open source, researchers and developers can use it as both a research foundation and a practical tool, for example for mathematical problem-solving in education.

Light-R1-14B-DS Visit Over Time

Monthly Visits: 27,175,375

Bounce Rate: 44.30%

Pages per Visit: 5.8

Visit Duration: 00:04:57

Light-R1-14B-DS Alternatives

Light-R1-14B-DS — An open-source 14B-parameter mathematical model, trained using reinforcement learning, with excellent performance.

Productivity

•Reinforcement Learning•Mathematical Model

612

Light-R1 — Light-R1 is an open-source project focused on long chain-of-thought (long CoT) reasoning, providing a from-scratch training recipe based on curriculum-style SFT, DPO, and RL.

Programming

•Artificial Intelligence•Long-Chain Reasoning

774

Steiner-32b-preview — Steiner is a reasoning model trained on synthetic data, designed to explore multiple reasoning paths and verify them autonomously.

Productivity

•Reasoning Model•Reinforcement Learning

630

SWE-RL — Enhancing the reasoning capabilities of large language models in open-source software evolution through reinforcement learning.

Programming

•Reinforcement Learning•Large Language Model

300

R1-V — Enhances the generalization capabilities of visual language models at a low cost of less than $3.

Programming

•Reinforcement Learning•Visual Language Model

624

Tülu 3 405B — Tülu 3 405B is a large-scale open-source language model enhanced through reinforcement learning.

Programming

•Artificial Intelligence•Natural Language Processing

1494

DeepSeek-R1-Distill-Qwen-1.5B — DeepSeek-R1-Distill-Qwen-1.5B is an open-source language model with efficient inference, suitable for various natural language processing tasks.

Programming

•Natural Language Processing•Reinforcement Learning

3906

DeepSeek-R1-Distill-Qwen-7B — DeepSeek-R1-Distill-Qwen-7B is an open-source reasoning model focusing on mathematics, coding, and reasoning tasks.

Programming

•Reinforcement Learning•Reasoning Model

2286

DeepSeek-R1-Distill-Llama-8B — DeepSeek-R1-Distill-Llama-8B is a high-performance open-source language model suitable for text generation and inference tasks.

Productivity

•Language Model•Inference

2664

DeepSeek-R1-Distill-Qwen-32B — DeepSeek-R1-Distill-Qwen-32B is a high-performance open-source language model suitable for various text generation tasks.

Productivity

•Text Generation•Reinforcement Learning

1722

DeepSeek-R1-Distill-Llama-70B — DeepSeek-R1-Distill-Llama-70B is a large language model optimized using reinforcement learning, focusing on reasoning and conversational capabilities.

Programming

•Large Language Model•Reinforcement Learning

984

DeepSeek-R1 — DeepSeek-R1 is a high-performance reasoning model supporting multiple languages and tasks, suitable for both research and commercial applications.

Chinese Selection

•Artificial Intelligence•Inference Model

9000

self-adaptive-llms — A real-time adaptive framework for unseen tasks using large language models.

Programming

•Artificial Intelligence•Large Language Models

258

PRIME-RL — PRIME enhances the reasoning abilities of language models through implicit reward-driven online reinforcement learning.

Programming

•Reinforcement Learning•Reasoning Capability

330

HuatuoGPT-o1 — A large language model for complex reasoning in the medical field.

Education

•Medical•Complex Reasoning

354

BabelDOC — A library for PDF scientific paper translation and bilingual comparison.

Productivity

•Translation•Document Processing

AGI News — A daily AI newsletter provided by an autonomous AI agent.

Productivity

•News•Newsletter

pdf-document-layout-analysis — A powerful PDF document layout analysis service.

Productivity

•PDF Analysis•OCR

DeepCoder — An open-source 14B parameter programming model with efficient code reasoning capabilities.

Productivity

•Open-source•Programming

SkyReels-A2 — A framework for synthesizing any content in a video diffusion transformer.

Video

•Video Generation•Deep Learning

MegaTTS 3 — A highly efficient speech synthesis model that supports Chinese, English, and speech cloning.

Music

•Speech Synthesis•Deep Learning

Agno — A lightweight library for building multimodal agents.

Productivity

•Multimodal Agent•Open Source

DeepSeek-V3-0324 — A powerful text generation model suitable for various dialogue applications.

Global Trending

•Text Generation•Dialogue System

516

Fin-R1 — A large language model for financial reasoning driven by reinforcement learning.

Productivity

•Finance•Artificial Intelligence

414

HunYuan T1 — The industry's first ultra-large-scale hybrid Mamba reasoning model, with strong reasoning capabilities.

Chinese Selection

•Reasoning Model•Artificial Intelligence

576

HunYuan T1 — An industry-leading deep reasoning large model, optimized for human preferences.

Chinese Selection

•Deep Learning•Reasoning Model

780

Unitree RL GYM — Unitree robot platform for reinforcement learning

agibot_x1_train — Modular humanoid robot for reinforcement learning training

Qwen2.5-Math — World-leading open-source large language model for mathematics

MuKoe — An open-source implementation of MuZero, a distributed AI framework
