SCoRe

Public

SCoRe: Training Language Models to Self-Correct via Reinforcement Learning

fine-tuning language-model reinforcement-learning self-correction

Created: 2024-10-06T00:51:09

Updated: 2025-02-23T05:26:54

Related projects

Annotated_deep_learning_paper_implementations

Hot

attention

60+ Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), reinforcement learning (ppo, dqn), capsnet, distillation, ...

60346

1 month ago

+76 today

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

47790

1 month ago

+169 today

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

46948

7 months ago

+46948 today

Llama_index

Hot

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

41266

1 month ago

+60 today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory!

37618

1 month ago

+97 today

Open Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

37331

1 month ago

+7 today

Mlc Llm

language-model

Universal LLM Deployment Engine with ML Compilation

20493

1 month ago

+12 today

Ml Agents

deep-learning

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

18028

1 month ago

+6 today

Tensor2tensor

deep-learning

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

16089

1 month ago

+8 today

DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge sources while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.

15579

1 year ago

+3 today
