DPO-ST

Public

[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

chain-of-thought dpo math-word-problem

Created: 2024-06-04T23:37:20

Updated: 2025-02-28T09:50:26

https://arxiv.org/abs/2407.18248

Related projects

LeetCode Go

acm-icpc

Solutions to LeetCode by Go, 100% test coverage, runtime beats 100%

33768

8 months ago

+5 today

PDFMathTranslate

Hot

chinese

PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves the layout, supports services such as Google/DeepL/Ollama/OpenAI, and provides CLI/GUI/Docker/Zotero

30396

8 months ago

+64 today

YAPI

ansi

A Collection of useful Methods in Java

27739

2 years ago

-1 today

Etherpad Lite

collaboration

Etherpad: A modern really-real-time collaborative document editor.

17966

8 months ago

+10 today

Awesome Multimodal Large Language Models

chain-of-thought

Latest Advances on Multimodal Large Language Models

16892

8 months ago

+44 today

Numpy Ml

attention

Machine learning, in numpy

16210

8 months ago

+1 today

LaTeX OCR

dataset

pix2tex: Using a ViT to convert images of equations into LaTeX code.

16012

8 months ago

+12 today

Chinese Word Vectors

chinese

100+ Chinese Word Vectors (over a hundred pre-trained Chinese word vectors)

12140

8 months ago

+5 today

LLMSurvey

chain-of-thought

The official GitHub page for the survey paper "A Survey of Large Language Models".

11998

8 months ago

+8 today

Univer

appscript

Univer is a full-stack framework for creating and editing spreadsheets, documents, and slides on both web and server.

11819

8 months ago

+20 today
