beta-DPO

Public

[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Created: 2024-05-22T16:17:20
Updated: 2025-02-26T17:38:41
Stars: 41
Stars increase: 0
