AIbase
Product LibraryTool Navigation

Federated-RLHF

Public

[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple instances of GPT-2 for personalized sentiment aligned text generation.

Creat2025-01-25T19:56:48
Update2025-03-13T21:31:32
6
Stars
0
Stars Increase