C3PO

User Feedback-Based LLM Model Alignment Technique

CommonProductProductivityLLM ModelUser Feedback
C3PO is a user feedback-based LLM model alignment technique that allows for fine-tuning of LLMs based on single feedback sentences, mitigating over-generalization. This technique provides reference implementations, benchmark data, and necessary components to facilitate research on the proposed technique.
Visit

C3PO Alternatives