
llm-summarization

Public

LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset

Created: 2024-03-19T23:27:02
Updated: 2025-03-12T04:28:41
Stars: 10
Stars increase: 0
