AIbase
Product LibraryTool Navigation

Travel-Agent-based-on-Qwen2-RLHF

Public

A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain

Creat2024-04-19T15:07:03
Update2025-03-26T14:47:57
8
Stars
0
Stars Increase