InstructLLaMA

Public

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but on a much smaller scale.

4bit-fine-tune instructgpt llam2 ppo qlora rhlf

Creat：2023-07-08T21:58:22

Update：2025-02-02T09:06:07

Stars

Stars Increase

Related projects

SkyText Chinese GPT3

SkyText是由奇点智源发布的中文GPT3预训练大模型，可以进行文章续写、对话、中英翻译、内容风格生成、推理、诗词对联等不同任务。| SkyText is a Chinese GPT3 pre-trained large model released by Singularity-AI, which can perform different tasks such as chatting, Q&A, and Chinese-English translation.

406

1个月前

InstructGOOSE

chatgpt

Implementation of Reinforcement Learning from Human Feedback (RLHF)

173

1个月前

MiniChatGPT

chatgpt

Mini ChatGPT

7个月前

+12today

ChatGPT4Me

chatgpt

A program that enhances and customizes ChatGPT's underlying pre-trained LLM w/ transformer architecture. Based on OpenAI's beta InstructGPT fine-tune model.

12个月前

Book Mentat

books

Considering how to analyse book collections, Large Language Model style

4个月前

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

InstructLLaMA

Related projects

SkyText Chinese GPT3

InstructGOOSE

MiniChatGPT

ChatGPT4Me

Book Mentat