
LLM-From-Scratch

Public

A medical language model trained in three stages: pretraining, instruction tuning, and Direct Preference Optimization (DPO). Training progresses from general medical knowledge to specific instruction following, and includes preference-alignment experiments aimed at improving medical text generation and understanding.
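
For readers unfamiliar with the DPO stage mentioned above, the following is a minimal sketch of the standard per-example DPO loss, not code from this repository. The function name `dpo_loss`, its parameter names, and the default `beta=0.1` are assumptions chosen for illustration; the inputs are assumed to be summed log-probabilities of a preferred ("chosen") and a dispreferred ("rejected") response under the model being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    All names here are hypothetical; beta=0.1 is an illustrative default.
    """
    # How much more the policy likes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(m)) equals log(1 + exp(-m)); branch to avoid overflow.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Example: the policy favors the chosen response more than the reference does,
# so the loss falls below log(2), the value at zero margin.
example = dpo_loss(-10.0, -14.0, -12.0, -12.0)
```

The loss shrinks as the policy raises the chosen response's likelihood relative to the rejected one, measured against the reference model. A real training loop would compute this over batches of tensors with automatic differentiation.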

Created: 2024-10-04T16:15:11
Updated: 2025-02-02T03:03:26
Stars: 27
Stars increase: 0