Llama-3.1-Tulu-3-8B-DPO is a member of the Tülu3 model family focused on instruction adherence. It provides fully open-source data, code, and guidelines intended to serve as a comprehensive resource for modern post-training techniques. This model is designed for a wide range of tasks beyond conversation, such as MATH, GSM8K, and IFEval, achieving state-of-the-art performance. Key advantages include open-source data and code, support for multiple tasks, and excellent performance. The model was developed by the Allen AI Institute and adheres to the Llama 3.1 community licensing agreement for research and educational use.