A powerful new contender has emerged in the FinTech arena. The Fin-R1 model, jointly developed by Professor Zhang Liwen's team at the School of Statistics and Data Science, Shanghai University of Finance and Economics (SUFE-AIFLM-Lab), and Caiyue Xingchen, has been officially open-sourced, garnering significant industry attention due to its impressive performance.
This financial-specific large language model, based on Qwen2.5-7B and trained using reinforcement learning, achieves leading performance across multiple financial benchmark tests.
Remarkably, Fin-R1, with only 7B parameters, surpasses most competitors of similar size, and even those with tens of times more parameters. It excels in crucial tasks such as Financial Table Reasoning (FinQA) and Conversational Financial QA (ConvFinQA), demonstrating exceptional understanding of the financial domain.
Tailored for reasoning and analytical tasks in core financial business scenarios, Fin-R1 boasts an extensive range of capabilities. It can proficiently write financial code, build pricing models and risk assessment scripts; accurately perform quantitative analysis and report calculations; fluently generate English financial models and professional reports; provide financial security and compliance analysis; implement anti-fraud measures, default prediction, and other intelligent risk control functions; and even conduct ESG (Environmental, Social, and Governance) sustainability analysis.
Fin-R1's development leveraged advanced techniques. The research team built the model architecture based on Qwen2.5-7B-Instruct and utilized the DeepSeek-R1 framework for "data distillation" and "dual-wheel quality screening." A training method combining supervised fine-tuning (SFT) with high-quality chain-of-thought data and reinforcement learning (RL) successfully created this AI assistant for the financial industry.
Notably, Fin-R1 supports both Chinese and English environments, enabling financial modeling, report generation, and conversational interaction in both languages, showcasing strong cross-lingual capabilities. The release of this open-source model will provide strong support for the digital transformation of the financial industry and promises to become a valuable tool for financial analysts, risk management experts, and investment advisors.