A team from Peking University and the Hong Kong University of Science and Technology has drawn attention with a new training method that achieves GPT-4-level performance with an 8B-parameter medical expert model. This is no small feat, and they also introduce a new concept, the "stability gap," to explain phenomena observed during the continual pre-training of large language models.


First, they found that during continual pre-training, the model's performance on the target domain initially drops before it recovers and improves, rather than rising steadily. To address this, they propose three strategies. The first is to run multiple epochs of pre-training over an appropriately sized data subset, which recovers performance faster than a single pass over a much larger dataset. The second is to select only the highest-quality subset of the corpus for this multi-epoch pre-training. The third is to mix in data that approximates the original pre-training distribution, which keeps the model more stable.
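As a rough illustration only, the Python sketch below shows how these three strategies could fit together: rank documents by quality and keep a fixed token budget, mix the domain subset with general-domain data to stay close to the original pre-training distribution, and then train for several epochs over that mixture. The helper names here (`quality_fn`, `model.train_step`, the whitespace token count, the mixing ratio) are placeholders for illustration, not the paper's actual implementation.

```python
import random

def select_high_quality_subset(corpus, quality_fn, token_budget):
    """Keep the highest-scoring documents until the token budget is reached.

    `quality_fn` stands in for whatever quality scorer is used
    (e.g. a classifier or perplexity filter); it is an assumption here.
    """
    ranked = sorted(corpus, key=quality_fn, reverse=True)
    subset, used = [], 0
    for doc in ranked:
        n_tokens = len(doc.split())  # crude token count, for illustration only
        if used + n_tokens > token_budget:
            break
        subset.append(doc)
        used += n_tokens
    return subset

def mix_with_general_data(domain_subset, general_corpus, domain_ratio=0.5):
    """Mix domain documents with general-domain documents so the overall
    distribution stays closer to the original pre-training distribution."""
    n_general = int(len(domain_subset) * (1 - domain_ratio) / domain_ratio)
    mixed = domain_subset + random.sample(
        general_corpus, min(n_general, len(general_corpus))
    )
    random.shuffle(mixed)
    return mixed

def continual_pretrain(model, domain_corpus, general_corpus, quality_fn,
                       token_budget=5_000_000_000, epochs=4):
    """Train for several epochs over a fixed high-quality, mixed subset
    instead of a single pass over a much larger corpus."""
    subset = select_high_quality_subset(domain_corpus, quality_fn, token_budget)
    data = mix_with_general_data(subset, general_corpus)
    for _ in range(epochs):
        random.shuffle(data)
        for doc in data:
            model.train_step(doc)  # placeholder for the actual training step
    return model
```

The key design point this sketch tries to capture is that the training budget is spent on repeated passes over a small, carefully chosen and distribution-matched subset rather than on a single pass over everything available.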

These strategies delivered significant gains in continual pre-training and instruction tuning for the medical domain, improving effectiveness while reducing the amount of compute required. Moreover, their open-source Llama-3-Physician-8B model is now available on HuggingFace.

The significance of this research goes further. They also found that, with these strategies, the OpenLLaMa model needs only 4 epochs of training on a 5-billion-token high-quality subset to significantly outperform all baselines on medical tasks. This not only improves performance but also greatly reduces the consumption of computational resources.

Even more impressive, their Llama-3-Physician-8B-instruct model not only outperforms other models of the same size on medical question-answering tasks but also surpasses the closed-source GPT-3.5, approaching the level of GPT-4. This is a notable advance for AI in the medical field.

This research not only provides a new training method but also shows the considerable potential of large language models in medicine. Through continual pre-training and instruction fine-tuning, higher performance can be achieved in specific domains while reducing computational cost. That is welcome news for the medical industry.

The work is also a reminder that training large language models is not achieved overnight; it requires continuous optimization and adjustment. By introducing the concept of the "stability gap," we can better understand and resolve problems that arise during model training, allowing models to play a greater role in specific domains. This is both a technical breakthrough and a useful insight for the medical industry.

Link to the paper: https://arxiv.org/abs/2406.14833

Open-source model: https://huggingface.co/YiDuo1999/Llama-3-Physician-8B-Instruct