Recently, Tencent Technology (Shenzhen) Co., Ltd. disclosed a patent on training large language models, according to the Tianyancha App. The patent, titled "Training Method, Device, Computer Equipment, and Storage Medium for Large Language Models," aims to improve the learning ability and accuracy of large language models through a new training method.

Traditional training of large language models often relies on a single summary of each text, which can lead to overfitting and reduce the accuracy and diversity of the generated content. Tencent's method instead uses two sources of information for the same text: a first summary text and a second summary text. The two summaries contain different amounts of information, and the first summary text includes both correct and incorrect statements, which form the basis for contrastive learning.

This contrastive learning approach lets the model learn from different summaries of the same text. By distinguishing between the correct and incorrect statements in the first summary text, the model avoids the learning errors that come from relying on a single summary. The method is said to improve the model's generalization, so that it performs better on unseen data, and to improve its accuracy, reducing the likelihood of generating incorrect content.
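The patent summary does not spell out the loss function, so the following is only a minimal sketch of one common way to do contrastive learning over correct and incorrect statements: a pairwise margin (hinge) loss. The function name `contrastive_loss`, the `margin` parameter, and the example scores are all hypothetical, and the sketch covers only the correct-versus-incorrect contrast within the first summary text, not the role of the second summary text.

```python
def contrastive_loss(correct_scores, incorrect_scores, margin=1.0):
    """Mean pairwise hinge loss (an assumed choice, not taken from the patent).

    Each score is the model's preference for one statement, for example a
    log-likelihood. The loss is zero only when every correct statement
    outscores every incorrect statement by at least `margin`.
    """
    pairs = [(c, i) for c in correct_scores for i in incorrect_scores]
    if not pairs:
        raise ValueError("need at least one correct and one incorrect statement")
    # Penalize each pair in which the correct statement's lead is below the margin.
    return sum(max(0.0, margin - (c - i)) for c, i in pairs) / len(pairs)


# Hypothetical scores for statements drawn from a first summary text.
well_separated = contrastive_loss([2.0, 1.5], [0.0, 1.0])
poorly_separated = contrastive_loss([0.2], [0.9])
print(well_separated)                      # → 0.125
print(poorly_separated > well_separated)   # → True
```

Minimizing a loss of this kind pushes the model to score correct statements above incorrect ones, which is the general idea behind learning from a summary that mixes both.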

As artificial intelligence technology advances, large language models are being applied ever more widely, showing great potential in areas such as natural language processing, intelligent customer service, and content creation. Tencent's patent marks another technical advance in the training of large language models and may offer new directions for future research and applications.

Further development of this technology can be expected to drive continued progress in intelligent applications, helping industries make better use of artificial intelligence in their digital transformation.