Tencent publishes new patent for training large language models to improve generalization and accuracy

Author: Eve Cole  Update Time: 2025-02-14 18:16:01

As artificial intelligence technology develops rapidly, major enterprises have been increasing their R&D investment to drive technological innovation. Tencent Technology (Shenzhen) Co., Ltd. has made progress in training large language models and has applied for a related patent, which has now been published.

Tencent Technology (Shenzhen) Co., Ltd. has published a patent on a training method and related equipment for large language models, as shown on the Tianyancha App. The patent is titled "Training methods, devices, computer equipment and storage media for large language models" and aims to improve the learning ability and accuracy of large language models through a new training method.

When training large language models, traditional methods often rely on a single summary of each text, which can cause the model to overfit and reduce the accuracy and diversity of the generated content. Tencent’s new approach instead introduces two sources of information: a first abstract text and a second abstract text. The two abstract texts carry different amounts of information, and the first abstract text contains both correct and incorrect statements, which form the basis for contrastive learning.

This contrastive learning method lets the model learn from different abstracts of the same text. By distinguishing the correct statements from the incorrect ones in the first abstract text, the model avoids the learning errors that relying on a single summary can cause. The method is intended both to improve the model's generalization, so that it performs better on unseen data, and to improve its accuracy, reducing the probability of generating incorrect content.
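The article does not describe the loss function or training procedure used in the patent. Purely as an illustration of the general idea of contrasting correct and incorrect statements, the sketch below uses a common, generic objective: a pairwise margin (hinge) loss. It assumes a model has already assigned a numeric score to each statement; the function name, the scores, and the margin value are all hypothetical and are not taken from the patent.

```python
from itertools import product


def pairwise_contrastive_loss(correct_scores, wrong_scores, margin=1.0):
    """Mean hinge loss over every (correct, wrong) pair of statement scores.

    Illustrative assumption only: each score is how strongly a model
    "prefers" a statement. The loss is zero once every correct statement
    outscores every wrong statement by at least `margin`.
    """
    pairs = list(product(correct_scores, wrong_scores))
    if not pairs:
        raise ValueError("need at least one correct and one wrong statement")
    return sum(max(0.0, margin - (c - w)) for c, w in pairs) / len(pairs)


# Hypothetical scores for statements drawn from a first abstract text.
well_separated = pairwise_contrastive_loss([2.0, 3.0], [0.0, -1.0])
poorly_separated = pairwise_contrastive_loss([0.5], [0.25])

print(well_separated)    # correct statements already lead by the margin
print(poorly_separated)  # correct statement leads by too little
```

In a training loop, a loss of this kind would be minimized so that the model learns to rank correct statements above incorrect ones; how the second abstract text enters the objective is not specified in the article and is left out here.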

As artificial intelligence technology continues to advance, large language models are being applied more and more widely and have shown great potential in fields ranging from natural language processing to intelligent customer service and content creation. The publication of Tencent’s patent marks another technical advance in large language model training and is expected to provide new directions for related research and applications.

Further development of this technology is expected to drive continued progress in intelligent applications and help all industries make better use of artificial intelligence in their digital transformation.

In short, the advancement of artificial intelligence technology not only improves the effectiveness of existing applications, but also lays a solid foundation for future development.