The editor of Downcodes learned that Alibaba Cloud Bailian platform has recently launched the Qwen2.5-Turbo million long text model. This model was developed by the Tongyi Qianwen team and has the ability to process ultra-long texts of up to 1 million tokens. In long text processing Significant breakthroughs have been made in the field. This breakthrough will bring users more powerful text processing capabilities and expand the boundaries of AI applications. The Qwen2.5-Turbo model surpassed GPT-4 in multiple long text evaluations, demonstrating its advantages in accuracy and efficiency, and providing more powerful services at a lower cost.
This new version of the model achieved 100% accuracy in long text retrieval tasks, and scored 93.1 on the long text evaluation set RULER, surpassing GPT-4. In long text tasks close to real scenes such as LV-Eval and LongBench-Chat, Qwen2.5-Turbo surpasses GPT-4o-mini in most dimensions. In the short text benchmark test, Qwen2.5-Turbo also performed very well, significantly surpassing the previous open source model with a context length of 1M tokens.
The Qwen2.5-Turbo model has a wide range of application scenarios, including in-depth understanding of novels, large-scale code assistants, reading of multiple papers, etc. It can process 10 novels, 150 hours of speeches, or 30,000 lines of code at one time. In terms of reasoning speed, the Tongyi Qianwen team compressed the calculation amount by about 12.5 times through the sparse attention mechanism, and reduced the first word return time of processing 1M tokens context from 4.9 minutes to 68 seconds, achieving a 4.3 times speed increase.
Alibaba Cloud Bailian platform provides all users with the ability to directly call Qwen2.5-Turbo API, and provides a limited-time gift of 10 million tokens. The cost of subsequent use of one million tokens is only 0.3 yuan.
At present, Alibaba Cloud Bailian platform has launched more than 200 domestic and foreign mainstream open source and closed source large models, including Qwen, Llama, and ChatGLM, supporting users to directly call, train and fine-tune or create RAG applications.
The emergence of the Qwen2.5-Turbo model marks a significant progress in long text processing technology. Its broad application prospects and efficient performance will bring more possibilities to all walks of life. The open strategy of Alibaba Cloud Bailian platform also provides developers with convenient access and promotes the development and application of AI technology. The editor of Downcodes looks forward to more innovative applications based on this model!