Alibaba recently released two versions of its multi-modal large model Qwen-VL-Plus and Qwen-VL-Max. Both versions have achieved significant breakthroughs in text-image tasks and visual reasoning, surpassing the current industry-leading GPT-4V and Gemini in performance. This move marks a new stage in technological competition in the field of multi-modal large models. Alibaba has demonstrated strong technical strength and innovation capabilities in this field, providing new possibilities for the development of future AI applications.
Alibaba launched Qwen-VL-Plus and Qwen-VL-Max versions, which have made significant progress in text-image tasks and visual reasoning respectively, surpassing GPT-4V and Gemini. This marks a new round of technological upgrades in the field of multimodal models.
The release of Qwen-VL-Plus and Qwen-VL-Max heralds the wider application of multi-modal AI technology, bringing more opportunities for innovation and efficiency improvement to all walks of life. Alibaba’s continued investment and technological breakthroughs in the field of artificial intelligence are worth looking forward to. In the future, we will see more innovative applications emerging based on Qwen-VL series models.