Alibaba Cloud has launched Qwen2.5-Omni-7B, a unified end-to-end multimodal model in the Qwen series. Uniquely designed for comprehensive multimodal perception, it can process diverse inputs, including text, images, audio, and videos, while generating real-time text and natural speech responses. This sets a new standard for optimal deployable multimodal AI for edge devices like mobile phones and laptops.
Qwen2.5-Omni-7B delivers remarkable performance across all modalities, rivaling specialized single-modality models of comparable size. Notably, it sets a new benchmark in real-time voice interaction, natural and robust speech generation, and end-to-end speech instruction following.
Qwen2.5-Omni-7B was pre-trained on a vast, diverse dataset, including image-text, video-text, video-audio, audio-text, and text data, ensuring robust performance across tasks.
Read More News on Business News Malaysia
Read More News on Business News Malaysia
Eco World reported stronger 1HFY26 earnings driven by industrial land sales, while robust new sales…
Scoot, the low-cost subsidiary of Singapore Airlines (SIA), is pleased to announce an exciting collaboration…
RICOH Malaysia unveiled AI and automation solutions designed to improve operational efficiency, workflow intelligence, and…
Singapore, June 18, 2026 — Federal Express Corporation, one of the world’s largest express transportation…
Malaysia faces critical challenges like rising costs and political instability while pursuing a future of…
Malaysia has jumped eight spots to rank 15th in the 2026 IMD World Competitiveness Ranking,…
This website uses cookies.