Alibaba Qwen Team Introduces Qwen3.5-LiveTranslate-Flash: Real-Time Multimodal Interpretation Across 60 Languages at 2.8-Second Latency

MarkTechPost, May 20, 2026 at 8:09 AM

Alibaba's Qwen team has released Qwen3.5-LiveTranslate-Flash, a real-time multimodal translation model that processes audio and video simultaneously. The model covers 60 input languages and produces speech output in 29 languages at 2.8 seconds of latency.

Key additions over the previous Qwen3 version include real-time speaker voice cloning, vision-enhanced comprehension via lip movements and on-screen text, and dynamic keyword configuration for domain-specific terminology.

This is a summary. For the full story, read the original article at MarkTechPost.

Original source: MarkTechPost

Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding

MarkTechPost, May 20, 2026 at 7:12 AM

Introducing the Ettin Reranker Family

HuggingFace, May 19, 2026 at 12:00 AM

OlmoEarth v1.1: A more efficient family of models

HuggingFace, May 19, 2026 at 6:38 PM
