Alibaba Qwen Team Introduces Qwen3.5-LiveTranslate-Flash: Real-Time Multimodal Interpretation Across 60 Languages at 2.8-Second Latency
MarkTechPost
Read Full Article at MarkTechPost →
Ad Slot — In-Article (728x90)
Alibaba's Qwen team has released Qwen3. 5-LiveTranslate-Flash, a real-time multimodal translation model that processes audio and video simultaneously. The model covers 60 input languages and produces speech output in 29 languages at 2. 8 seconds of latency.
Key additions over the previous Qwen3 version include real-time speaker voice cloning, vision-enhanced comprehension via lip movements and on-screen text, and dynamic keyword configuration for domain-specific terminology.
This is a summary. For the full story, read the original article at MarkTechPost.
Original source: MarkTechPost