StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension

MarkTechPost

StepFun, the Shanghai-based AI lab, released StepAudio 2.5 Realtime in May 2026. It is an end-to-end real-time speech large language model with fully customizable persona capabilities.

The model connects via a WebSocket API, supports Chinese and English, and ranked first across all five benchmark dimensions tested in April 2026, including an 80.41 human evaluation score and 82.18 on paralinguistic comprehension.

This is a summary. For the full story, read the original article at MarkTechPost.

