Loading market data...

StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension

MarkTechPostMay 24, 2026 at 10:51 PM

StepFun, the Shanghai-based AI lab, released StepAudio 2. 5 Realtime in May 2026 — an end-to-end real-time speech large language model with fully customizable persona capabilities.

The model connects via a WebSocket API, supports Chinese and English, and ranked first across all five benchmark dimensions tested in April 2026, including an 80. 41 human evaluation score and 82. 18 on paralinguistic comprehension. The post StepFun Releases StepAudio 2.

This is a summary. For the full story, read the original article at MarkTechPost.

Original source: MarkTechPost

Meet OmniVoice Studio: A Local, Open-Source Alternative to ElevenLabs

MarkTechPostMay 26, 2026 at 7:56 AM

WorkOS Releases auth.md: An Open Agent Registration Protocol Built on OAuth Standards

MarkTechPostMay 25, 2026 at 7:38 AM

NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule

MarkTechPostMay 24, 2026 at 7:42 AM

← Back to all articles

Related Articles

Meet OmniVoice Studio: A Local, Open-Source Alternative to ElevenLabs

WorkOS Releases auth.md: An Open Agent Registration Protocol Built on OAuth Standards

NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule