Loading market data...

NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing

MarkTechPostMay 9, 2026 at 10:24 PM

NVIDIA researchers have introduced Star Elastic, a post-training method that embeds multiple nested reasoning models — at 30B, 23B, and 12B parameter scales — inside a single checkpoint, eliminating the need for separate training runs or stored model weights per variant.

Built on the Nemotron Elastic framework and applied to Nemotron Nano v3, the method trains all three variants in a single 160B-token run, achieving a 360× token reduction compared to pretraining each model from scratch.

This is a summary. For the full story, read the original article at MarkTechPost.

Original source: MarkTechPost

Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models

MarkTechPostMay 14, 2026 at 5:46 AM

Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark

NVIDIA BlogMay 13, 2026 at 1:00 PM

NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure

NVIDIA BlogMay 13, 2026 at 1:00 PM

← Back to all articles

Related Articles

Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models

Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark

NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure