Loading market data...

NVIDIA AI Releases Nemotron-Labs-Diffusion: A Tri-Mode Language Model with 6× Tokens Per Forward Over Qwen3-8B

MarkTechPostMay 20, 2026 at 10:41 AM

NVIDIA researchers have released Nemotron-Labs-Diffusion, a language model family that unifies three decoding modes in one architecture. The model supports autoregressive (AR) decoding, diffusion-based parallel decoding, and self-speculation decoding.

It is available in 3B, 8B, and 14B parameter sizes. The family includes base, instruct, and vision-language variants.

This is a summary. For the full story, read the original article at MarkTechPost.

Original source: MarkTechPost

OlmoEarth v1.1: A more efficient family of models

HuggingFaceMay 19, 2026 at 6:38 PM

Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding

MarkTechPostMay 20, 2026 at 7:12 AM

How AI Mode is changing the way people search in the U.S.

Google AIMay 19, 2026 at 5:45 PM

← Back to all articles

Related Articles

OlmoEarth v1.1: A more efficient family of models

Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding

How AI Mode is changing the way people search in the U.S.