Nemotron-3-Nano-Omni-30B

Overview

Nemotron-3-Nano-Omni-30B-A3B-Reasoning is a 30-billion parameter multimodal model from NVIDIA announced without a blog post on Hugging Face in April 2026. The model combines audio, image, and video inputs with text reasoning, and publishes as both BF16 and GGUF weights. The stealth release pattern (community discovery via r/LocalLLaMA rather than official announcement) and the A3B designation suggest an active-parameter mixture-of-experts architecture that extends NVIDIA’s Nemotron reasoning-model lineage into multimodal territory.

Timeline

2026-04-29-AI-Digest — Nemotron-3-Nano-Omni-30B-A3B-Reasoning appears on Hugging Face in BF16 and GGUF formats without accompanying NVIDIA blog post; community discovery via r/LocalLLaMA (183 upvotes, 71 comments). Size class and multimodal scope landing as stealth drop rather than launch event; treat as preliminary until NVIDIA documents.

Key Developments

Multimodal Reasoning Integration: Audio + image + video → text reasoning in a 30B-parameter model extends Nemotron’s reasoning capabilities into the multimodal space, signaling NVIDIA’s pursuit of unified reasoning-agent models across modality boundaries.
Stealth-Release Pattern: The unannounced Hugging Face upload mirrors the “leaks first, post second” pattern of large-shop model releases—particularly common among hyperscalers testing community response before formal launch.