NVIDIA has released Nemotron 3 Nano Omni, an open-source omni-modal model that continues the "Nano" line's emphasis on cost-performance and inference efficiency. The model has approximately 30 billion total parameters and supports ultra-long context windows of up to one million tokens.
The model uses a 30B-A3B mixture-of-experts architecture (roughly 3 billion parameters active per token) and interleaves Mamba layers with Transformer layers. The Mamba layers improve long-sequence processing efficiency and memory utilization, while the Transformer layers preserve inference accuracy. According to official figures, this hybrid design can improve memory and computational efficiency by up to four times.
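To see why this hybrid layout saves memory at inference time, consider what each layer type must cache: a Transformer layer's KV cache grows linearly with sequence length, while a Mamba-style state-space layer keeps a fixed-size recurrent state regardless of context length. The sketch below illustrates this with hypothetical layer counts and dimensions; none of the numbers (layer count, attention interval, head sizes) come from NVIDIA's published architecture.

```python
# Illustrative sketch, NOT Nemotron's actual configuration: compares the
# inference-cache footprint of a pure-Transformer stack against a hybrid
# stack that replaces most attention layers with Mamba-style SSM layers.
# All dimensions below are made-up round numbers for illustration.

def kv_cache_elems(seq_len, n_heads=16, head_dim=128):
    # An attention layer caches one key and one value vector per token,
    # so its cache grows linearly with sequence length.
    return 2 * seq_len * n_heads * head_dim

def ssm_state_elems(d_model=2048, state_dim=16):
    # A Mamba-style layer carries a fixed-size recurrent state that is
    # independent of how many tokens have been processed.
    return d_model * state_dim

def pure_attn_cache_elems(seq_len, n_layers=48):
    # Every layer is attention: cache scales as n_layers * seq_len.
    return n_layers * kv_cache_elems(seq_len)

def hybrid_cache_elems(seq_len, n_layers=48, attn_every=6):
    # Keep one attention layer out of every `attn_every` layers; the
    # rest are SSM layers with constant-size state.
    total = 0
    for i in range(n_layers):
        if i % attn_every == attn_every - 1:
            total += kv_cache_elems(seq_len)
        else:
            total += ssm_state_elems()
    return total

if __name__ == "__main__":
    seq_len = 1_000_000  # a million-token context
    ratio = pure_attn_cache_elems(seq_len) / hybrid_cache_elems(seq_len)
    print(f"cache reduction at {seq_len:,} tokens: {ratio:.1f}x")
```

With these toy settings (1 attention layer per 6), the SSM states are negligible at million-token lengths, so the cache shrinks by nearly the attention-thinning factor. The real multiplier depends on the actual layer mix, which is why vendors quote figures like "up to fourfold" rather than a single constant.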
According to the industry benchmark MediaPerf, Nemotron 3 Nano Omni achieves the highest throughput across all evaluated tasks and the lowest inference cost on video-level annotation tasks. Under a fixed user-interaction latency threshold, the model delivers an effective system capacity approximately 9.2 times that of other open omni-modal models on video reasoning tasks, and about 7.4 times on multi-document reasoning tasks.
AI and software companies that have already adopted Nemotron 3 Nano Omni include Aible, Applied Scientific Intelligence (ASI), Eka Care, Foxconn, H Company, Palantir, and Pyler, while Dell Technologies, DocuSign, Infosys, K-Dense, Lila, Oracle, and Zefr are currently evaluating the model.