What is the context length of Nemotron 3 Nano Omni (free)?

Nemotron 3 Nano Omni (free) has a 256,000 token context window. It supports up to 65,536 completion tokens.

How much does Nemotron 3 Nano Omni (free) cost?

Nemotron 3 Nano Omni (free) is free to use through OpenRouter.

What providers serve Nemotron 3 Nano Omni (free), and can I use it via API?

Nemotron 3 Nano Omni (free) is available through the OpenRouter API from 1 provider.

What modalities does Nemotron 3 Nano Omni (free) support?

Nemotron 3 Nano Omni (free) accepts text, audio, image, video input and produces text output.

When was Nemotron 3 Nano Omni (free) released?

Nemotron 3 Nano Omni (free) was released on 2026-04-28.

Is Nemotron 3 Nano Omni (free) free?

Yes. The selected Nemotron 3 Nano Omni (free) variant has zero input and output token pricing.

For the free endpoint, please do not upload any confidential information or personal data (such as voices or faces of people). Your use is logged for security purposes and to improve NVIDIA products and services. The logged session data for improvement purposes is not linked to your identity or any persistent identifier. For more information about NVIDIA's data processing practices, see Privacy Policy(opens in new tab). By using this free endpoint, you consent to NVIDIA's collection, recording, and use of such information and the NVIDIA API Trial Terms of Service(opens in new tab).

NVIDIA: Nemotron 3 Nano Omni (free)

Name: NVIDIA: Nemotron 3 Nano Omni (free)
Author: nvidia

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Model weights

Compare

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and audio inputs and produces text output, enabling agents to perceive and reason across modalities in a single inference loop.

Built on a hybrid MoE Transformer-Mamba architecture with Conv3D video layers and Efficient Video Sampling (EVS), it delivers approximately 2× higher throughput and 2.5× lower compute for video reasoning versus separate vision + speech pipelines. It supports up to 300K context length and a 16,384 reasoning budget, with extended thinking enabled via reasoning.enabled on OpenRouter.

Modalities

Price

Free

Context

256K

Released

Apr 28, 2026

Providers

This model is hosted by one provider. OpenRouter forwards every request to it directly — no routing decisions to make.

Effective Pricing

The chart below shows the average price customers are actually paying after prompt caching. Depending on the amount of repeated context you send, this can be 60–80% cheaper than the provider list price. Shown are rolling averages from the past 30 days.

Performance

Throughput is how fast the model writes (tokens per second — higher is better). Latency is total round-trip time (lower is better). TTFT is time-to-first-token — how long before you see anything appear (lower is better).

Uptime

Percent of requests that succeeded over the last 30 days. OpenRouter monitors every provider continuously and automatically retries on the next-best provider when one returns an error.

Benchmarks

Scores on standardized evaluations. Higher percentages are better — and rank percentile shows where this model lands among all models on OpenRouter.

Apps

Public apps that send the most traffic to this model. Good signal for what real production workloads look like — and a hint at which use cases this model is best suited for.

Activity

Token volume and request traffic to this model over time.

Quick Start

Drop-in code to call this model. OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. The only thing that changes between models is the model slug below.

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Nemotron 3 Nano Omni (free)?

What is the context length of Nemotron 3 Nano Omni (free)?

How much does Nemotron 3 Nano Omni (free) cost?

What providers serve Nemotron 3 Nano Omni (free), and can I use it via API?

What modalities does Nemotron 3 Nano Omni (free) support?

When was Nemotron 3 Nano Omni (free) released?

Is Nemotron 3 Nano Omni (free) free?

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Nemotron 3 Nano Omni (free)?

What is the context length of Nemotron 3 Nano Omni (free)?

How much does Nemotron 3 Nano Omni (free) cost?

What providers serve Nemotron 3 Nano Omni (free), and can I use it via API?

What modalities does Nemotron 3 Nano Omni (free) support?

When was Nemotron 3 Nano Omni (free) released?

Is Nemotron 3 Nano Omni (free) free?