Is Nemotron 3 Super (free) free?

Yes. The pricing shown on this page for Nemotron 3 Super (free) is zero, so you are not charged for prompt or completion tokens. Free endpoints are rate limited — see the rate limit docs.

What is the context length of Nemotron 3 Super (free)?

Nemotron 3 Super (free) has a 262,144 token context window. It supports up to 262,144 completion tokens.

Does Nemotron 3 Super (free) support tool calling and structured outputs?

Yes. Nemotron 3 Super (free) accepts tools and tool_choice for function calling. It also supports structured outputs via a JSON schema in response_format.

What other text models does Nvidia have?

Nemotron 3.5 Content Safety (free), Nemotron 3 Ultra, Nemotron 3 Nano Omni (free) and 3 more are other text models from Nvidia.

When was Nemotron 3 Super (free) released?

Nemotron 3 Super (free) was released on 2026-03-11.

For the free endpoint, please do not upload any confidential information or personal data (such as voices or faces of people). Your use is logged for security purposes and to improve NVIDIA products and services. The logged session data for improvement purposes is not linked to your identity or any persistent identifier. For more information about NVIDIA's data processing practices, see Privacy Policy(opens in new tab). By using this free endpoint, you consent to NVIDIA's collection, recording, and use of such information and the NVIDIA API Trial Terms of Service(opens in new tab)

NVIDIA: Nemotron 3 Super (free)

Name: NVIDIA: Nemotron 3 Super (free)
Author: nvidia

nvidia/nemotron-3-super-120b-a12b:free

Model weights

Compare

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50% higher token generation compared to leading open models.

The model features a 1M token context window for long-term agent coherence, cross-document reasoning, and multi-step task planning. Latent MoE enables calling 4 experts for the inference cost of only one, improving intelligence and generalization. Multi-environment RL training across 10+ environments delivers leading accuracy on benchmarks including AIME 2025, TerminalBench, and SWE-Bench Verified.

Fully open with weights, datasets, and recipes under the NVIDIA Open License, Nemotron 3 Super allows easy customization and secure deployment anywhere — from workstation to cloud.

Modalities

Price

Free

Context

262K

Released

Mar 11, 2026

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

Model weights

Compare

Fully open with weights, datasets, and recipes under the NVIDIA Open License, Nemotron 3 Super allows easy customization and secure deployment anywhere — from workstation to cloud.

Modalities

Price

Free

Context

262K

Released

Mar 11, 2026

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Nemotron 3 Super (free)?

Is Nemotron 3 Super (free) free?

What is the context length of Nemotron 3 Super (free)?

Does Nemotron 3 Super (free) support tool calling and structured outputs?

What other text models does Nvidia have?

When was Nemotron 3 Super (free) released?

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Nemotron 3 Super (free)?

Is Nemotron 3 Super (free) free?

What is the context length of Nemotron 3 Super (free)?

Does Nemotron 3 Super (free) support tool calling and structured outputs?

What other text models does Nvidia have?

When was Nemotron 3 Super (free) released?