NVIDIA: Llama Nemotron Embed VL 1B V2 (free)
Free variant

nvidia/llama-nemotron-embed-vl-1b-v2:free

Released Feb 25, 2026 | 131,072 context
$0/M input tokens | $0/M output tokens

The Llama Nemotron Embed VL 1B V2 embedding model is optimized for multimodal question-answering retrieval. The model can embed "documents" supplied as an image, as text, or as image and text combined. Documents are retrieved against a user query given in text form. The model supports images containing text, tables, charts, and infographics.
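As an illustration of the retrieval step described above, the following is a minimal sketch that ranks already-embedded documents against a query embedding. It assumes cosine similarity as the scoring function, which this page does not specify; the function names, document ids, and 3-dimensional toy vectors are hypothetical stand-ins for real model output.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for embeddings of an image, a table, and a text note.
docs = {
    "chart.png": [0.9, 0.1, 0.0],
    "table.png": [0.1, 0.9, 0.0],
    "notes.txt": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]  # stand-in for the embedding of a text query

ranking = rank_documents(query, docs)
print(ranking[0][0])  # prints "chart.png"
```

Because image and text documents are embedded into the same vector space, the same ranking function can serve both without knowing which modality produced each vector.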

Note: For the free endpoint, all prompts and outputs are logged to improve the provider's model, products, and services. Please do not upload any personal, confidential, or otherwise sensitive information. This endpoint is for trial use only. Do not use it for production or business-critical systems.

Providers for Llama Nemotron Embed VL 1B V2 (free)

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Sample code and API for Llama Nemotron Embed VL 1B V2 (free)

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible embeddings API that you can call either directly or through the OpenAI SDK.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.
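A direct call might look like the sketch below, which uses only the Python standard library. It assumes the endpoint lives at https://openrouter.ai/api/v1/embeddings and accepts an OpenAI-style body with "model" and "input" fields, and that the optional OpenRouter-specific headers are HTTP-Referer and X-Title; none of these details are stated on this page, so check them against the API reference. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: OpenAI-style embeddings path under OpenRouter's API base URL.
OPENROUTER_EMBEDDINGS_URL = "https://openrouter.ai/api/v1/embeddings"
MODEL = "nvidia/llama-nemotron-embed-vl-1b-v2:free"


def build_embedding_request(texts, api_key, referer=None, title=None):
    """Build (but do not send) a POST request for a batch of text inputs."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    # Optional attribution headers (assumed names); omit them if not needed.
    if referer:
        headers["HTTP-Referer"] = referer
    if title:
        headers["X-Title"] = title
    body = json.dumps({"model": MODEL, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        OPENROUTER_EMBEDDINGS_URL, data=body, headers=headers, method="POST"
    )


def embed(texts, api_key, **header_options):
    """Send the request and return one embedding vector per input text.

    Assumes an OpenAI-style response: {"data": [{"embedding": [...]}, ...]}.
    """
    request = build_embedding_request(texts, api_key, **header_options)
    with urllib.request.urlopen(request, timeout=30) as response:
        payload = json.load(response)
    return [item["embedding"] for item in payload["data"]]


# Example usage (performs a network call, so it is left commented out):
# import os
# vectors = embed(["What does the Q3 revenue chart show?"],
#                 os.environ["OPENROUTER_API_KEY"], title="My Demo App")
```

Separating request construction from sending keeps the header and body logic easy to inspect without network access. Since the free endpoint logs all prompts, use only non-sensitive test inputs.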

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.