NVIDIA: Llama Nemotron Embed VL 1B V2
nvidia/llama-nemotron-embed-vl-1b-v2
Released Feb 25, 2026
131,072 context
Prompt tokens measure the size of the input. Reasoning tokens count the model's internal thinking before it responds. Completion tokens measure the total length of the output.