NVIDIA: Llama Nemotron Rerank VL 1B V2
nvidia/llama-nemotron-rerank-vl-1b-v2
Llama Nemotron Rerank VL 1B V2 is a 1.7B multimodal reranking model from NVIDIA. It evaluates the relevance of document images and text against user queries, designed for vision RAG pipelines handling charts, tables, infographics, and mixed-media documents. Functions as a cross-encoder that accepts text queries paired with image, text, or combined document inputs, delivering approximately 6-7% recall improvements over embedding-only baselines on visual document retrieval benchmarks.
Modalities
Context
10K
Released
Jun 9, 2026
Activity
Token volume and request traffic to this model over time.