NVIDIA: Llama Nemotron Rerank VL 1B V2 (free)
nvidia/llama-nemotron-rerank-vl-1b-v2:free
Llama Nemotron Rerank VL 1B V2 is a 1.7B-parameter multimodal reranking model from NVIDIA. It scores the relevance of document images and text against user queries and is designed for vision RAG pipelines that handle charts, tables, infographics, and mixed-media documents. It functions as a cross-encoder: it accepts a text query paired with an image, text, or combined image-and-text document input, and it delivers approximately 6-7% recall improvements over embedding-only baselines on visual document retrieval benchmarks.
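To illustrate where a cross-encoder reranker sits in a retrieval pipeline, here is a minimal Python sketch of the rerank step: each (query, document) pair is scored jointly and the candidates are reordered by that score. Everything in it is hypothetical and for illustration only. `Candidate`, `toy_score`, and `rerank` are made-up names, and `toy_score` is a word-overlap placeholder standing in for a call to the model, not the model's actual API or scoring behavior.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Candidate:
    """A retrieved document; either field may be absent (hypothetical type)."""
    doc_id: str
    text: Optional[str] = None        # extracted text, if any
    image_path: Optional[str] = None  # page image, if any (unused by toy_score)


def toy_score(query: str, doc: Candidate) -> float:
    """Placeholder scorer: fraction of query words present in the document text.

    Assumption: a real pipeline would replace this with a call to the
    reranking model, which scores the query and document together.
    """
    query_words = set(query.lower().split())
    doc_words = set((doc.text or "").lower().split())
    if not query_words:
        return 0.0
    return len(query_words & doc_words) / len(query_words)


def rerank(
    query: str,
    candidates: List[Candidate],
    score_fn: Callable[[str, Candidate], float] = toy_score,
    top_k: int = 3,
) -> List[Candidate]:
    """Score every (query, candidate) pair and return the top_k by score."""
    scored = [(score_fn(query, c), c) for c in candidates]
    # Sort on the score only, highest first; ties keep retrieval order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:top_k]]
```

In a pipeline of this shape, an embedding model would first retrieve a broad candidate set cheaply, and the reranker would then be applied only to those candidates, since scoring each pair jointly costs more than comparing precomputed embeddings.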
Modalities
Price: Free
Context: 10K
Weekly Tokens: 3K
Released: Jun 9, 2026