  • © 2023 – 2025 OpenRouter, Inc
      InoCloud

      Browse models provided by InoCloud (Terms of Service)

      2 models

      • Qwen: Qwen2.5 VL 32B Instruct

        Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysis tasks, including object recognition, textual interpretation within images, and precise event localization in extended videos. Qwen2.5-VL-32B demonstrates state-of-the-art performance across multimodal benchmarks such as MMMU, MathVista, and VideoMME, while maintaining strong reasoning and clarity in text-based tasks like MMLU, mathematical problem-solving, and code generation.

        by qwen | 33K context | $1.10/M input tokens | $1.10/M output tokens
      • Google: Gemma 3 27B

        Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 27B is Google's latest open source model and the successor to Gemma 2.

        by google | 131K context | $0.10/M input tokens | $0.20/M output tokens
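
      The per-million-token prices listed above translate into a per-request cost by simple arithmetic. The sketch below is only an illustration: the dictionary keys are hypothetical labels chosen for this example (not official model identifiers), and `estimate_cost` is an assumed helper name, not part of any OpenRouter or InoCloud API. Prices are the ones shown in the listing.

```python
from decimal import Decimal

# USD per million tokens as (input, output), copied from the listing above.
# The keys are hypothetical labels for this example, not official model IDs.
PRICES = {
    "qwen2.5-vl-32b-instruct": (Decimal("1.10"), Decimal("1.10")),
    "gemma-3-27b": (Decimal("0.10"), Decimal("0.20")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens on each model.
    for name in PRICES:
        print(name, estimate_cost(name, 10_000, 2_000))
```

      `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding noise. Actual billing may differ from this estimate, for example if image inputs are priced separately.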