Google: Gemini Pro Vision 1.0

google/gemini-pro-vision

Updated Dec 13 · 45,875 context
$0.125/M input tokens · $0.375/M output tokens · $2.5/K input images
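
As a rough illustration of how the listed rates combine, the sketch below estimates the cost of a single request. It assumes the prices are in US dollars, that "M" means per million tokens and "K" means per thousand images, and that billing is linear with no minimums or rounding; the function name `estimate_cost` is hypothetical and not part of any API.

```python
# Rates copied from the listing above (assumed to be USD).
INPUT_PRICE_PER_M_TOKENS = 0.125   # per 1,000,000 input tokens
OUTPUT_PRICE_PER_M_TOKENS = 0.375  # per 1,000,000 output tokens
IMAGE_PRICE_PER_K_IMAGES = 2.5     # per 1,000 input images


def estimate_cost(input_tokens: int, output_tokens: int, input_images: int = 0) -> float:
    """Hypothetical helper: estimate request cost assuming simple linear pricing."""
    token_cost = (
        input_tokens / 1_000_000 * INPUT_PRICE_PER_M_TOKENS
        + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M_TOKENS
    )
    image_cost = input_images / 1_000 * IMAGE_PRICE_PER_K_IMAGES
    return token_cost + image_cost


if __name__ == "__main__":
    # Example: a prompt with 2,000 input tokens and one image, 500 output tokens.
    print(f"${estimate_cost(2_000, 500, input_images=1):.6f}")
```

Under these assumptions a single image costs as much as 20,000 input tokens, so image count tends to dominate the bill for short prompts.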

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response.

See the benchmarks and prompting guidelines from DeepMind.

Usage of Gemini is subject to Google's Gemini Terms of Use.

#multimodal