AI Models with Vision
Vision support is now table stakes for document parsing, screenshot understanding, and multimodal agents. Every model below accepts image input. They differ in what that capability costs and in how large a context window they pair it with, which matters most when you are feeding in high-resolution pages rather than single thumbnails.
Showing top 25 of 346 matching models · last synced 18 hours ago
Top pick right now: Aya Vision 8B from Cohere. It pairs image input with a 16K context window. Its output price is listed as N/A, so it cannot be ranked on cost from this data.
| # | Model | Provider | Input $/Mtok | Output $/Mtok | Context (tokens) | Capabilities |
|---|---|---|---|---|---|---|
| 1 | Aya Vision 8B | Cohere | N/A | N/A | 16,000 | vision, open |
| 2 | Aya Vision 32B | Cohere | N/A | N/A | 16,000 | vision, open |
| 3 | gpt-image-1 | OpenAI | N/A | N/A | N/A | vision |
| 4 | gpt-image-1-mini | OpenAI | N/A | N/A | N/A | vision |
| 5 | chatgpt-image-latest | OpenAI | N/A | N/A | N/A | vision |
| 6 | gpt-image-1.5 | OpenAI | N/A | N/A | N/A | vision |
| 7 | Gemma 4 31B IT | | N/A | N/A | 262,144 | vision, tools, reasoning, structured output, open |
| 8 | Gemma 4 26B A4B IT | | N/A | N/A | 262,144 | vision, tools, reasoning, structured output, open |
| 9 | Auto Router | OpenRouter | N/A | N/A | 2,000,000 | vision, tools, reasoning, structured output |
| 10 | Grok Imagine Image Quality | xAI | N/A | N/A | 8,000 | vision |
| 11 | Grok Imagine Video | xAI | N/A | N/A | 1,024 | vision |
| 12 | Grok Imagine Image | xAI | N/A | N/A | 8,000 | vision |
| 13 | Gemma 4 E4B IT | | N/A | N/A | 131,072 | vision, tools, reasoning, structured output, open |
| 14 | Gemma 4 E2B IT | | N/A | N/A | 131,072 | vision, tools, reasoning, structured output, open |
| 15 | Free Models Router | OpenRouter | Free | Free | 200,000 | vision, tools, reasoning, structured output |
| 16 | Seedream 4.5 | OpenRouter | Free | Free | 4,096 | vision, open |
| 17 | FLUX.2 Max | OpenRouter | Free | Free | 46,864 | vision |
| 18 | FLUX.2 Flex | OpenRouter | Free | Free | 67,344 | vision |
| 19 | FLUX.2 Pro | OpenRouter | Free | Free | 46,864 | vision |
| 20 | FLUX.2 Klein 4B | OpenRouter | Free | Free | 40,960 | vision, open |
| 21 | Nemotron 3 Nano Omni (free) | OpenRouter | Free | Free | 256,000 | vision, tools, reasoning, open |
| 22 | Nemotron Nano 12B 2 VL (free) | OpenRouter | Free | Free | 128,000 | vision, tools, reasoning, open |
| 23 | Riverflow V2 Standard Preview | OpenRouter | Free | Free | 8,192 | vision, open |
| 24 | Riverflow V2 Fast Preview | OpenRouter | Free | Free | 8,192 | vision, open |
| 25 | Riverflow V2 Max Preview | OpenRouter | Free | Free | 8,192 | vision, open |
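
Because many rows list price or context as N/A, comparing on cost and context takes a little care. The sketch below is one way to shortlist models from a handful of rows copied from the listing. It assumes that "N/A" means unknown (not zero) and that "Free" means a price of 0. The names `ROWS`, `parse_price`, `parse_context`, and `shortlist` are illustrative and not part of any provider API.

```python
from typing import Optional

# A few rows copied from the listing: (model, output price cell, context cell).
ROWS = [
    ("Aya Vision 8B", "N/A", "16,000"),
    ("gpt-image-1", "N/A", "N/A"),
    ("Auto Router", "N/A", "2,000,000"),
    ("Free Models Router", "Free", "200,000"),
    ("Nemotron 3 Nano Omni (free)", "Free", "256,000"),
    ("Riverflow V2 Fast Preview", "Free", "8,192"),
]


def parse_price(cell: str) -> Optional[float]:
    """Return dollars per Mtok, 0.0 for "Free", or None when unlisted (assumption)."""
    if cell == "N/A":
        return None
    if cell == "Free":
        return 0.0
    return float(cell.lstrip("$"))


def parse_context(cell: str) -> Optional[int]:
    """Return the context window in tokens, or None when unlisted."""
    if cell == "N/A":
        return None
    return int(cell.replace(",", ""))


def shortlist(rows, min_context: int):
    """Names of models with a known price and enough context.

    Sorted cheapest first; a larger context window breaks ties.
    Rows with an unknown price or context are skipped, not treated as zero.
    """
    picked = []
    for name, price_cell, ctx_cell in rows:
        price = parse_price(price_cell)
        ctx = parse_context(ctx_cell)
        if price is None or ctx is None or ctx < min_context:
            continue
        picked.append((price, -ctx, name))
    return [name for _, _, name in sorted(picked)]


print(shortlist(ROWS, min_context=100_000))
```

Skipping N/A rows instead of sorting them as zero keeps unlisted prices from looking like the cheapest option. Whether to exclude them or surface them separately is a judgment call for your own tooling.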
Data synced daily from models.dev. Always verify pricing with the provider before deploying to production.