Model Status
Last tested: 2026-06-17 07:58:13 UTC
268 of 281 models passed. Chat/completion: 246/259. Embeddings: 22/22.
Chat and Completion Models
| Model | Underlying Model | Status | Tokens (p/r/m/t) | Time | Cost |
|---|---|---|---|---|---|
gpt-4o | openai/gpt-4o | ✅ | p=16 r=0 t=17 | 2.5s | $0.00005 |
gpt-4o-mini | openai/gpt-4o-mini | ✅ | p=16 r=0 t=17 | 988ms | $0.000003 |
gpt-4.1 | openai/gpt-4.1 | ✅ | p=16 r=0 t=17 | 1.3s | $0.00004 |
gpt-4.1-mini | openai/gpt-4.1-mini | ✅ | p=16 r=0 t=17 | 880ms | $0.000008 |
gpt-4.1-nano | openai/gpt-4.1-nano | ✅ | p=16 r=0 t=17 | 686ms | $0.000002 |
gpt-5 | openai/gpt-5 | ✅ | p=15 r=192 t=217 | 6.7s | $0.002039 |
gpt-5-chat | openai/gpt-5-chat-latest | ✅ | p=16 r=0 t=17 | 646ms | $0.00003 |
gpt-5-mini | openai/gpt-5-mini | ✅ | p=15 r=64 t=89 | 2.5s | $0.000152 |
gpt-5-mini-high | openai/gpt-5-mini | ✅ | p=15 r=128 t=153 | 2.6s | $0.00028 |
gpt-5-nano | openai/gpt-5-nano | ✅ | p=15 r=0 t=25 | 1.4s | $0.000005 |
gpt-5-nano-high | openai/gpt-5-nano | ✅ | p=15 r=256 t=281 | 3.0s | $0.000107 |
gpt-5-codex | openai/gpt-5-codex | ✅ | p=15 r=0 t=37 | 1.5s | $0.000239 |
gpt-5-high | openai/gpt-5 | ✅ | p=15 r=128 t=153 | 3.5s | $0.001399 |
gpt-5.1 | openai/gpt-5.1 | ✅ | p=15 r=0 t=25 | 811ms | $0.000119 |
gpt-5.1-chat | openai/gpt-5.1-chat-latest | ✅ | p=15 r=0 t=25 | 1.2s | $0.000119 |
gpt-5.1-codex | openai/gpt-5.1-codex | ✅ | p=15 r=0 t=29 | 2.0s | $0.000159 |
gpt-5.1-codex-mini | openai/gpt-5.1-codex-mini | ✅ | p=15 r=0 t=51 | 1.1s | $0.000076 |
gpt-5.1-codex-max | openai/gpt-5.1-codex-max | ✅ | p=15 r=64 t=122 | 1.6s | $0.001089 |
gpt-5.1-high | openai/gpt-5.1 | ✅ | p=15 r=3 t=28 | 828ms | $0.000149 |
gpt-5.2 | openai/gpt-5.2 | ✅ | p=15 r=0 t=19 | 779ms | $0.000082 |
gpt-5.2-chat | openai/gpt-5.2-chat-latest | ✅ | p=15 r=0 t=25 | 3.9s | $0.000166 |
gpt-5.2-codex | openai/gpt-5.2-codex | ✅ | p=15 r=0 t=31 | 1.0s | $0.00025 |
gpt-5.2-high | openai/gpt-5.2 | ✅ | p=15 r=5 t=30 | 965ms | $0.000236 |
gpt-5.2-pro | openai/gpt-5.2 | ✅ | p=15 r=0 t=19 | 2.6s | $0.000082 |
gpt-5.3-codex | openai/gpt-5.3-codex | ✅ | p=15 r=0 t=20 | 933ms | $0.000096 |
gpt-5.3-codex-high | openai/gpt-5.3-codex | ✅ | p=15 r=0 t=20 | 1.2s | $0.000096 |
gpt-5.3-codex-xhigh | openai/gpt-5.3-codex | ✅ | p=15 r=0 t=20 | 924ms | $0.000096 |
gpt-5.4 | openai/gpt-5.4 | ✅ | p=15 r=0 t=19 | 917ms | $0.000098 |
gpt-5.4-mini | openai/gpt-5.4-mini | ✅ | p=15 r=0 t=19 | 4.9s | $0.000029 |
gpt-5.4-nano | openai/gpt-5.4-nano | ✅ | p=15 r=0 t=19 | 733ms | $0.000008 |
gpt-5.4-high | openai/gpt-5.4 | ✅ | p=15 r=9 t=34 | 1.4s | $0.000322 |
gpt-5.4-xhigh | openai/gpt-5.4 | ✅ | p=15 r=17 t=42 | 1.5s | $0.000443 |
gpt-5.4-mini-high | openai/gpt-5.4-mini | ✅ | p=15 r=0 t=19 | 1.3s | $0.000029 |
gpt-5.4-mini-xhigh | openai/gpt-5.4-mini | ✅ | p=15 r=11 t=36 | 946ms | $0.000106 |
gpt-5.4-nano-high | openai/gpt-5.4-nano | ✅ | p=15 r=0 t=19 | 815ms | $0.000008 |
gpt-5.4-nano-xhigh | openai/gpt-5.4-nano | ✅ | p=15 r=23 t=48 | 937ms | $0.000044 |
gpt-5.4-pro | openai/gpt-5.4 | ✅ | p=15 r=0 t=19 | 941ms | $0.000098 |
gpt-5.5 | openai/gpt-5.5 | ✅ | p=15 r=4 t=29 | 1.3s | $0.000495 |
gpt-5.5-low | openai/gpt-5.5 | ✅ | p=15 r=0 t=19 | 1.1s | $0.000195 |
gpt-5.5-medium | openai/gpt-5.5 | ✅ | p=15 r=0 t=19 | 1.8s | $0.000195 |
gpt-5.5-high | openai/gpt-5.5 | ✅ | p=15 r=6 t=31 | 1.2s | $0.000555 |
gpt-5.5-xhigh | openai/gpt-5.5 | ✅ | p=15 r=6 t=31 | 1.4s | $0.000555 |
gpt-5.5-pro | openai/gpt-5.5-pro | ✅ | p=15 r=10 t=32 | 2.6s | $0.00351 |
o1 | openai/o1 | ✅ | p=15 r=64 t=93 | 1.7s | $0.004905 |
o1-high | openai/o1 | ✅ | p=15 r=128 t=158 | 5.4s | $0.008805 |
google/claude-haiku-4-5 | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=17 r=0 m=4 t=21 | 952ms | $0.000037 |
google/claude-haiku-4-5-high | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=46 r=30 m=15 t=91 | 929ms | $0.000271 |
google/claude-sonnet-4 | vertex_ai/claude-sonnet-4@20250514 | ✅ | p=17 r=0 m=4 t=21 | 918ms | $0.000111 |
google/claude-sonnet-4-high | vertex_ai/claude-sonnet-4@20250514 | ✅ | p=46 r=42 m=15 t=103 | 2.0s | $0.000993 |
google/claude-sonnet-4-5 | vertex_ai/claude-sonnet-4-5@20250929 | ✅ | p=17 r=0 m=4 t=21 | 2.3s | $0.000111 |
google/claude-sonnet-4-5-high | vertex_ai/claude-sonnet-4-5@20250929 | ✅ | p=46 r=42 m=16 t=104 | 2.7s | $0.001008 |
google/claude-sonnet-4-6 | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 1.2s | $0.000111 |
google/claude-sonnet-4-6-high | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 977ms | $0.000111 |
google/claude-opus-4 | vertex_ai/claude-opus-4@20250514 | ✅ | p=17 r=0 m=4 t=21 | 1.6s | $0.000555 |
google/claude-opus-4-high | vertex_ai/claude-opus-4@20250514 | ✅ | p=46 r=27 m=13 t=86 | 2.7s | $0.00369 |
google/claude-opus-4-1 | vertex_ai/claude-opus-4-1@20250805 | ✅ | p=17 r=0 m=4 t=21 | 9.0s | $0.000555 |
google/claude-opus-4-1-high | vertex_ai/claude-opus-4-1@20250805 | ✅ | p=17 r=0 m=4 t=21 | 1.6s | $0.000555 |
google/claude-opus-4-5 | vertex_ai/claude-opus-4-5@20251101 | ✅ | p=17 r=0 m=4 t=21 | 1.8s | $0.000185 |
google/claude-opus-4-5-high | vertex_ai/claude-opus-4-5@20251101 | ✅ | p=46 r=15 m=12 t=73 | 1.1s | $0.000905 |
google/claude-opus-4-6 | vertex_ai/claude-opus-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 5.5s | $0.000185 |
google/claude-opus-4-6-high | – | ❌ | – | – | – |
google/claude-opus-4-7 | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000285 |
google/claude-opus-4-7-low | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000285 |
google/claude-opus-4-7-medium | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 2.4s | $0.000285 |
google/claude-opus-4-7-high | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.7s | $0.000285 |
google/claude-opus-4-7-xhigh | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.1s | $0.000285 |
google/claude-opus-4-7-max | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.7s | $0.000285 |
google/claude-opus-4-8 | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.2s | $0.00021 |
google/claude-opus-4-8-low | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.3s | $0.00021 |
google/claude-opus-4-8-medium | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.2s | $0.00021 |
google/claude-opus-4-8-high | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 2.5s | $0.00021 |
google/claude-opus-4-8-xhigh | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 2.5s | $0.00021 |
google/claude-opus-4-8-max | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 2.8s | $0.00021 |
amazon/claude-haiku-3-5 | bedrock/us.anthropic.claude-3-5-haiku-20241022-v1:0 | ✅ | p=20 r=0 m=96 t=116 | 3.1s | $0.0004 |
amazon/claude-haiku-4-5 | bedrock/us.anthropic.claude-haiku-4-5-20251001-v1:0 | ✅ | p=17 r=0 m=4 t=21 | 1.2s | $0.000041 |
amazon/claude-haiku-4-5-high | bedrock/us.anthropic.claude-haiku-4-5-20251001-v1:0 | ✅ | p=46 r=27 m=14 t=87 | 1.1s | $0.000276 |
amazon/claude-sonnet-4 | bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0 | ✅ | p=17 r=0 m=4 t=21 | 968ms | $0.000111 |
amazon/claude-sonnet-4-high | bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0 | ✅ | p=46 r=42 m=14 t=102 | 1.5s | $0.000978 |
amazon/claude-sonnet-4-5 | bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0 | ✅ | p=17 r=0 m=4 t=21 | 1.5s | $0.000122 |
amazon/claude-sonnet-4-5-high | bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0 | ✅ | p=46 r=26 m=13 t=85 | 2.5s | $0.000795 |
amazon/claude-sonnet-4-6 | bedrock/us.anthropic.claude-sonnet-4-6 | ✅ | p=17 r=0 m=4 t=21 | 1.2s | $0.000122 |
amazon/claude-sonnet-4-6-high | bedrock/us.anthropic.claude-sonnet-4-6 | ✅ | p=17 r=0 m=4 t=21 | 1.5s | $0.000122 |
amazon/claude-opus-4-1 | bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0 | ✅ | p=17 r=0 m=4 t=21 | 3.3s | $0.000555 |
amazon/claude-opus-4-1-high | bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0 | ✅ | p=46 r=30 m=14 t=90 | 5.9s | $0.00399 |
amazon/claude-opus-4-5 | bedrock/us.anthropic.claude-opus-4-5-20251101-v1:0 | ✅ | p=17 r=0 m=4 t=21 | 1.7s | $0.000204 |
amazon/claude-opus-4-5-high | bedrock/us.anthropic.claude-opus-4-5-20251101-v1:0 | ✅ | p=46 r=15 m=12 t=73 | 2.2s | $0.000995 |
amazon/claude-opus-4-6 | bedrock/us.anthropic.claude-opus-4-6-v1 | ✅ | p=17 r=0 m=4 t=21 | 2.5s | $0.000204 |
amazon/claude-opus-4-6-high | bedrock/us.anthropic.claude-opus-4-6-v1 | ✅ | p=17 r=16 m=20 t=53 | 2.5s | $0.001084 |
amazon/claude-opus-4-7 | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000313 |
amazon/claude-opus-4-7-low | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.0s | $0.000313 |
amazon/claude-opus-4-7-medium | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 3.0s | $0.000313 |
amazon/claude-opus-4-7-high | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.3s | $0.000313 |
amazon/claude-opus-4-7-xhigh | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000313 |
amazon/claude-opus-4-7-max | bedrock/us.anthropic.claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000313 |
amazon/claude-opus-4-8 | – | ❌ | – | – | – |
amazon/claude-opus-4-8-low | – | ❌ | – | – | – |
amazon/claude-opus-4-8-medium | – | ❌ | – | – | – |
amazon/claude-opus-4-8-high | – | ❌ | – | – | – |
amazon/claude-opus-4-8-xhigh | – | ❌ | – | – | – |
amazon/claude-opus-4-8-max | – | ❌ | – | – | – |
devstral-2 | bedrock/mistral.devstral-2-123b | ✅ | p=12 r=0 m=2 t=14 | 810ms | $0.000009 |
mistral-large-3 | bedrock/mistral.mistral-large-3-675b-instruct | ✅ | p=12 r=0 m=2 t=14 | 553ms | $0.000009 |
nemotron-super-3 | bedrock/nvidia.nemotron-super-3-120b | ✅ | p=25 r=0 m=2 t=27 | 1.2s | $0.000005 |
nemotron-nano-3 | bedrock/nvidia.nemotron-nano-3-30b | ✅ | p=25 r=0 m=2 t=27 | 620ms | $0.000002 |
nemotron-nano-vl | bedrock/nvidia.nemotron-nano-12b-v2 | ✅ | p=24 r=0 m=3 t=27 | 623ms | $0.000007 |
nova-premier-1 | – | ❌ | – | – | – |
nova-pro-1 | bedrock/amazon.nova-pro-v1:0 | ✅ | p=10 r=0 m=2 t=12 | 648ms | $0.000014 |
nova-micro-1 | bedrock/amazon.nova-micro-v1:0 | ✅ | p=10 r=0 m=2 t=12 | 623ms | $0.000001 |
gemini-2.5-flash | vertex_ai/gemini-2.5-flash | ✅ | p=9 r=23 m=1 t=33 | 1.3s | $0.000063 |
gemini-2.5-flash-high | vertex_ai/gemini-2.5-flash | ✅ | p=9 r=22 m=1 t=32 | 1.6s | $0.00006 |
gemini-2.5-flash-lite | vertex_ai/gemini-2.5-flash-lite | ✅ | p=9 m=1 t=10 | 751ms | $0.000001 |
gemini-2.5-pro | vertex_ai/gemini-2.5-pro | ✅ | p=9 r=220 m=1 t=230 | 3.8s | $0.002221 |
gemini-2.5-pro-high | vertex_ai/gemini-2.5-pro | ✅ | p=9 r=104 m=1 t=114 | 3.5s | $0.001061 |
gemini-3-flash | vertex_ai/gemini-3-flash-preview | ✅ | p=9 r=103 m=0 t=112 | 3.1s | $0.000314 |
gemini-3-flash-high | vertex_ai/gemini-3-flash-preview | ✅ | p=9 r=43 m=1 t=53 | 2.7s | $0.000136 |
gemini-3-flash-priority | vertex_ai/gemini-3-flash-preview | ✅ | p=9 r=41 m=1 t=51 | 3.6s | $0.000131 |
gemini-3-flash-high-priority | vertex_ai/gemini-3-flash-preview | ✅ | p=9 r=43 m=1 t=53 | 3.9s | $0.000136 |
gemini-3.1-flash-lite | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 m=1 t=10 | 920ms | $0.000004 |
gemini-3.1-flash-lite-high | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 r=69 m=1 t=79 | 3.1s | $0.000107 |
gemini-3.1-flash-lite-priority | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 m=1 t=10 | 936ms | $0.000004 |
gemini-3.1-flash-lite-high-priority | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 r=80 m=1 t=90 | 2.8s | $0.000124 |
gemini-3.1-pro | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=133 m=1 t=143 | 2.9s | $0.001626 |
gemini-3.1-pro-high | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=119 m=1 t=129 | 5.0s | $0.001458 |
gemini-3.1-pro-priority | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=104 m=1 t=114 | 2.7s | $0.001278 |
gemini-3.1-pro-high-priority | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=135 m=1 t=145 | 4.8s | $0.00165 |
gemini-3.5-flash | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=62 m=1 t=72 | 1.4s | $0.00058 |
gemini-3.5-flash-high | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=95 m=1 t=105 | 2.9s | $0.000878 |
gemini-3.5-flash-priority | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=112 m=1 t=122 | 1.8s | $0.00103 |
gemini-3.5-flash-high-priority | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=129 m=1 t=139 | 2.9s | $0.001183 |
google/gpt-oss-120b | vertex_ai/openai/gpt-oss-120b-maas | ✅ | p=76 t=133 | 1.1s | $0.000036 |
google/gpt-oss-20b | vertex_ai/openai/gpt-oss-20b-maas | ✅ | p=76 t=123 | 793ms | $0.000015 |
google/gpt-oss-120b-high | vertex_ai/openai/gpt-oss-120b-maas | ✅ | p=76 t=194 | 1.1s | $0.000073 |
google/gpt-oss-20b-high | vertex_ai/openai/gpt-oss-20b-maas | ✅ | p=76 t=119 | 785ms | $0.000014 |
google/deepseek-r1 | vertex_ai/deepseek-ai/deepseek-r1-0528-maas | ✅ | p=14 t=39 | 736ms | $0.000154 |
google/qwen-3-coder | vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas | ✅ | p=17 t=19 | 1.2s | $0.000023 |
google/qwen-3 | vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas | ✅ | p=17 t=19 | 908ms | $0.000006 |
google/gemma-4 | vertex_ai/google/gemma-4-26b-a4b-it-maas | ✅ | p=22 t=24 | 520ms | $0.000004 |
google/codestral | – | ❌ | – | – | – |
google/glm-5 | vertex_ai/zai-org/glm-5-maas | ✅ | p=14 t=164 | 3.1s | $0.000494 |
google/glm-4.7 | vertex_ai/zai-org/glm-4.7-maas | ✅ | p=14 t=97 | 1.7s | $0.000191 |
google/deepseek-3.2 | vertex_ai/deepseek-ai/deepseek-v3.2-maas | ✅ | p=13 t=15 | 1.6s | $0.000011 |
google/kimi-k2-thinking | vertex_ai/moonshotai/kimi-k2-thinking-maas | ✅ | p=16 t=65 | 902ms | $0.000132 |
google/minimax-m2 | vertex_ai/minimaxai/minimax-m2-maas | ✅ | p=31 t=86 | 1.1s | $0.00007 |
google/grok-4.20-reasoning | vertex_ai/xai/grok-4.20-reasoning | ✅ | p=338 r=465 t=804 | 2.1s | $0.003466 |
google/grok-4.1-non-reasoning | vertex_ai/xai/grok-4.1-fast-non-reasoning | ✅ | p=680 r=0 t=681 | 547ms | $0.000037 |
google/grok-4.1-reasoning | vertex_ai/xai/grok-4.1-fast-reasoning | ✅ | p=668 r=101 t=770 | 1.1s | $0.000086 |
google/grok-4.3 | – | ❌ | – | – | – |
amazon/llama-4-maverick | bedrock/us.meta.llama4-maverick-17b-instruct-v1:0 | ✅ | p=44 r=0 m=2 t=46 | 500ms | $0.000012 |
amazon/llama-4-scout | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | ✅ | p=44 r=0 m=2 t=46 | 535ms | $0.000009 |
amazon/gpt-oss-120b | bedrock/openai.gpt-oss-120b-1:0 | ✅ | p=76 r=20 m=11 t=107 | 676ms | $0.00003 |
amazon/gpt-oss-20b | bedrock/openai.gpt-oss-20b-1:0 | ✅ | p=76 r=21 m=11 t=108 | 664ms | $0.000015 |
lbl/gemma-4 | hosted_vllm/gemma-4 | ✅ | p=22 t=24 | 9.4s | – |
lbl/gemma-4-thinking | hosted_vllm/gemma-4-thinking | ✅ | p=25 t=119 | 12.8s | – |
lbl/gemma-4-mini | hosted_vllm/gemma-4-mini | ✅ | p=22 t=45 | 571ms | – |
lbl/gemma-4-mini-thinking | hosted_vllm/gemma-4-mini-thinking | ✅ | p=25 t=27 | 299ms | – |
lbl/gpt-oss-20b | hosted_vllm/gpt-oss-20b | ✅ | p=76 t=109 | 470ms | – |
lbl/gpt-oss-20b-low | hosted_vllm/gpt-oss-20b | ✅ | p=76 t=107 | 447ms | – |
lbl/gpt-oss-20b-medium | hosted_vllm/gpt-oss-20b | ✅ | p=76 t=112 | 484ms | – |
lbl/gpt-oss-20b-high | hosted_vllm/gpt-oss-20b-high | ✅ | p=76 t=109 | 462ms | – |
lbl/gpt-oss-120b | hosted_vllm/gpt-oss-120b | ✅ | p=76 t=129 | 470ms | – |
lbl/gpt-oss-120b-low | hosted_vllm/gpt-oss-120b-low | ✅ | p=76 t=108 | 390ms | – |
lbl/gpt-oss-120b-medium | hosted_vllm/gpt-oss-120b-medium | ✅ | p=76 t=129 | 573ms | – |
lbl/gpt-oss-120b-high | hosted_vllm/gpt-oss-120b-high | ✅ | p=76 t=122 | 452ms | – |
lbl/cborg-chat | openai/lbl/cborg-chat | ✅ | p=76 t=109 | 465ms | – |
lbl/cborg-deepthought | openai/lbl/cborg-deepthought | ✅ | p=25 t=58 | 732ms | – |
lbl/cborg-coder | openai/lbl/cborg-coder | ✅ | p=25 t=66 | 1.1s | – |
lbl/cborg-coder-fast | openai/lbl/cborg-coder-fast | ✅ | p=76 t=130 | 560ms | – |
lbl/cborg-mini | openai/lbl/cborg-mini | ✅ | p=25 t=27 | 320ms | – |
lbl/cborg-mini-fast | openai/lbl/cborg-mini-fast | ✅ | p=22 t=47 | 546ms | – |
lbl/cborg-privacy-filter | openai/lbl/cborg-privacy-filter | ✅ | p=6 t=12 | 501ms | – |
lbl/cborg-instant | openai/lbl/cborg-instant | ✅ | p=26 t=282 | 911ms | – |
lbl/cborg-instant-short | openai/lbl/cborg-instant-short | ✅ | p=26 t=282 | 1.1s | – |
lbl/cborg-safeguard | openai/lbl/cborg-safeguard | ✅ | p=76 t=121 | 434ms | – |
lbl/cborg-safeguard-high | openai/lbl/cborg-safeguard-high | ✅ | p=76 t=118 | 572ms | – |
lbl/cborg-ocr | openai/lbl/cborg-ocr | ✅ | p=22 t=24 | 560ms | – |
lbl/cborg-vision | openai/lbl/cborg-vision | ✅ | p=25 t=92 | 975ms | – |
lbl/cborg-vision-fast | openai/lbl/cborg-vision-fast | ✅ | p=25 t=27 | 319ms | – |
lbl/cborg-ocr-fast | openai/lbl/cborg-ocr-fast | ✅ | p=25 t=27 | 333ms | – |
claude-haiku | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=17 r=0 m=4 t=21 | 715ms | $0.000037 |
claude-haiku-high | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=46 r=27 m=14 t=87 | 944ms | $0.000251 |
claude-sonnet | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 1.0s | $0.000111 |
claude-sonnet-high | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 1.4s | $0.000111 |
claude-opus | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.6s | $0.00021 |
claude-opus-high | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.6s | $0.00021 |
anthropic/claude-haiku | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=17 r=0 m=4 t=21 | 761ms | $0.000037 |
anthropic/claude-haiku-high | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=46 r=28 m=14 t=88 | 1.3s | $0.000256 |
anthropic/claude-sonnet | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 4.2s | $0.000111 |
anthropic/claude-sonnet-high | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 1.3s | $0.000111 |
anthropic/claude-opus | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.1s | $0.00021 |
anthropic/claude-opus-high | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 2.4s | $0.00021 |
gemini-pro | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=101 m=1 t=111 | 3.2s | $0.001242 |
gemini-pro-high | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=101 m=1 t=111 | 4.9s | $0.001242 |
gemini-pro-priority | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=78 m=1 t=88 | 3.6s | $0.000966 |
gemini-pro-high-priority | vertex_ai/gemini-3.1-pro-preview | ✅ | p=9 r=121 m=1 t=131 | 4.2s | $0.001482 |
gemini-flash | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=122 m=1 t=132 | 1.9s | $0.00112 |
gemini-flash-high | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=134 m=1 t=144 | 3.6s | $0.001228 |
gemini-flash-priority | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=103 m=0 t=112 | 1.8s | $0.00094 |
gemini-flash-high-priority | vertex_ai/gemini-3.5-flash | ✅ | p=9 r=139 m=1 t=149 | 1.6s | $0.001273 |
gemini-flash-lite | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 m=1 t=10 | 756ms | $0.000004 |
gemini-flash-lite-high | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 r=96 m=1 t=106 | 2.8s | $0.000148 |
gemini-flash-lite-priority | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 m=1 t=10 | 808ms | $0.000004 |
gemini-flash-lite-high-priority | vertex_ai/gemini-3.1-flash-lite-preview | ✅ | p=9 r=92 m=1 t=102 | 2.4s | $0.000142 |
meta/llama-4-scout | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | ✅ | p=44 r=0 m=2 t=46 | 486ms | $0.000009 |
Llama-4-Scout-17B-16E-Instruct | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | ✅ | p=44 r=0 m=2 t=46 | 484ms | $0.000009 |
gpt-oss-20b | hosted_vllm/gpt-oss-20b | ✅ | p=76 t=98 | 424ms | – |
gpt-oss-120b | vertex_ai/openai/gpt-oss-120b-maas | ✅ | p=76 t=100 | 665ms | $0.000015 |
gpt-oss-20b-high | hosted_vllm/gpt-oss-20b-high | ✅ | p=76 t=107 | 440ms | – |
gpt-oss-120b-high | vertex_ai/openai/gpt-oss-120b-maas | ✅ | p=76 t=160 | 954ms | $0.000052 |
gemma-4 | hosted_vllm/gemma-4 | ✅ | p=22 t=24 | 511ms | – |
gemma-4-thinking | hosted_vllm/gemma-4-thinking | ✅ | p=25 t=120 | 2.4s | – |
gemma-4-mini | hosted_vllm/gemma-4-mini | ✅ | p=22 t=57 | 650ms | – |
gemma-4-mini-thinking | hosted_vllm/gemma-4-mini-thinking | ✅ | p=25 t=27 | 328ms | – |
gpt | openai/gpt-5.5 | ✅ | p=15 r=4 t=29 | 1.4s | $0.000495 |
gpt-high | openai/gpt-5.5 | ✅ | p=15 r=6 t=31 | 1.5s | $0.000555 |
gpt-codex | openai/gpt-5.3-codex | ✅ | p=15 r=0 t=20 | 1.0s | $0.000096 |
gpt-chat | openai/gpt-5.2-chat-latest | ✅ | p=15 r=0 t=25 | 1.2s | $0.000166 |
gpt-pro | openai/gpt-5.5-pro | ✅ | p=15 r=10 t=32 | 3.2s | $0.00351 |
gpt-mini | openai/gpt-5.4-mini | ✅ | p=15 r=0 t=19 | 867ms | $0.000029 |
gpt-mini-high | openai/gpt-5.4-mini | ✅ | p=15 r=11 t=36 | 732ms | $0.000106 |
gpt-nano | openai/gpt-5.4-nano | ✅ | p=15 r=0 t=19 | 779ms | $0.000008 |
gpt-nano-high | openai/gpt-5.4-nano | ✅ | p=15 r=0 t=19 | 959ms | $0.000008 |
xai/grok-4.20-reasoning | vertex_ai/xai/grok-4.20-reasoning | ✅ | p=338 r=416 t=755 | 1.8s | $0.002596 |
xai/grok-4.1-fast-reasoning | vertex_ai/xai/grok-4.1-fast-reasoning | ✅ | p=668 r=107 t=776 | 913ms | $0.000089 |
xai/grok-4.1-fast-non-reasoning | vertex_ai/xai/grok-4.1-fast-non-reasoning | ✅ | p=680 r=0 t=681 | 528ms | $0.000037 |
xai/grok-4.3 | – | ❌ | – | – | – |
xai/grok-reasoning | vertex_ai/xai/grok-4.20-reasoning | ✅ | p=338 r=493 t=832 | 2.2s | $0.003058 |
xai/grok-fast-reasoning | vertex_ai/xai/grok-4.1-fast-reasoning | ✅ | p=668 r=122 t=791 | 1.2s | $0.000095 |
xai/grok-fast-non-reasoning | vertex_ai/xai/grok-4.1-fast-non-reasoning | ✅ | p=680 r=0 t=681 | 554ms | $0.000037 |
xai/grok | – | ❌ | – | – | – |
cborg-chat | openai/lbl/cborg-chat | ✅ | p=76 t=99 | 379ms | – |
cborg-deepthought | openai/lbl/cborg-deepthought | ✅ | p=25 t=145 | 1.8s | – |
cborg-coder | openai/lbl/cborg-coder | ✅ | p=25 t=98 | 49.1s | – |
cborg-coder-fast | openai/lbl/cborg-coder-fast | ✅ | p=76 t=107 | 430ms | – |
cborg-mini | openai/lbl/cborg-mini | ✅ | p=25 t=27 | 318ms | – |
cborg-mini-fast | openai/lbl/cborg-mini-fast | ✅ | p=22 t=52 | 672ms | – |
cborg-ocr | openai/lbl/cborg-ocr | ✅ | p=22 t=24 | 573ms | – |
cborg-vision | openai/lbl/cborg-vision | ✅ | p=25 t=94 | 2.3s | – |
cborg-vision-fast | openai/lbl/cborg-vision-fast | ✅ | p=25 t=128 | 739ms | – |
cborg-ocr-fast | openai/lbl/cborg-ocr-fast | ✅ | p=25 t=27 | 281ms | – |
cborg-safeguard | openai/lbl/cborg-safeguard | ✅ | p=76 t=136 | 553ms | – |
cborg-safeguard-high | openai/lbl/cborg-safeguard-high | ✅ | p=76 t=120 | 465ms | – |
cborg-privacy-filter | openai/lbl/cborg-privacy-filter | ✅ | p=6 t=12 | 364ms | – |
cborg-instant | openai/lbl/cborg-instant | ✅ | p=26 t=282 | 970ms | – |
cborg-instant-short | openai/lbl/cborg-instant-short | ✅ | p=22 t=278 | 559ms | – |
claude-haiku-4-5 | vertex_ai/claude-haiku-4-5@20251001 | ✅ | p=17 r=0 m=4 t=21 | 954ms | $0.000037 |
claude-sonnet-4-0 | vertex_ai/claude-sonnet-4@20250514 | ✅ | p=17 r=0 m=4 t=21 | 956ms | $0.000111 |
claude-sonnet-4-5 | vertex_ai/claude-sonnet-4-5@20250929 | ✅ | p=17 r=0 m=4 t=21 | 1.1s | $0.000111 |
claude-sonnet-4-6 | vertex_ai/claude-sonnet-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 1.2s | $0.000111 |
claude-opus-4-0 | vertex_ai/claude-opus-4@20250514 | ✅ | p=17 r=0 m=4 t=21 | 1.9s | $0.000555 |
claude-opus-4-1 | vertex_ai/claude-opus-4-1@20250805 | ✅ | p=17 r=0 m=4 t=21 | 1.9s | $0.000555 |
claude-opus-4-5 | vertex_ai/claude-opus-4-5@20251101 | ✅ | p=17 r=0 m=4 t=21 | 866ms | $0.000185 |
claude-opus-4-6 | vertex_ai/claude-opus-4-6@default | ✅ | p=17 r=0 m=4 t=21 | 3.3s | $0.000185 |
claude-opus-4-7 | vertex_ai/claude-opus-4-7 | ✅ | p=27 r=0 m=6 t=33 | 1.2s | $0.000285 |
claude-opus-4-8 | vertex_ai/claude-opus-4-8 | ✅ | p=22 r=0 m=4 t=26 | 1.3s | $0.00021 |
devstral | bedrock/mistral.devstral-2-123b | ✅ | p=12 r=0 m=2 t=14 | 717ms | $0.000009 |
mistral-large | bedrock/mistral.mistral-large-3-675b-instruct | ✅ | p=12 r=0 m=2 t=14 | 930ms | $0.000009 |
nova-premier | – | ❌ | – | – | – |
nova-pro | bedrock/amazon.nova-pro-v1:0 | ✅ | p=10 r=0 m=3 t=13 | 643ms | $0.000018 |
nova-micro | bedrock/amazon.nova-micro-v1:0 | ✅ | p=10 r=0 m=3 t=13 | 608ms | $0.000001 |
Embedding Models
| Model | Underlying Model | Status | Dimensions | Time | Cost |
|---|---|---|---|---|---|
text-embedding-ada-002 | openai/text-embedding-ada-002 | ✅ | 1536 | 554ms | $0.000001 |
nova-2-embed-multimodal | bedrock/amazon.nova-2-multimodal-embeddings-v1:0 | ✅ | 3072 | 597ms | $0.000104 |
titan-embed-text-v1 | bedrock/amazon.titan-embed-text-v1 | ✅ | 1536 | 441ms | $0.000001 |
titan-embed-image-v1 | bedrock/amazon.titan-embed-image-v1 | ✅ | 1024 | 486ms | $0.000009 |
titan-embed-text-v2 | bedrock/amazon.titan-embed-text-v2:0 | ✅ | 1024 | 528ms | $0.000002 |
cohere-embed-multilingual-v3 | bedrock/cohere.embed-multilingual-v3 | ✅ | 1024 | 588ms | $0.000001 |
cohere-embed-english-v3 | bedrock/cohere.embed-english-v3 | ✅ | 1024 | 423ms | $0.000001 |
cohere-embed-v4 | bedrock/cohere.embed-v4:0 | ✅ | 1536 | 485ms | $0.000001 |
gemini-embedding-001 | vertex_ai/gemini-embedding-001 | ✅ | 3072 | 10.9s | $0.000001 |
text-embedding-004 | vertex_ai/text-embedding-004 | ✅ | 768 | 538ms | $0.000001 |
lbl/nomic-embed-text | openai/nomic-embed-text | ✅ | 768 | 1.9s | – |
lbl/nomic-embed-vision | openai/nomic-embed-vision | ✅ | 768 | 299ms | – |
lbl/nomic-embed-code | openai/nomic-embed-code | ✅ | 3584 | 456ms | – |
lbl/nomic-embed-text-test | openai/nomic-embed-text | ✅ | 768 | 359ms | – |
lbl/nomic-embed-vision-test | openai/nomic-embed-vision | ✅ | 768 | 272ms | – |
lbl/nomic-embed-code-test | openai/nomic-embed-code | ✅ | 3584 | 427ms | – |
nomic-embed-text | openai/nomic-embed-text | ✅ | 768 | 305ms | – |
nomic-embed-vision | openai/nomic-embed-vision | ✅ | 768 | 308ms | – |
nomic-embed-code | openai/nomic-embed-code | ✅ | 3584 | 416ms | – |
nomic-embed-text-test | openai/nomic-embed-text | ✅ | 768 | 310ms | – |
nomic-embed-vision-test | openai/nomic-embed-vision | ✅ | 768 | 318ms | – |
nomic-embed-code-test | openai/nomic-embed-code | ✅ | 3584 | 395ms | – |
Auto-generated by cborg-etc/bin/test-all-models.sh. Run the script and publish to update.