Model Status

Last tested: 2026-06-04 22:27:43 UTC

233 of 266 models passed. Chat/completion: 219/244. Embeddings: 14/22.
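
The summary counts can be cross-checked with a few lines of Python. This is a minimal sketch, not part of test-all-models.sh: the `counts` dictionary and the `summarize` helper are hypothetical names, and the numbers are copied by hand from the summary line above.

```python
# Counts copied by hand from the summary line: (passed, total) per category.
# The category labels and helper name are illustrative assumptions.
counts = {
    "chat/completion": (219, 244),
    "embeddings": (14, 22),
}


def summarize(counts):
    """Return overall passed, overall total, and failures per category."""
    passed = sum(p for p, _ in counts.values())
    total = sum(t for _, t in counts.values())
    failed = {name: t - p for name, (p, t) in counts.items()}
    return passed, total, failed


passed, total, failed = summarize(counts)
print(f"{passed} of {total} models passed ({passed / total:.1%})")
for name, n in failed.items():
    print(f"{name}: {n} failed")
```

With the figures above, the per-category counts add up to the stated 233 of 266, leaving 25 failing chat/completion models and 8 failing embedding models.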

Chat and Completion Models

| Model | Underlying Model | Status | Tokens (p/r/m/t) | Time | Cost |
| --- | --- | --- | --- | --- | --- |
| claude-haiku | vertex_ai/claude-haiku-4-5@20251001 | pass | p=17 r=0 m=4 t=21 | 658ms | $0.000037 |
| claude-haiku-high | vertex_ai/claude-haiku-4-5@20251001 | pass | p=46 r=27 m=14 t=87 | 1.2s | $0.000251 |
| claude-sonnet | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 885ms | $0.000111 |
| claude-sonnet-high | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 877ms | $0.000111 |
| claude-opus | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.2s | $0.00021 |
| claude-opus-high | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.0s | $0.00021 |
| anthropic/claude-haiku | vertex_ai/claude-haiku-4-5@20251001 | pass | p=17 r=0 m=4 t=21 | 622ms | $0.000037 |
| anthropic/claude-haiku-high | vertex_ai/claude-haiku-4-5@20251001 | pass | p=46 r=29 m=14 t=89 | 851ms | $0.000261 |
| anthropic/claude-sonnet | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 858ms | $0.000111 |
| anthropic/claude-sonnet-high | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 1.4s | $0.000111 |
| anthropic/claude-opus | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 2.5s | $0.00021 |
| anthropic/claude-opus-high | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 2.2s | $0.00021 |
| gemini-pro | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=141 m=1 t=151 | 3.1s | $0.001722 |
| gemini-pro-high | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=98 m=1 t=108 | 8.3s | $0.001206 |
| gemini-pro-priority | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=68 m=1 t=78 | 2.7s | $0.000846 |
| gemini-pro-high-priority | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=105 m=1 t=115 | 4.0s | $0.00129 |
| gemini-flash | vertex_ai/gemini-3.5-flash | pass | p=9 r=87 m=1 t=97 | 1.7s | $0.000806 |
| gemini-flash-high | vertex_ai/gemini-3.5-flash | pass | p=9 r=108 m=1 t=118 | 2.6s | $0.000994 |
| gemini-flash-priority | vertex_ai/gemini-3.5-flash | pass | p=9 r=106 m=1 t=116 | 1.6s | $0.000976 |
| gemini-flash-high-priority | vertex_ai/gemini-3.5-flash | pass | p=9 r=85 m=1 t=95 | 2.9s | $0.000788 |
| gemini-flash-lite | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 m=1 t=10 | 772ms | $0.000004 |
| gemini-flash-lite-high | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 r=93 m=1 t=103 | 2.5s | $0.000143 |
| gemini-flash-lite-priority | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 m=1 t=10 | 855ms | $0.000004 |
| gemini-flash-lite-high-priority | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 r=119 m=1 t=129 | 2.3s | $0.000182 |
| meta/llama-4-scout | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | pass | p=44 r=0 m=2 t=46 | 417ms | $0.000009 |
| Llama-4-Scout-17B-16E-Instruct | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | pass | p=44 r=0 m=2 t=46 | 398ms | $0.000009 |
| gpt-oss-20b | hosted_vllm/gpt-oss-20b | pass | p=76 t=105 | 1.0s | |
| gpt-oss-120b | vertex_ai/openai/gpt-oss-120b-maas | pass | p=76 t=113 | 534ms | $0.000024 |
| gpt-oss-20b-high | hosted_vllm/gpt-oss-20b-high | pass | p=76 t=98 | 818ms | |
| gpt-oss-120b-high | vertex_ai/openai/gpt-oss-120b-maas | pass | p=76 t=187 | 1.3s | $0.000068 |
| gemma-4 | hosted_vllm/gemma-4 | pass | p=22 t=24 | 334ms | |
| gemma-4-thinking | hosted_vllm/gemma-4-thinking | pass | p=25 t=65 | 562ms | |
| gemma-4-mini | hosted_vllm/gemma-4-mini | pass | p=76 t=107 | 1.0s | |
| gemma-4-mini-thinking | hosted_vllm/gemma-4-mini-thinking | pass | p=76 t=113 | 1.2s | |
| gpt | openai/gpt-5.5 | pass | p=15 r=6 t=31 | 1.5s | $0.000555 |
| gpt-high | openai/gpt-5.5 | pass | p=15 r=6 t=31 | 1.2s | $0.000555 |
| gpt-codex | openai/gpt-5.3-codex | pass | p=15 r=0 t=20 | 906ms | $0.000096 |
| gpt-chat | openai/gpt-5.2-chat-latest | pass | p=15 r=0 t=25 | 1.1s | $0.000166 |
| gpt-pro | openai/gpt-5.5-pro | pass | p=15 r=10 t=32 | 4.5s | $0.00351 |
| gpt-mini | openai/gpt-5.4-mini | pass | p=15 r=0 t=19 | 461ms | $0.000029 |
| gpt-mini-high | openai/gpt-5.4-mini | pass | p=15 r=10 t=35 | 623ms | $0.000101 |
| gpt-nano | openai/gpt-5.4-nano | pass | p=15 r=0 t=19 | 701ms | $0.000008 |
| gpt-nano-high | openai/gpt-5.4-nano | pass | p=15 r=0 t=19 | 534ms | $0.000008 |
| xai/grok-4.20-reasoning | vertex_ai/xai/grok-4.20-reasoning | pass | p=338 r=146 t=485 | 1.1s | $0.000976 |
| xai/grok-4.3 | | fail | | | |
| xai/grok-reasoning | vertex_ai/xai/grok-4.20-reasoning | pass | p=338 r=323 t=662 | 1.6s | $0.002038 |
| xai/grok | | fail | | | |
| cborg-chat | hosted_vllm/gemma-4 | pass | p=22 t=24 | 2.3s | |
| cborg-deepthought | hosted_vllm/gemma-4-thinking | pass | p=25 t=98 | 11.4s | |
| cborg-coder | hosted_vllm/gemma-4-thinking | pass | p=25 t=86 | 2.6s | |
| cborg-coder-fast | hosted_vllm/gemma-4 | pass | p=22 t=24 | 888ms | |
| cborg-mini | hosted_vllm/gemma-4-mini-thinking | pass | p=76 t=107 | 1.1s | |
| cborg-mini-fast | hosted_vllm/gemma-4-mini | pass | p=76 t=112 | 1.2s | |
| cborg-ocr | hosted_vllm/gemma-4 | pass | p=22 t=24 | 2.2s | |
| cborg-vision | hosted_vllm/gemma-4-thinking | pass | p=25 t=66 | 8.8s | |
| cborg-safeguard | hosted_vllm/gpt-oss-safeguard-20b | pass | p=76 t=108 | 332ms | |
| cborg-safeguard-high | hosted_vllm/gpt-oss-safeguard-20b-high | pass | p=76 t=99 | 282ms | |
| cborg-privacy-filter | hosted_vllm/privacy-filter | pass | p=6 t=12 | 334ms | |
| claude-haiku-4-5 | vertex_ai/claude-haiku-4-5@20251001 | pass | p=17 r=0 m=4 t=21 | 644ms | $0.000037 |
| claude-sonnet-4-5 | vertex_ai/claude-sonnet-4-5@20250929 | pass | p=17 r=0 m=4 t=21 | 950ms | $0.000111 |
| claude-sonnet-4-6 | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 849ms | $0.000111 |
| claude-opus-4-0 | vertex_ai/claude-opus-4@20250514 | pass | p=17 r=0 m=4 t=21 | 1.6s | $0.000555 |
| claude-opus-4-1 | vertex_ai/claude-opus-4-1@20250805 | pass | p=17 r=0 m=4 t=21 | 1.4s | $0.000555 |
| claude-opus-4-5 | vertex_ai/claude-opus-4-5@20251101 | pass | p=17 r=0 m=4 t=21 | 853ms | $0.000185 |
| claude-opus-4-6 | vertex_ai/claude-opus-4-6@default | pass | p=17 r=0 m=4 t=21 | 1.0s | $0.000185 |
| claude-opus-4-7 | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 1.5s | $0.000285 |
| claude-opus-4-8 | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.3s | $0.00021 |
| devstral | bedrock/mistral.devstral-2-123b | pass | p=12 r=0 m=3 t=15 | 685ms | $0.000011 |
| mistral-large | bedrock/mistral.mistral-large-3-675b-instruct | pass | p=12 r=0 m=2 t=14 | 464ms | $0.000009 |
| nova-premier | | fail | | | |
| nova-pro | | fail | | | |
| nova-micro | | fail | | | |
| gpt-4o | openai/gpt-4o | pass | p=16 r=0 t=17 | 600ms | $0.00005 |
| gpt-4o-mini | openai/gpt-4o-mini | pass | p=16 r=0 t=17 | 526ms | $0.000003 |
| gpt-4.1 | openai/gpt-4.1 | pass | p=16 r=0 t=17 | 559ms | $0.00004 |
| gpt-4.1-mini | openai/gpt-4.1-mini | pass | p=16 r=0 t=17 | 716ms | $0.000008 |
| gpt-4.1-nano | openai/gpt-4.1-nano | pass | p=16 r=0 t=17 | 440ms | $0.000002 |
| gpt-5 | openai/gpt-5 | pass | p=15 r=64 t=89 | 2.2s | $0.000759 |
| gpt-5-chat | openai/gpt-5-chat-latest | pass | p=16 r=0 t=17 | 514ms | $0.00003 |
| gpt-5-mini | openai/gpt-5-mini | pass | p=15 r=64 t=89 | 2.1s | $0.000152 |
| gpt-5-mini-high | openai/gpt-5-mini | pass | p=15 r=192 t=217 | 3.6s | $0.000408 |
| gpt-5-nano | openai/gpt-5-nano | pass | p=15 r=64 t=89 | 1.2s | $0.00003 |
| gpt-5-nano-high | openai/gpt-5-nano | pass | p=15 r=128 t=153 | 1.6s | $0.000056 |
| gpt-5-codex | openai/gpt-5-codex | pass | p=15 r=0 t=62 | 1.3s | $0.000489 |
| gpt-5-high | openai/gpt-5 | pass | p=15 r=128 t=153 | 2.1s | $0.001399 |
| gpt-5.1 | openai/gpt-5.1 | pass | p=15 r=0 t=25 | 690ms | $0.000119 |
| gpt-5.1-chat | openai/gpt-5.1-chat-latest | pass | p=15 r=0 t=25 | 1.1s | $0.000119 |
| gpt-5.1-codex | openai/gpt-5.1-codex | pass | p=15 r=0 t=29 | 933ms | $0.000159 |
| gpt-5.1-codex-mini | openai/gpt-5.1-codex-mini | pass | p=15 r=0 t=43 | 1.5s | $0.00006 |
| gpt-5.1-codex-max | openai/gpt-5.1-codex-max | pass | p=15 r=0 t=83 | 1.4s | $0.000699 |
| gpt-5.1-high | openai/gpt-5.1 | pass | p=15 r=16 t=41 | 823ms | $0.000279 |
| gpt-5.2 | openai/gpt-5.2 | pass | p=15 r=0 t=19 | 587ms | $0.000082 |
| gpt-5.2-chat | openai/gpt-5.2-chat-latest | pass | p=15 r=0 t=25 | 1.2s | $0.000166 |
| gpt-5.2-codex | openai/gpt-5.2-codex | pass | p=15 r=0 t=31 | 892ms | $0.00025 |
| gpt-5.2-high | openai/gpt-5.2 | pass | p=15 r=0 t=19 | 592ms | $0.000082 |
| gpt-5.2-pro | openai/gpt-5.2 | pass | p=15 r=0 t=19 | 587ms | $0.000082 |
| gpt-5.3-codex | openai/gpt-5.3-codex | pass | p=15 r=0 t=20 | 1.0s | $0.000096 |
| gpt-5.3-codex-high | openai/gpt-5.3-codex | pass | p=15 r=13 t=35 | 1.2s | $0.000306 |
| gpt-5.3-codex-xhigh | openai/gpt-5.3-codex | pass | p=15 r=0 t=20 | 1.1s | $0.000096 |
| gpt-5.4 | openai/gpt-5.4 | pass | p=15 r=0 t=19 | 862ms | $0.000098 |
| gpt-5.4-mini | openai/gpt-5.4-mini | pass | p=15 r=0 t=19 | 494ms | $0.000029 |
| gpt-5.4-nano | openai/gpt-5.4-nano | pass | p=15 r=0 t=19 | 502ms | $0.000008 |
| gpt-5.4-high | openai/gpt-5.4 | pass | p=15 r=33 t=58 | 1.1s | $0.000682 |
| gpt-5.4-xhigh | openai/gpt-5.4 | pass | p=15 r=29 t=54 | 1.2s | $0.000623 |
| gpt-5.4-mini-high | openai/gpt-5.4-mini | pass | p=15 r=9 t=34 | 666ms | $0.000097 |
| gpt-5.4-mini-xhigh | openai/gpt-5.4-mini | pass | p=15 r=13 t=38 | 620ms | $0.000115 |
| gpt-5.4-nano-high | openai/gpt-5.4-nano | pass | p=15 r=0 t=19 | 464ms | $0.000008 |
| gpt-5.4-nano-xhigh | openai/gpt-5.4-nano | pass | p=15 r=10 t=35 | 702ms | $0.000028 |
| gpt-5.4-pro | openai/gpt-5.4 | pass | p=15 r=0 t=19 | 792ms | $0.000098 |
| gpt-5.5 | openai/gpt-5.5 | pass | p=15 r=0 t=19 | 1.5s | $0.000195 |
| gpt-5.5-low | openai/gpt-5.5 | pass | p=15 r=5 t=30 | 1.2s | $0.000525 |
| gpt-5.5-medium | openai/gpt-5.5 | pass | p=15 r=6 t=31 | 1.3s | $0.000555 |
| gpt-5.5-high | openai/gpt-5.5 | pass | p=15 r=6 t=31 | 1.3s | $0.000555 |
| gpt-5.5-xhigh | openai/gpt-5.5 | pass | p=15 r=6 t=31 | 1.3s | $0.000555 |
| gpt-5.5-pro | openai/gpt-5.5-pro | pass | p=15 r=10 t=32 | 8.8s | $0.00351 |
| o1 | openai/o1 | pass | p=15 r=128 t=157 | 2.6s | $0.008745 |
| o1-high | openai/o1 | pass | p=15 r=192 t=222 | 2.8s | $0.012645 |
| google/claude-haiku-4-5 | vertex_ai/claude-haiku-4-5@20251001 | pass | p=17 r=0 m=4 t=21 | 582ms | $0.000037 |
| google/claude-haiku-4-5-high | vertex_ai/claude-haiku-4-5@20251001 | pass | p=46 r=27 m=14 t=87 | 1.8s | $0.000251 |
| google/claude-sonnet-4 | vertex_ai/claude-sonnet-4@20250514 | pass | p=17 r=0 m=4 t=21 | 868ms | $0.000111 |
| google/claude-sonnet-4-high | vertex_ai/claude-sonnet-4@20250514 | pass | p=46 r=42 m=15 t=103 | 2.0s | $0.000993 |
| google/claude-sonnet-4-5 | vertex_ai/claude-sonnet-4-5@20250929 | pass | p=17 r=0 m=4 t=21 | 848ms | $0.000111 |
| google/claude-sonnet-4-5-high | vertex_ai/claude-sonnet-4-5@20250929 | pass | p=46 r=44 m=16 t=106 | 1.8s | $0.001038 |
| google/claude-sonnet-4-6 | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 1.6s | $0.000111 |
| google/claude-sonnet-4-6-high | vertex_ai/claude-sonnet-4-6@default | pass | p=17 r=0 m=4 t=21 | 903ms | $0.000111 |
| google/claude-opus-4 | vertex_ai/claude-opus-4@20250514 | pass | p=17 r=0 m=4 t=21 | 1.6s | $0.000555 |
| google/claude-opus-4-high | vertex_ai/claude-opus-4@20250514 | pass | p=46 r=39 m=15 t=100 | 2.8s | $0.00474 |
| google/claude-opus-4-1 | vertex_ai/claude-opus-4-1@20250805 | pass | p=17 r=0 m=4 t=21 | 8.1s | $0.000555 |
| google/claude-opus-4-1-high | vertex_ai/claude-opus-4-1@20250805 | pass | p=17 r=0 m=4 t=21 | 1.4s | $0.000555 |
| google/claude-opus-4-5 | vertex_ai/claude-opus-4-5@20251101 | pass | p=17 r=0 m=4 t=21 | 1.8s | $0.000185 |
| google/claude-opus-4-5-high | vertex_ai/claude-opus-4-5@20251101 | pass | p=46 r=15 m=12 t=73 | 1.2s | $0.000905 |
| google/claude-opus-4-6 | vertex_ai/claude-opus-4-6@default | pass | p=17 r=0 m=4 t=21 | 3.3s | $0.000185 |
| google/claude-opus-4-6-high | vertex_ai/claude-opus-4-6@default | pass | p=17 r=0 m=5 t=22 | 913ms | $0.00021 |
| google/claude-opus-4-7 | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 976ms | $0.000285 |
| google/claude-opus-4-7-low | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 888ms | $0.000285 |
| google/claude-opus-4-7-medium | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 1.4s | $0.000285 |
| google/claude-opus-4-7-high | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 890ms | $0.000285 |
| google/claude-opus-4-7-xhigh | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 1.2s | $0.000285 |
| google/claude-opus-4-7-max | vertex_ai/claude-opus-4-7@default | pass | p=27 r=0 m=6 t=33 | 2.7s | $0.000285 |
| google/claude-opus-4-8 | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.1s | $0.00021 |
| google/claude-opus-4-8-low | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.1s | $0.00021 |
| google/claude-opus-4-8-medium | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.5s | $0.00021 |
| google/claude-opus-4-8-high | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.3s | $0.00021 |
| google/claude-opus-4-8-xhigh | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 3.6s | $0.00021 |
| google/claude-opus-4-8-max | vertex_ai/claude-opus-4-8@default | pass | p=22 r=0 m=4 t=26 | 1.0s | $0.00021 |
| amazon/claude-haiku-3-5 | bedrock/us.anthropic.claude-3-5-haiku-20241022-v1:0 | pass | p=17 r=0 m=5 t=22 | 853ms | $0.000034 |
| amazon/claude-haiku-4-5 | bedrock/us.anthropic.claude-haiku-4-5-20251001-v1:0 | pass | p=17 r=0 m=4 t=21 | 980ms | $0.000041 |
| amazon/claude-haiku-4-5-high | bedrock/us.anthropic.claude-haiku-4-5-20251001-v1:0 | pass | p=46 r=27 m=14 t=87 | 1.5s | $0.000276 |
| amazon/claude-sonnet-4 | bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0 | pass | p=17 r=0 m=4 t=21 | 871ms | $0.000111 |
| amazon/claude-sonnet-4-high | bedrock/us.anthropic.claude-sonnet-4-20250514-v1:0 | pass | p=46 r=42 m=13 t=101 | 1.5s | $0.000963 |
| amazon/claude-sonnet-4-5 | bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0 | pass | p=17 r=0 m=4 t=21 | 2.0s | $0.000122 |
| amazon/claude-sonnet-4-5-high | bedrock/us.anthropic.claude-sonnet-4-5-20250929-v1:0 | pass | p=46 r=27 m=13 t=86 | 2.2s | $0.000812 |
| amazon/claude-sonnet-4-6 | bedrock/us.anthropic.claude-sonnet-4-6 | pass | p=17 r=0 m=4 t=21 | 1.0s | $0.000122 |
| amazon/claude-sonnet-4-6-high | bedrock/us.anthropic.claude-sonnet-4-6 | pass | p=17 r=0 m=4 t=21 | 2.0s | $0.000122 |
| amazon/claude-opus-4-1 | bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0 | pass | p=17 r=0 m=4 t=21 | 3.5s | $0.000555 |
| amazon/claude-opus-4-1-high | bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0 | pass | p=46 r=36 m=15 t=97 | 6.0s | $0.004515 |
| amazon/claude-opus-4-5 | bedrock/us.anthropic.claude-opus-4-5-20251101-v1:0 | pass | p=17 r=0 m=4 t=21 | 1.9s | $0.000204 |
| amazon/claude-opus-4-5-high | bedrock/us.anthropic.claude-opus-4-5-20251101-v1:0 | pass | p=46 r=15 m=12 t=73 | 1.7s | $0.000995 |
| amazon/claude-opus-4-6 | bedrock/us.anthropic.claude-opus-4-6-v1 | pass | p=17 r=0 m=4 t=21 | 2.1s | $0.000204 |
| amazon/claude-opus-4-6-high | bedrock/us.anthropic.claude-opus-4-6-v1 | pass | p=17 r=15 m=20 t=52 | 1.9s | $0.001056 |
| amazon/claude-opus-4-7 | | fail | | | |
| amazon/claude-opus-4-7-low | | fail | | | |
| amazon/claude-opus-4-7-medium | | fail | | | |
| amazon/claude-opus-4-7-high | | fail | | | |
| amazon/claude-opus-4-7-xhigh | | fail | | | |
| amazon/claude-opus-4-7-max | | fail | | | |
| amazon/claude-opus-4-8 | | fail | | | |
| amazon/claude-opus-4-8-low | | fail | | | |
| amazon/claude-opus-4-8-medium | | fail | | | |
| amazon/claude-opus-4-8-high | | fail | | | |
| amazon/claude-opus-4-8-xhigh | | fail | | | |
| amazon/claude-opus-4-8-max | | fail | | | |
| devstral-2 | bedrock/mistral.devstral-2-123b | pass | p=12 r=0 m=2 t=14 | 613ms | $0.000009 |
| mistral-large-3 | bedrock/mistral.mistral-large-3-675b-instruct | pass | p=12 r=0 m=2 t=14 | 450ms | $0.000009 |
| nemotron-super-3 | bedrock/nvidia.nemotron-super-3-120b | pass | p=25 r=0 m=2 t=27 | 537ms | $0.000005 |
| nemotron-nano-3 | bedrock/nvidia.nemotron-nano-3-30b | pass | p=25 r=0 m=2 t=27 | 606ms | $0.000002 |
| nemotron-nano-vl | bedrock/nvidia.nemotron-nano-12b-v2 | pass | p=24 r=0 m=3 t=27 | 449ms | $0.000007 |
| nova-premier-1 | | fail | | | |
| nova-pro-1 | bedrock/amazon.nova-pro-v1:0 | pass | p=10 r=0 m=18 t=28 | 592ms | $0.000066 |
| nova-micro-1 | bedrock/amazon.nova-micro-v1:0 | pass | p=10 r=0 m=2 t=12 | 441ms | $0.000001 |
| gemini-2.0-flash | | fail | | | |
| gemini-2.0-flash-lite | | fail | | | |
| gemini-2.5-flash | vertex_ai/gemini-2.5-flash | pass | p=9 r=23 m=1 t=33 | 737ms | $0.000063 |
| gemini-2.5-flash-high | vertex_ai/gemini-2.5-flash | pass | p=9 r=21 m=1 t=31 | 1.2s | $0.000058 |
| gemini-2.5-flash-lite | vertex_ai/gemini-2.5-flash-lite | pass | p=9 m=1 t=10 | 662ms | $0.000001 |
| gemini-2.5-pro | vertex_ai/gemini-2.5-pro | pass | p=9 r=215 m=1 t=225 | 3.3s | $0.002171 |
| gemini-2.5-pro-high | vertex_ai/gemini-2.5-pro | pass | p=9 r=111 m=1 t=121 | 2.8s | $0.001131 |
| gemini-3-flash | vertex_ai/gemini-3-flash-preview | pass | p=9 r=132 m=1 t=142 | 4.2s | $0.000404 |
| gemini-3-flash-high | vertex_ai/gemini-3-flash-preview | pass | p=9 r=118 m=1 t=128 | 3.4s | $0.000362 |
| gemini-3-flash-priority | vertex_ai/gemini-3-flash-preview | pass | p=9 r=46 m=1 t=56 | 2.6s | $0.000145 |
| gemini-3-flash-high-priority | vertex_ai/gemini-3-flash-preview | pass | p=9 r=46 m=1 t=56 | 3.2s | $0.000145 |
| gemini-3.1-flash-lite | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 m=1 t=10 | 718ms | $0.000004 |
| gemini-3.1-flash-lite-high | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 r=90 m=1 t=100 | 2.6s | $0.000139 |
| gemini-3.1-flash-lite-priority | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 m=1 t=10 | 747ms | $0.000004 |
| gemini-3.1-flash-lite-high-priority | vertex_ai/gemini-3.1-flash-lite-preview | pass | p=9 r=113 m=1 t=123 | 2.9s | $0.000173 |
| gemini-3.1-pro | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=128 m=1 t=138 | 2.5s | $0.001566 |
| gemini-3.1-pro-high | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=119 m=1 t=129 | 3.7s | $0.001458 |
| gemini-3.1-pro-priority | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=113 m=1 t=123 | 2.4s | $0.001386 |
| gemini-3.1-pro-high-priority | vertex_ai/gemini-3.1-pro-preview | pass | p=9 r=167 m=1 t=177 | 4.2s | $0.002034 |
| gemini-3.5-flash | vertex_ai/gemini-3.5-flash | pass | p=9 r=71 m=1 t=81 | 1.1s | $0.000661 |
| gemini-3.5-flash-high | vertex_ai/gemini-3.5-flash | pass | p=9 r=120 m=1 t=130 | 2.7s | $0.001102 |
| gemini-3.5-flash-priority | vertex_ai/gemini-3.5-flash | pass | p=9 r=126 m=1 t=136 | 1.4s | $0.001156 |
| gemini-3.5-flash-high-priority | vertex_ai/gemini-3.5-flash | pass | p=9 r=129 m=1 t=139 | 2.7s | $0.001183 |
| google/gpt-oss-120b | vertex_ai/openai/gpt-oss-120b-maas | pass | p=76 t=106 | 703ms | $0.00002 |
| google/gpt-oss-20b | vertex_ai/openai/gpt-oss-20b-maas | pass | p=76 t=108 | 886ms | $0.00001 |
| google/gpt-oss-120b-high | vertex_ai/openai/gpt-oss-120b-maas | pass | p=76 t=150 | 708ms | $0.000046 |
| google/gpt-oss-20b-high | vertex_ai/openai/gpt-oss-20b-maas | pass | p=76 t=119 | 751ms | $0.000014 |
| google/deepseek-r1 | | fail | | | |
| google/qwen-3-coder | vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas | pass | p=17 t=19 | 1.3s | $0.000021 |
| google/qwen-3 | vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas | pass | p=17 t=19 | 808ms | $0.000006 |
| google/gemma-4 | | fail | | | |
| google/codestral | | fail | | | |
| google/glm-5 | vertex_ai/zai-org/glm-5-maas | pass | p=14 t=109 | 2.7s | $0.000318 |
| google/glm-4.7 | vertex_ai/zai-org/glm-4.7-maas | pass | p=14 t=31 | 883ms | $0.000046 |
| google/deepseek-3.2 | vertex_ai/deepseek-ai/deepseek-v3.2-maas | pass | p=13 t=15 | 65.2s | $0.000011 |
| google/kimi-k2-thinking | vertex_ai/moonshotai/kimi-k2-thinking-maas | pass | p=16 t=76 | 1.0s | $0.00016 |
| google/minimax-m2 | | fail | | | |
| google/grok-4.20-reasoning | vertex_ai/xai/grok-4.20-reasoning | pass | p=338 r=171 t=510 | 1.3s | $0.001702 |
| google/grok-4.1-non-reasoning | vertex_ai/xai/grok-4.1-fast-non-reasoning | pass | p=680 r=0 t=681 | 497ms | $0.000037 |
| google/grok-4.1-reasoning | vertex_ai/xai/grok-4.1-fast-reasoning | pass | p=668 r=143 t=812 | 3.6s | $0.000107 |
| google/grok-4.3 | | fail | | | |
| amazon/llama-4-maverick | bedrock/us.meta.llama4-maverick-17b-instruct-v1:0 | pass | p=44 r=0 m=2 t=46 | 426ms | $0.000012 |
| amazon/llama-4-scout | bedrock/us.meta.llama4-scout-17b-instruct-v1:0 | pass | p=44 r=0 m=2 t=46 | 297ms | $0.000009 |
| amazon/gpt-oss-120b | bedrock/openai.gpt-oss-120b-1:0 | pass | p=76 r=26 m=11 t=113 | 1.0s | $0.000034 |
| amazon/gpt-oss-20b | bedrock/openai.gpt-oss-20b-1:0 | pass | p=76 r=23 m=11 t=110 | 5.1s | $0.000016 |
| lbl/gemma-4 | hosted_vllm/gemma-4 | pass | p=22 t=24 | 2.3s | |
| lbl/gemma-4-thinking | hosted_vllm/gemma-4-thinking | pass | p=25 t=95 | 533ms | |
| lbl/gemma-4-mini | hosted_vllm/gemma-4-mini | pass | p=76 t=106 | 1.0s | |
| lbl/gemma-4-mini-thinking | hosted_vllm/gemma-4-mini-thinking | pass | p=76 t=121 | 1.4s | |
| lbl/gpt-oss-20b | hosted_vllm/gpt-oss-20b | pass | p=76 t=99 | 857ms | |
| lbl/gpt-oss-20b-high | hosted_vllm/gpt-oss-20b-high | pass | p=76 t=111 | 1.2s | |
| lbl/cborg-chat | hosted_vllm/gemma-4 | pass | p=22 t=24 | 200ms | |
| lbl/cborg-deepthought | hosted_vllm/gemma-4-thinking | pass | p=25 t=73 | 9.7s | |
| lbl/cborg-coder | hosted_vllm/gemma-4-thinking | pass | p=25 t=107 | 13.8s | |
| lbl/cborg-coder-fast | hosted_vllm/gemma-4 | pass | p=22 t=24 | 206ms | |
| lbl/cborg-mini | hosted_vllm/gemma-4-mini-thinking | pass | p=76 t=105 | 1.0s | |
| lbl/cborg-mini-fast | hosted_vllm/gemma-4-mini | pass | p=76 t=116 | 1.3s | |
| lbl/cborg-privacy-filter | hosted_vllm/privacy-filter | pass | p=6 t=12 | 313ms | |
| lbl/cborg-safeguard | hosted_vllm/gpt-oss-safeguard-20b | pass | p=76 t=105 | 317ms | |
| lbl/cborg-safeguard-high | hosted_vllm/gpt-oss-safeguard-20b-high | pass | p=76 t=108 | 344ms | |
| lbl/cborg-ocr | hosted_vllm/gemma-4 | pass | p=22 t=24 | 208ms | |
| lbl/cborg-ocr-fast | hosted_vllm/gemma-4-mini | pass | p=76 t=104 | 980ms | |
| lbl/cborg-vision | hosted_vllm/gemma-4-thinking | pass | p=25 t=97 | 376ms | |
| lbl/cborg-vision-fast | hosted_vllm/gemma-4-mini-thinking | pass | p=76 t=96 | 783ms | |

Embedding Models

| Model | Underlying Model | Status | Dimensions | Time | Cost |
| --- | --- | --- | --- | --- | --- |
| nomic-embed-text | openai/nomic-embed-text | pass | 768 | 224ms | |
| nomic-embed-vision | | fail | | | |
| nomic-embed-code | | fail | | | |
| nomic-embed-text-test | openai/nomic-embed-text | pass | 768 | 1.2s | |
| nomic-embed-vision-test | | fail | | | |
| nomic-embed-code-test | | fail | | | |
| text-embedding-ada-002 | openai/text-embedding-ada-002 | pass | 1536 | 486ms | $0.000001 |
| nova-2-embed-multimodal | bedrock/amazon.nova-2-multimodal-embeddings-v1:0 | pass | 3072 | 661ms | $0.000104 |
| titan-embed-text-v1 | bedrock/amazon.titan-embed-text-v1 | pass | 1536 | 332ms | $0.000001 |
| titan-embed-image-v1 | bedrock/amazon.titan-embed-image-v1 | pass | 1024 | 336ms | $0.000009 |
| titan-embed-text-v2 | bedrock/amazon.titan-embed-text-v2:0 | pass | 1024 | 334ms | $0.000002 |
| cohere-embed-multilingual-v3 | bedrock/cohere.embed-multilingual-v3 | pass | 1024 | 509ms | $0.000001 |
| cohere-embed-english-v3 | bedrock/cohere.embed-english-v3 | pass | 1024 | 355ms | $0.000001 |
| cohere-embed-v4 | bedrock/cohere.embed-v4:0 | pass | 1536 | 581ms | $0.000001 |
| gemini-embedding-001 | vertex_ai/gemini-embedding-001 | pass | 3072 | 569ms | $0.000001 |
| text-embedding-004 | vertex_ai/text-embedding-004 | pass | 768 | 323ms | $0.000001 |
| lbl/nomic-embed-text | openai/nomic-embed-text | pass | 768 | 1.3s | |
| lbl/nomic-embed-vision | | fail | | | |
| lbl/nomic-embed-code | | fail | | | |
| lbl/nomic-embed-text-test | openai/nomic-embed-text | pass | 768 | 1.2s | |
| lbl/nomic-embed-vision-test | | fail | | | |
| lbl/nomic-embed-code-test | | fail | | | |

Auto-generated by cborg-etc/bin/test-all-models.sh. Run the script and publish to update.