Current Status
All services are operational. Occasional service interruptions may occur as we adjust system configuration.
Lab-hosted systems will be offline Oct 14th through 19th while the cooling and power systems in Building 50 are upgraded.
Recent Updates
September 9th, 2024
- CBorg Nano model changed to Phi 3.5 with vision support - it can describe images and read text in images.
- VS Code ‘Continue’ support is now available in early beta for API users.
August 26th, 2024
- Lab-hosted models have been renamed with generic “CBorg” naming; the underlying model has been changed from Llama 3.1 to Mistral Large 2407 (licensed for non-commercial use). Added “CBorg Coder” (also based on Mistral Large, with a customized system message) and “CBorg Nano” (based on Microsoft Phi 3.5, a lightweight model for summarization and extraction tasks).
August 24th, 2024
- Google Gemini models are now available on the API service.
August 6th, 2024
- Added low-cost ChatGPT 4o-mini with regional deployment on Azure cloud. Implemented performance improvements to self-hosted chat models Llama 3.1 and Command R+. New Lab-hosted embedding model lbl/nomic-embed-text is now available.
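Assuming the API service is OpenAI-compatible (as the other API entries suggest), an embedding call for the new Lab-hosted model could be built as in the sketch below. The base URL, endpoint path, and auth header are assumptions, not documented values; the request is constructed but not sent.

```python
import json

# Sketch of an embedding request for the Lab-hosted nomic model.
# Assumes an OpenAI-compatible /v1/embeddings endpoint; the hostname
# and API key below are placeholders, not confirmed values.
EMBED_URL = "https://api.cborg.lbl.gov/v1/embeddings"  # assumed endpoint

def build_embedding_request(texts, model="lbl/nomic-embed-text"):
    """Return the URL, headers, and JSON body for an embedding call."""
    headers = {
        "Authorization": "Bearer YOUR_API_KEY",  # placeholder credential
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "input": texts})
    return EMBED_URL, headers, body

url, headers, body = build_embedding_request(["Building 50 upgrade notes"])
```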
July 31st, 2024
- The default temperature of models has been adjusted to 0.5. To customize the behavior of chat models, please use a user preset.
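API users can also override the 0.5 default per request. The sketch below builds an OpenAI-style chat-completions payload with an explicit temperature; the model identifier is illustrative, not an official name.

```python
import json

# Sketch: override the server-side default temperature (0.5) per request.
# Assumes an OpenAI-compatible chat-completions payload; the model name
# "lbl/cborg-chat" is a hypothetical placeholder.
def build_chat_request(prompt: str, temperature: float = 0.5) -> str:
    payload = {
        "model": "lbl/cborg-chat",  # hypothetical model identifier
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,  # 0.5 matches the new default
    }
    return json.dumps(payload)

req = build_chat_request("Summarize the outage schedule.", temperature=0.2)
```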
July 30th, 2024
- CBORG Chat now supports RAG using CSV, TXT and PDF files.
July 25th, 2024
- Meta Llama 3 70B has been upgraded to Meta Llama 3.1 405B.
July 24th, 2024
- Access to Anthropic Claude through the API service is now working. ChatGPT-4o endpoint bandwidth was increased to accommodate user demand.
July 2nd, 2024
- Added a local API endpoint, https://api-local.cborg.lbl.gov, which bypasses Cloudflare for clients on the local network.
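Clients on the lab network can point at the local endpoint, while others use the Cloudflare-fronted one. A minimal selection sketch follows; the public hostname and the network-detection flag are assumptions.

```python
# Sketch: choose the Cloudflare-bypassing endpoint when on the lab network.
# The local hostname comes from the changelog; the public hostname is an
# assumed counterpart, and network detection is left to the caller.
LOCAL_API = "https://api-local.cborg.lbl.gov"
PUBLIC_API = "https://api.cborg.lbl.gov"  # assumed public hostname

def select_base_url(on_lab_network: bool) -> str:
    """Return the API base URL appropriate for the client's network."""
    return LOCAL_API if on_lab_network else PUBLIC_API
```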
June 24th, 2024
- Upgraded Claude 3.0 Sonnet to the latest Claude 3.5 Sonnet. Details: https://www.anthropic.com/news/claude-3-5-sonnet
June 17th, 2024
- Added LBNL-hosted embedding models e5-large-v2 and NV-Embed-v1, a large 4096-dimension embedding model with a leading position on the MTEB Leaderboard.
June 13th, 2024
- Added graceful failover for LBNL-hosted models in the event a server goes offline
- Corrected the cost-per-token setting for LBNL-hosted Command R+
June 12th, 2024
- LBNL-hosted Llama-3 70B is now running on an Nvidia H100 node for increased performance
- Custom chat icons have been provided for LBNL-hosted models
June 7th, 2024
- Removed support for ChatGPT 4 (legacy model), replaced by GPT-4o
- Removed support for Google Gemini 1.0 Pro, replaced by Gemini 1.5 Flash and Gemini 1.5 Pro
- Adjusted context window for LBNL-hosted Command R+ to 80K tokens due to system memory constraints
- Adjusted maximum context length for commercial models to reasonable limits for chat use to control costs
- Improved model selection drop-down list with model descriptions
June 6th, 2024
- Added support for ChatGPT 4o
- Added support for Google Gemini 1.5 Flash and Google Gemini 1.5 Pro
- Fixed Anthropic Claude endpoints
- Added support for Anthropic Claude Opus