Current Status
All services are operational. Occasional service interruptions may occur as we adjust system configuration.
Recent Updates
Nov 8th, 2024
- Upgraded Claude Haiku to new 3.5 release. Updated pricing for API use.
- Fixed low constrast issue in splash screen image when dark mode enabled.
Oct 31st, 2024
- Upgraded CBorg Chat to Librechat v0.7.5
- Improved layout of login page on mobile devices
- Fixed icons and app name for CBorg Chat installable PWA
Oct 25th, 2024
- Claude Sonnet 3.5 upgraded to the newly released v2 model.
- Endpoint for Anthropic Claude models on CBorg Chat is changed to AWS Bedrock for improved contract terms.
Oct 24th, 2024
- Downtime for power systems upgrades has been postponed. New dates will be announced in the coming weeks.
Oct 15th, 2024
- OpenAI o1 Mini and o1 Preview are available on CBorg Chat via a “streaming mode” middleware adapter
- LBL-hosted fill-in-the-middle (FIM) code completion model fixed and changed to Starcoder 7B
- Fixed a bug with code actions for VSCode Continue plugin when using LBL-hosted CBorg Coder - Continue users should update their config.json.
Oct 11th, 2024
- o1-mini and o1-preview are available on the CBorg API service.
- Removed ChatGPT 3.5 from API and CBorg Chat
Oct 4th, 2024
- Added new CBorg Vision model based on Meta Llama 3.2 90B Vision.
- Integrated Wolfram|Alpha into CBorg Chat.
- Deployed new self-managed API Key Manager, enabling users to create and manage their own API keys.
September 19th, 2024
- Underlying model for CBorg Chat and CBorg Coder changed to Llama 3.1 405b (FP8). Added experimental “CBorg Deepthought” model - based on Chain-of-Thought reasoning prompt. Fixed centering of text in CBorg logo.
September 9th, 2024
- CBorg Vision model changed to Phi 3.5 with vision support - it can describe images and read text. VS Code ‘Continue’ support is now available in early beta for API users.
August 26th, 2024
- Lab-hosted models have been renamed with generic “CBorg” naming; underlying model has been replaced from Llama 3.1 to Mistral Large 2047 (licensed for non-commercial use). Added “Cborg Coder” (also based on Mistral Large with customized system message) and “Cborg Nano” (based on Microsoft Phi 3.5 - lightweight model for summarization and extraction tasks).
August 24th, 2024
- Google Gemini models are now available on the API service.
August 6th, 2024
- Low cost ChatGPT 4o-mini with regional deployment on Azure cloud. Implemented performance improvements to self-hosted chat models: Llama 3.1 and Command R+. New Lab-hosted embedding model
lbl/nomic-embed-text
now available.
July 31st, 2024
- The default temperature of models has been adjusted to 0.5. To customize the behavior of chat models please use a user preset.
July 30th, 2024
- CBORG Chat now supports RAG using CSV, TXT and PDF files.
July 25th, 2024
- Meta Llama 3 70B has been upgraded to Meta Llama 3.1 405B.
July 24th, 2024
- Access to Anthropic Claude through the API service is now working. ChatGPT-4o endpoint bandwidth was increased to accomodate user demand.
July 2nd, 2024
- Added a local API endpoint, https://api-local.cborg.lbl.gov which bypasses Cloudflare for local network clients.
June 24th, 2024
- Upgraded Claude 3.0 Sonnet to the latest Claude 3.5 Sonnet. Details: https://www.anthropic.com/news/claude-3-5-sonnet
June 17th, 2024
- Added LBNL-hosted embedding models e5-large-v2 and NV-Embed-v1, a large 4096-dimension embedding model with a leading position on the MTEB Leaderboard.
June 13th, 2024
- Added graceful failover to LBNL-hosted models in event of server offline
- Corrected incorrect configuration of cost-per-token setting in LBNL-hosted Command R+
June 12th, 2024
- LBNL-Hosted Llama-3 70B is now running on Nvidia H100 node for increased performance
- Custom chat icons have been provided for LBNL-hosted models
June 7th, 2024
- Removed support for ChatGPT 4 (legacy model), replaced by GPT-4o
- Removed support for Google Gemini 1.0 Pro, replaced by Gemini 1.5 Flash and Gemini 1.5 Pro
- Adjusted context window for LBNL-hosted Command R+ to 80K tokens per system memory constraints
- Adjusted maximum context length for commercial models to reasonable limits for chat use to control costs
- Improved model selection drop-down list with model descriptions
June 6th, 2024
- Added support for ChatGPT 4o
- Added support for Google Gemini 1.5 Flash and Google Gemini 1.5 Pro
- Fixed Anthropic Claude endpoints
- Added support for Anthropic Claude Opus