Cerebras Systems recently announced plans to build six new data centers across North America and Europe to expand its artificial intelligence (AI) inference capacity. The expansion will substantially increase the compute the company can dedicate to AI inference workloads.
The plan allocates 85% of computing capacity to the United States. Three facilities are already operational in Santa Clara and Stockton, California, and Dallas, Texas. New centers will open in Minneapolis (Q2 2025), Oklahoma City, and Montreal (Q3 2025), as well as Atlanta and France (Q4 2025).
At the heart of these new data centers is Cerebras's Wafer Scale Engine (WSE), a specialized chip architecture optimized for AI workloads. The company states its CS-3 system can process 40 million tokens per second on the Llama-70B model, dramatically increasing inference speed. The Oklahoma City facility, expected to house over 300 CS-3 systems, is built to a Tier 3+ standard: engineered to withstand tornadoes and earthquakes, and equipped with triple-redundant power supplies. It is projected to begin operations in June 2025.
Several prominent AI companies have partnered with Cerebras, including French startup Mistral, whose Le Chat assistant runs on the platform, and the AI answer engine Perplexity. Hugging Face and AlphaSense also use the Cerebras platform. The technology is particularly well suited to reasoning models that demand heavy computation and high token throughput, such as DeepSeek-R1 and OpenAI's o3.
This expansion is part of Cerebras's overall 2025 growth strategy. Some facilities will be operated in partnership with UAE-based G42. In Montreal, a new center managed by Enovum, a Bit Digital subsidiary, is expected to go live in July 2025 and is claimed to deliver inference speeds ten times faster than current GPUs.
Cerebras Systems, a US company specializing in AI chips, takes a distinctive design approach: using an entire silicon wafer as a single chip. Its third-generation Wafer Scale Engine, the WSE-3, is already in use at institutions such as Argonne National Laboratory, the Pittsburgh Supercomputing Center, and GlaxoSmithKline. The technology has limitations, however: it lacks native support for CUDA, Nvidia's de facto industry standard, and offers narrower server compatibility than Nvidia-based solutions.
Key Highlights:
🌍 Cerebras plans to build six new data centers in North America and Europe, primarily in the US, with full operation expected in 2025.
⚡ The data centers will utilize unique wafer-scale chips capable of processing 40 million tokens per second.
🤝 Several leading AI companies have partnered with Cerebras to leverage its high-speed inference capabilities.