The rise of artificial intelligence is riding on the back of an enormous data centre expansion.
Data centres are projected to account for anywhere from 9% to 17% of total electricity usage in the US by the end of the decade. Today, about a third of data centre electricity is devoted to cooling the chips that run AI models.
That is the process Ferveret is working to make more efficient. The startup, founded by Reza Azizian, a former MIT postdoc in nuclear engineering, and Matteo Bucci, MIT’s Esther and Harold E Edgerton Associate Professor in the Department of Nuclear Science and Engineering, is adapting an approach from nuclear reactors to cool chips using no water and significantly less electricity.
The company’s cooling system submerges computer servers in a specialised liquid that absorbs heat much more efficiently than air from a fan. What makes the solution different from other liquid cooling systems are the bubbles: Ferveret’s Adaptive Phase Cooling (APC) solution produces much smaller bubbles at the surface of the server, which detach more frequently, accelerating the heat transfer process.
Ferveret’s cooling system submerges computer servers in a specialized liquid that absorbs heat much more efficiently than air from a fan. What makes the solution different from other liquid cooling systems are smaller bubbles at the surface of the server, which detach more frequently, accelerating the heat transfer process. Image: Courtesy of Ferveret.
Ferveret is already testing its solutions with companies including CleanSpark, the data centre developer and operator, as well as FuriosaAI, an AI accelerator company, and Switch, one of the largest data centre operators in the US.
In a recent study in collaboration with the Samueli Computer Science Department at the University of California at Los Angeles, Ferveret found its APC solution led to a 15% improvement in computational power efficiency compared to state-of-the-art liquid cooling solutions. By combining those savings with Ferveret’s power control system to optimise operating conditions, the company says it allows data centres to get 35% more tokens – small pieces of text or data – from their AI models with the same amount of power.
“Our goal is to make data centres as sustainable as possible and help them use every single watt of power to generate tokens, which are the most useful outputs,” says Azizian. “Our system enables the operation of more powerful chips, it helps data centres waste a lot less energy, and it accomplishes all that with zero water consumption.”
From nuclear reactors to AI
Azizian was a postdoc at MIT in 2013 when he met Bucci, who was then a research scientist. With funding from the MIT Energy Initiative, they worked on heat transfer in nuclear reactors before Azizian went into industry, where he shifted his focus to cooling chips. Azizian first worked on Microsoft’s HoloLens augmented reality headset and then joined Nvidia, which produces the graphical processing units companies use to train and run the latest AI models. Meanwhile, Bucci continued conducting research at MIT, becoming an assistant professor in 2016.
Azizian walked into his first data centre in 2017, where he was struck by the huge, noisy fans that filled the building as they cooled.
“I thought, ‘holy crap, this is not how you cool facilities,’” says Azizian, noting air cooling can still take up 40% of the power going into a data centre. “It was not an efficient way of doing things, but since it wasn’t hurting the performance, no one cared that the cooling technology was 50 years old.”
Azizian began talking with Bucci about applying their knowledge around optimizing heat transfer in nuclear reactors to data centres. Scientists have spent decades finding better ways to move heat in nuclear reactors.
“Heat transfer determines how much energy you can extract from the reactor core, which translates directly to revenue,” explains Azizian.
The founders started Ferveret in 2021. A lot has changed since Azizian walked into his first data centre. Chip companies have packed more and more components onto their chips as the explosion in artificial intelligence has put a premium on squeezing as much computing capacity as possible out of limited power supplies.
That has driven data centre operators to use liquid to cool chips – often through a technique known as immersion cooling that submerges chips in liquid. The most effective form of immersion cooling brings the liquid to a boil.
“Liquid is a better heat transfer medium than air. That’s why when you stick your hand into room temperature water it still feels cold,” explains Bucci. “When liquid is boiling, it becomes even better at removing heat because the phase change requires a lot of energy, which is the energy you remove from the chip. That lets you transfer large quantities of heat with minimal temperature differences between the chips and the liquid.”
Unfortunately, boiling liquid adds complexity to the system because it forces operators to capture and reliquefy the bubbles while controlling for pressure, temperature, and fluid inventory.
Ferveret’s system is adapted from a process in nuclear reactors called subcooled boiling. It uses a liquid with a low boiling point and none of the toxic PFAS 'forever chemicals' that other approaches rely on. At the surface of the chip, Ferveret’s liquid produces smaller bubbles than other immersion cooling approaches. Those bubbles detach more frequently and quickly recondense in the surrounding liquid, accelerating the bubble-rewetting cycle at the surface of the chip to hasten heat transfer.
Ferveret delivers its APC system in small boxes, each of which houses one server. The founders say their modular systems make it easier to deploy the system and simplify maintenance.
“The physics enable us to get to form factors that weren’t possible in the past,” says Azizian. “Most immersion cooling solutions are large tanks that people submerge the servers in. We have a smaller, modular rack-mounted solution that makes it adaptable to the current infrastructure, so it’s easier for people to deploy our technology.”
Ferveret also offers control software that adjusts the power going to each server in real time to further improve efficiency.
“We deliver full-stack systems that include the cooling box, the rack, the cooling distribution units, and sensors that measure the temperature and pressure,” says Bucci. “Our software monitors those sensors and optimises the operating condition inside each box to ensure that energy consumption is minimised in the system.”
AI with fewer resources
In addition to helping data centres to run more efficiently, Ferveret is also improving sustainability by making it easier to operate data centres in remote regions with more renewable energy.
“The sun shines in places where you don’t have much water, so the advantage of us being water-free is we allow you to build data centres where you have solar energy but nothing to cool the data centre down,” says Bucci. “This technology can help deploy data centres in regions where normally you wouldn’t have the resources to do so, including Africa, the Middle East, and of course parts of America. It’s a huge unlock.”
Ferveret is in talks with the large cloud computing companies known as hyperscalers, and is currently part of Nvidia’s Inception programme for startups. The company plans to announce expanded partnerships later this year. From there, the founders plan to quickly scale their technology to help the AI industry continue to grow without further straining the planet.
“The computing industry is facing a huge challenge in the form of access to power, and they have a problem with access to water in many regions,” says Azizian. “That will only become more limiting as the industry grows. The main goal for these data centre operators would be to get more tokens from the power they have. We’ve shown we can do that.”