Inf2 instances announcement at AWS re:Invent 2022
Image Credits:Amazon
Enterprise

Amazon announces preview of new Inf2 instances designed for larger models

As companies build more complex machine learning models, the cost of training and running these models becomes a real issue. AWS has created a series of custom instances to help bring down the cost, and today it introduced a preview of an all-new Inf2 instance for EC2 designed to process data from larger workloads more efficiently.

AWS CEO Adam Selipsky made the announcement today at AWS re:Invent in Las Vegas.

As Selipsky told the AWS re:Invent audience, “Inf1 is great for small-to-medium complexity models, but for larger models, customers have often relied on more powerful instances because they don’t actually have the optimal resource configuration for their inference workloads.”

They did this because up until now, there simply wasn’t another solution available to help bring down the cost and complexity of processing these larger workloads.

“You want to choose the solution that is the best fit for your specific needs, which is why today I’m excited to announce a preview of the Inf2 instance powered by our new inferentia2 chip,” he said.

For folks who need that extra power, Inf2 provides it. “Customers can deploy a 175 billion parameter model for inference on a single instrument with four times higher throughput and 1/10 the latency of Inf1 instances,” he said.

The new instances are available in preview starting today.

Techcrunch event

Join 10k+ tech and VC leaders for growth and connections at Disrupt 2025

Netflix, Box, a16z, ElevenLabs, Wayve, Hugging Face, Elad Gil, Vinod Khosla — just some of the 250+ heavy hitters leading 200+ sessions designed to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss the 20th anniversary of TechCrunch, and a chance to learn from the top voices in tech. Grab your ticket before doors open to save up to $444.

Join 10k+ tech and VC leaders for growth and connections at Disrupt 2025

Netflix, Box, a16z, ElevenLabs, Wayve, Hugging Face, Elad Gil, Vinod Khosla — just some of the 250+ heavy hitters leading 200+ sessions designed to deliver the insights that fuel startup growth and sharpen your edge. Don’t miss a chance to learn from the top voices in tech. Grab your ticket before doors open to save up to $444.

San Francisco | October 27-29, 2025

Read more about AWS re:Invent 2022 on TechCrunch

Topics

, , , , ,
Loading the next article
Error loading the next article