Business

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

Published

12 months ago

November 29, 2023

At its re:Invent conference, Amazon’s AWS cloud arm today announced the launch of SageMaker HyperPod, a new purpose-built service for training and fine-tuning large language models. SageMaker HyperPod is now generally available.

Amazon has long bet on SageMaker, its service for building, training and deploying machine learning models, as the backbone of its machine learning strategy. Now, with the advent of generative AI, it’s maybe no surprise that it is also leaning on SageMaker as the core product to make it easier for its users to train and fine-tune large language models (LLMs).

Image Credits: AWS

“SageMaker HyperPod gives you the ability to create a distributed cluster with accelerated instances that’s optimized for disputed training,” Ankur Mehrotra, AWS’ general manager for SageMaker, told me in an interview ahead of today’s announcement. “It gives you the tools to efficiently distribute models and data across your cluster — and that speeds up your training process.”

He also noted that SageMaker HyperPod allows users to frequently save checkpoints, allowing them to pause, analyze and optimize the training process without having to start over. The service also includes a number of fail-safes so that when a GPUs goes down for some reason, the entire training process doesn’t fail, too.

“For an ML team, for instance, that’s just interested in training the model — for them, it becomes like a zero-touch experience and the cluster becomes sort of a self-healing cluster in some sense,” Mehrotra explained. “Overall, these capabilities can help you train foundation models up to 40 percent faster, which, if you think about the cost and the time to market, is a huge differentiator.”

Image Credits: AWS

Users can opt to train on Amazon’s own custom Trainium (and now Trainium 2) chips or Nvidia-based GPU instances, including those using the H100 processor. The company promises that HyperPod can speed up the training process by up to 40%.

The company already has some experience with this using SageMaker for building LLMs. The Falcon 180B model, for example, was trained on SageMaker, using a cluster of thousands of A100 GPUs. Mehrotra noted that AWS was able to take what it learned from that and its previous experience with scaling SageMaker to build HyperPod.

Image Credits: AWS

Perplexity AI’s co-founder and CEO Aravind Srinivas told me that his company got early access to the service during its private beta. He noted that his team was initially skeptical about using AWS for training and fine-tuning its models.

“We did not work with AWS before,” he said. “There was a myth — it’s a myth, it’s not a fact — that AWS does not have great infrastructure for large model training and obviously we didn’t have time to do due diligence, so we believed it.” The team got connected with AWS, though, and the engineers there asked them to test the service out (for free). he also noted that he has found it easy to get support from AWS — and access to enough GPUs for Perplexity’s use case. It obviously helped that the team was already familiar with doing inference on AWS.

Srinivas also stressed that the AWS HyperPod team focused strongly on speeding up the interconnects that link Nvidia’s graphics cards. “They went and optimized the primitives — Nvidia’s various primitives — that allow you to communicate these gradients and parameters across different nodes,” he explained.

Related Topics:featured

Up Next

European consumer groups band together to fight Meta’s self-serving ad-free sub — branding it ‘unfair’ and ‘illegal’

Don't Miss

Here’s how I catch every NBA play with Sling Orange

Entertainment7 days ago

Greatest birthday gift ideas for women: What to get for your mom, sister, wife, daughter, or greatest friend

Entertainment7 days ago

‘Arcane’ Season 2 review: The greatest fantasy show of 2024, hands-down

Entertainment7 days ago

Greatest 50th birthday gifts: Celebrate half a century with the perfect present

Entertainment4 days ago

How to watch Pharrell’s ‘Piece by Piece’ at home: When is it streaming?

Entertainment7 days ago

Giant telescope’s own powerful radiation may have contributed to collapse

Entertainment7 days ago

‘Heretic’s intense ending, explained | Mashable

Entertainment4 days ago

‘Gladiator II’ review: Ridley Scott grapples with modern masculinity in ancient Rome

Entertainment3 days ago

BookTok’s growing rift over politics is heating up

The Televisor

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

Business

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

‘Interior Chinatown’ review: A very ambitious, very meta police procedural spoof

6 gadgets to help keep your home clean, from robot vacuums to electric scrubbers

Greatest birthday gifts for men: Practical and posh presents that are sure to please

Stocking up on holiday gift cards? Watch out for this scam.

Trump taps Musk for ‘Department of Government Efficiency’: What it is and what’s at risk.

Trump appoints Elon Musk to DOGE, a new U.S. government department

BookTok’s growing rift over politics is heating up

How to watch Pharrell’s ‘Piece by Piece’ at home: When is it streaming?

‘Gladiator II’ review: Ridley Scott grapples with modern masculinity in ancient Rome

‘Heretic’s intense ending, explained | Mashable

Huge NASA spacecraft is flying to a perilous part of the solar system

Instagram launches new tools to prevent sextortion, especially among teens

‘Venom: The Last Dance’ review: Half a great, stupid movie

How 6 generations of iPhone captured 20 years of motherhood in ‘Motherboard’

Flirting IRL is having a pop culture moment, from ‘Chicken Shop Date’ to Charli xcx

‘Here’ review: Robert Zemeckis, Tom Hanks, and Robin Wright reunite

The 34 greatest Australian horror films (and where to watch them)

Monday Night Football’s onside-kick rule confusion showed how Google can spread misinformation online

How to watch the 2024-2025 NBA season without cable: The greatest streaming deals

When do Black Friday sales start? What to expect from Walmart, Greatest Buy, and more

‘Interior Chinatown’ review: A very ambitious, very meta police procedural spoof

6 gadgets to help keep your home clean, from robot vacuums to electric scrubbers

Greatest birthday gifts for men: Practical and posh presents that are sure to please

Stocking up on holiday gift cards? Watch out for this scam.

Trump taps Musk for ‘Department of Government Efficiency’: What it is and what’s at risk.

Trump appoints Elon Musk to DOGE, a new U.S. government department

BookTok’s growing rift over politics is heating up

How to watch Pharrell’s ‘Piece by Piece’ at home: When is it streaming?

‘Gladiator II’ review: Ridley Scott grapples with modern masculinity in ancient Rome

‘Heretic’s intense ending, explained | Mashable

Trending

The Televisor

Amazon SageMaker HyperPod makes it easier to train and fine-tune LLMs

You may like

‘Interior Chinatown’ review: A very ambitious, very meta police procedural spoof

6 gadgets to help keep your home clean, from robot vacuums to electric scrubbers

Greatest birthday gifts for men: Practical and posh presents that are sure to please

Stocking up on holiday gift cards? Watch out for this scam.

Trump taps Musk for ‘Department of Government Efficiency’: What it is and what’s at risk.

Trump appoints Elon Musk to DOGE, a new U.S. government department

BookTok’s growing rift over politics is heating up

How to watch Pharrell’s ‘Piece by Piece’ at home: When is it streaming?

‘Gladiator II’ review: Ridley Scott grapples with modern masculinity in ancient Rome

‘Heretic’s intense ending, explained | Mashable

Huge NASA spacecraft is flying to a perilous part of the solar system

Instagram launches new tools to prevent sextortion, especially among teens

‘Venom: The Last Dance’ review: Half a great, stupid movie

How 6 generations of iPhone captured 20 years of motherhood in ‘Motherboard’

Flirting IRL is having a pop culture moment, from ‘Chicken Shop Date’ to Charli xcx

‘Here’ review: Robert Zemeckis, Tom Hanks, and Robin Wright reunite

The 34 greatest Australian horror films (and where to watch them)

Monday Night Football’s onside-kick rule confusion showed how Google can spread misinformation online

How to watch the 2024-2025 NBA season without cable: The greatest streaming deals

When do Black Friday sales start? What to expect from Walmart, Greatest Buy, and more

‘Interior Chinatown’ review: A very ambitious, very meta police procedural spoof

6 gadgets to help keep your home clean, from robot vacuums to electric scrubbers

Greatest birthday gifts for men: Practical and posh presents that are sure to please

Stocking up on holiday gift cards? Watch out for this scam.

Trump taps Musk for ‘Department of Government Efficiency’: What it is and what’s at risk.

Trump appoints Elon Musk to DOGE, a new U.S. government department

BookTok’s growing rift over politics is heating up

How to watch Pharrell’s ‘Piece by Piece’ at home: When is it streaming?

‘Gladiator II’ review: Ridley Scott grapples with modern masculinity in ancient Rome

‘Heretic’s intense ending, explained | Mashable

Trending