Off-Prem

GenAI will be bigger than the cloud or the internet, Amazon CEO hopes

And Andy Jassy will happily take your money along the way


It's safe to say Amazon CEO Andy Jassy is pretty jazzed about generative AI's potential to drive profits.

"Generative AI may be the largest technology transformation since the cloud — which itself, is still in the early stages — and perhaps since the internet," he wrote in his annual letter to shareholders Thursday. "There never has been a time in Amazon's history where we've felt so much opportunity to make our customer's lives better and easier."

For the e-commerce giant turned cloud titan, the chances to capitalize on the much-hyped technology are many, but just like the move to the cloud, there is plenty of work that needs done first.

The stage, what Jassy describes as the bottom layer, won't come as a surprise to anyone: selling developers the computing resources and software services necessary to develop their own foundation models.

These efforts, he acknowledges, will require a lot of Nvidia GPUs. Even if Amazon can't build the shovels — or, at least not the ones everyone wants — in this AI goldrush, it can still rent them at a massive premium.

At GTC, Nvidia revealed that Amazon's Ceiba supercomputer, which was initially planned to use its existing Hopper GPUs, will actually be built using more than 20,000 Blackwell accelerators.

But, with those resources in high-demand, Jassy says there's growing interest in alternative hardware, highlighting Anthropic's plans to use Amazon's own home-grown Trainium and Inferentia chips to build, train and deploy future foundation models. On the software side, Jassy also highlighted the cloud-giants SageMaker platform which is designed to streamline the process of training said models.

To support these efforts Amazon also plans to expand its datacenter presence with six new regions in Malaysia, Mexico, New Zealand, Saudi Arabia, Thailand, and Germany already in the works.

One step up from this, Jassy says Amazon is well positioned to help enterprises fine-tune, customize, and integrate existing foundation models — something we'll note is computationally much easier to do than train a system from scratch.

 For this Amazon has Bedrock, which it announced in early 2023. According to Jassy, the service has added tens of thousands of users in the year since its launch. It probably doesn't hurt that Amazon is handing out $500k in credits to anyone who wants to play with pre-trained models.

The idea of customizing and building applications using one or more large language models (LLMs) has garnered a lot of attention as of late. During his keynote address this week at Intel Vision, CEO Pat Gelsinger discussed using technologies like retrieval augmented generation (RAG) to make general purpose models, like Llama2, context aware by connecting them to vector databases containing relevant information.

Nvidia CEO Jensen Huang has also sung RAG's praises as one of the key technologies required to bring generative AI apps to the enterprise.

Amazon, for its part, has partnered closely with AI startups, including a $4 billion investment in Anthropic, the model builder behind Claude.

At the top of Jassy's generative AI stack are off-the-shelf applications and services. These are apps like Amazon's Q code assistant, which debuted last fall, as well as internal applications designed to drive sales.

"We're building a substantial number of GenAI applications across every Amazon customer business. These range from Rufus — our new AI-powered shopping assistant — to an even more intelligent and capable Alexa, to advertising capabilities," he said.

While Amazon will continue to develop these kinds of AI tools, Jassy expects most generative AI apps will ultimately be built by others.

"These AWS services, at all three layers of the stack, comprise a set of primitives that democratize this next seminal phase of AI, and will empower internal and external builders to transform virtually every customer experience that we know," he said. ®

Send us news
15 Comments

Amazon splashes $11B on AI datacenters in Georgia

Peach State already home to more than 50 bit barns

Looming energy crunch makes future uncertain for datacenters

But investors still betting big on bit barns thanks to AI and cloud demand

With AI boom in full force, 2024 datacenter deals reach $57B record

Fewer giant contracts, but many more smaller ones, in bit barn feeding frenzy

AI hype led to an enterprise datacenter spending binge in 2024 that won't last

GPUs and generative AI systems so hot right now... yet 'long-term trend remains,' says analyst

UK unveils plans to mainline AI into the veins of the nation

Government adopts all 50 venture capitalist recommendations but leaves datacenter energy puzzle unsolved

Many people are saying a luxury Dubai property developer will blow $20B on US datacenters

A successful real-estate billionaire and Donald Trump walk into a bar...

AI datacenters putting zero emissions promises out of reach

Plus: Bit barns' demand for water, land, and power could breed 'growing opposition' from residents

To save the energy grid from AI, use open source AI, says open source body

Linux Foundation Energy argues rapidly decentralizing electricity sector can't succeed with silos

Schneider Electric warns of future where datacenters eat the grid

Report charts four scenarios from 'Sustainable AI' to 'Who Turned Out The Lights?'

Enterprises in for a shock when they realize power and cooling demands of AI

Energy consumption set to become a key performance indicator by 2027

AI frenzy continues as Macquarie commits up to $5B for Applied Digital datacenters

Bubble? What bubble?

Biden opens federal land to power-hungry AI datacenters

Watt's the problem? Not enough energy to win the arms race?