Elon Musk says the next-generation Grok 3 model will require 100,000 Nvidia H100 GPUs to train


 


Elon Musk, CEO of Tesla and founder of xAI, has made striking predictions regarding the timeline for artificial general intelligence (AGI), suggesting it could exceed human intelligence as early as next year or by 2026. However, he notes that achieving this milestone will require an enormous number of processors, leading to substantial electricity demands, as reported by Reuters.


Musk's company, xAI, is currently working on the second version of its Grok large language model and expects to complete the next training phase by May. Training for Grok's version 2 has utilized approximately 20,000 Nvidia H100 GPUs, and Musk forecasts that future iterations, specifically Grok 3, could require around 100,000 H100 chips.


He highlighted two primary challenges currently hindering AI advancement: supply shortages of advanced processors like Nvidia's H100 and the availability of electricity. The H100 GPU consumes about 700 watts at full capacity, meaning that a deployment of 100,000 GPUs could consume around 70 megawatts of power. Including the additional requirements for servers and cooling, a data center with this number of GPUs could use about 100 megawatts, comparable to the energy consumption of a small city.


Musk emphasized that while the GPU supply has been a significant challenge, the availability of electricity will become increasingly critical in the next couple of years. This dual constraint highlights the difficulties of scaling AI technologies to meet rising computational demands.


Despite these challenges, advancements in computing and memory architectures are expected to enable the training of larger language models in the future. At GTC 2024, Nvidia unveiled its Blackwell B200, a new GPU architecture designed to support the scaling of large language models effectively.



#ElonMusk

 #AGI

#xAI

 #ArtificialIntelligence

#Nvidia

 #GPUs

 #AIChallenges

#EnergyDemand

#TechPredictions

#LargeLanguageModels

#ComputePower

#AIRevolution

 #ProcessorShortages

 #H100



Comments

Popular posts from this blog

Venod Khosla on AI, the Future of Programming, and a World of Abundance

The AI Upset: How a Chinese Villager Shook Silicon Valley with DeepSeek

How scientists are creating real life invisibility: Can We Ever Truly Disappear?