Nvidia CEO Jensen Huang on what's next for the AI boom

Jensen Huang walked onto the SAP Center stage Monday for his GTC keynote address and did what he does best: turning a product keynote into a zoning hearing for the future. The Nvidia $NVDA +1.65% founder and CEO opened the company’s closely watched developers’ conference by promising a tour through “every single layer” of AI, then spent the next stretch arguing that the company isn’t just selling chips into a hot market. Nope. The company wants to define the whole physical plant of the AI economy: the compute, the networking, the storage, the software, the models, the factories, and — because subtlety is clearly out of season — maybe even the orbital data centers.

The keynote sprayed announcements in every direction, but the real message was tighter than the confetti cannon made it look. Huang wanted investors, customers, and rivals to hear four things clearly: AI demand is still climbing fast enough to justify indecent amounts of spending; inference is now the center of the battlefield; agents are supposed to spill out of chatbots and into the daily machinery of office work; and the next gold rush after digital AI could be physical AI, where robots, autonomous systems and industrial software burn through even more data and infrastructure.

Well, that’s a big number

Huang’s biggest flex was numerical. He marked the 20th anniversary of CUDA, called it the flywheel behind accelerated computing, said computing demand has risen “1 million times over the last few years,” and then raised the stakes by saying he now sees at least $1 trillion in revenue opportunity from 2025 through 2027. That was the keynote’s organizing principle: a public insistence that the AI buildout is still early, still broadening, and still large enough to make today’s spending look, in Nvidia’s telling, like down payment money.

That number also did some quiet cleanup work. Nvidia has spent months fielding the usual questions that arrive whenever a company becomes the main cashier at a capital-spending frenzy: How long can this last, what happens when hyperscalers get religion on costs, and how much of the next phase leaks to custom chips and cheaper alternatives?

Huang’s answer was to widen the lens. The token, GTC’s opening video declared, is the building block of the new AI era. Huang’s point was that the business tied to those tokens won’t stop at training giant models and admiring them in benchmarks. It moves into production, where the meter never stops running.

Inference takes center stage

Perhaps one of the sharpest lines of the keynote was also the simplest: “The inference inflection has arrived.” Huang broke inference into two stages — prefill and decode — and laid out a system in which Nvidia’s Vera Rubin chips handle the prefill work, while Groq-derived silicon tackles decode, the step that actually spits out the answer. That matters because inference is where Nvidia’s next war gets messier. Training made the company rich. Serving hundreds of millions of users in real time is where customers start asking impolite questions about cost, latency, and whether they really need the same silicon for every step.

So Huang’s answer was classic Nvidia. Don’t defend the GPU in isolation, swallow the whole stack. He described Vera Rubin as “a generational leap” built around seven chips and five rack-scale systems, with Nvidia claiming the platform can train large mixture-of-experts models with one-fourth the number of GPUs versus Blackwell and deliver up to 10 times higher inference throughput per watt at one-tenth the cost per token. He also used the keynote to look beyond Rubin to the future platform Feynman, because in Nvidia-land the next generation is already standing in the wings before the current one finishes taking its bow.

That was the deeper tell from San Jose. Huang wasn’t pitching a faster part so much as a bigger dependency. Nvidia announced a Vera Rubin DSX AI factory reference design, DSX simulation tools for planning AI factories before they’re built, and a broader menu of storage, networking, and system components meant to operate as one vertically integrated machine. The message was hard to miss: Stop thinking about servers, start thinking about campuses. Or, if you’re Nvidia, start sending invoices like a utility.

Agents leave the demo stage

If the hardware pitch was about keeping Nvidia at the center of inference, the software pitch was about making sure enterprise AI doesn’t become someone else’s party. Huang said that “Claude Code and OpenClaw have sparked the agent inflection point,” adding that “employees will be supercharged by teams of frontier, specialized, and custom-built agents they deploy and manage.”

Nvidia paired that rhetoric with its Agent Toolkit, OpenShell runtime, and AI-Q blueprint — software it says can help enterprises build autonomous agents with policy guardrails and, in AI-Q’s case, cut query costs by more than 50% through a hybrid mix of frontier and Nvidia’s own open models.

There was a strategic hedge tucked inside all that openness. Nvidia unveiled the Nemotron Coalition with Black Forest Labs, Cursor, LangChain, Mistral, Perplexity, Reflection AI, Sarvam, and Thinking Machines Lab, with the first project set to underpin the coming Nemotron 4 model family. Read the subtext, and it’s pretty clear that Nvidia doesn’t want the future of AI software split neatly between a few giant closed-model vendors and a pile of commodity hardware underneath. It wants a hand in the open-model layer, too — the piece that shapes who gets to build, tune, and own AI outside the walls of the biggest labs.

The robot pitch gets bigger

Huang has been widening Nvidia’s story beyond digital assistants for a while, and GTC pushed that theme even harder. Nvidia announced a Physical AI Data Factory Blueprint with Microsoft $MSFT +1.11% Azure and Nebius that’s meant to automate how training data gets generated, augmented, and evaluated for robotics, vision AI agents, and autonomous vehicles. The pitch is straightforward enough: Real-world data is scarce, edge cases are annoying, and synthetic data plus simulation can turn compute into the raw material these systems need.

Huang also previewed GR00T N2, a next-generation robot foundation model based on DreamZero research that the company says more than doubles success versus leading VLA models on new tasks in new environments. That section of the keynote may wind up aging best. Chatbots got Wall Street excited. Physical AI is the part that could keep the infrastructure binge going for years, because robots, industrial systems, and autonomous machines don’t just need models — they need endless training data, simulation, networking, sensors, and edge compute. Huang even took the story a step further and said Nvidia is going to space, with future Vera Rubin-based systems aimed at orbital data centers and autonomous space operations. Sure, that sounds a little like a man who has discovered there are still a few untouched sectors left on the bingo card. But it also sounds like a company determined to make “AI infrastructure” mean nearly every expensive machine in sight.

By the time Huang was done, the keynote felt bigger than a launch calendar. It read like an empire map. Yes, there was DLSS 5 for graphics, new industrial software tie-ins, telecom edge partnerships, and an avalanche of developer plumbing. But the durable takeaway was simpler and much bigger: Nvidia wants AI to stop being understood as a category of software and start being treated as a utility-scale infrastructure project, with Nvidia’s hardware and software embedded at every layer.

That’s a very Jensen Huang message — neatly merchandised and only slightly modest. The unnerving part for rivals is that, for now at least, he still has plenty of customers willing to build around it.

📬 Sign up for the Daily Brief

Our free, fast and fun briefing on the global economy, delivered every weekday morning.

Sign me up

Beyond Text Generation: An AI Tool That Helps You Write BetterImprove grammar, word choice, and sentence structure everywhere you work. Write better with Grammarly.Grammarly |

SponsoredSponsored

Install Now

Undo

総合保障タイプ充実の保障内容と掛金月々1,800円の手頃さが人気のこくみん共済の「総合保障タイプ」全国労働者共済生活協同組合連合会 |

SponsoredSponsored

詳細

Undo

初回限定【サンプルセット付※】送料無料ニオイやニキビ、乾燥が気になる方に。まるごとフルフルで洗おう!W抗菌成分配合で、ニオイやニキビの原因菌の増殖を抑制。うるおい成分濃グリセリン配合で、うるおいを奪い過ぎずに泡でやさしくケア。効能・効果 皮膚の清浄・殺菌・消毒、体臭・汗臭及びニキビを防ぐ持田ヘルスケア |

SponsoredSponsored

詳細

Undo

【送料無料】初回限定WEB購入冬のカサカサ時期に!うるおい成分濃グリセリン配合(保湿成分)の薬用石鹸がおすすめ。持田ヘルスケア |

SponsoredSponsored

詳細

Undo

60歳になって聞き返すことが増えた私の最適解!ワンタッチ操作の集音器Olive Union |

SponsoredSponsored

続きを読む

Undo

不動産価格が上昇中自宅がこんな金額で売れるなんて!LIFULL HOME’S |

SponsoredSponsored

詳細

Undo

How Much Time Could You Save with Clearer Writing?Customers report Grammarly saves them almost an entire month of work every year. Improve productivity and communicate effectively in 2025 with AI communication assistance at your fingertips.Grammarly |

SponsoredSponsored

Install Now

Undo

社会実装が近づく自動運転、周辺に眠る数々のビジネス機会を深掘り!日経BP × ひとと世界をつなぐ技術の未来 |

SponsoredSponsored

Undo

こくみん共済あなたにぴったりの保障選び。申し込みもWEBで簡単!/こくみん共済 coop全国労働者共済生活協同組合連合会 |

SponsoredSponsored

詳細

Undo

家を売る前に要注意「やってはいけない不動産売却のNG行動」NTTデータG会社運営、住所入力するだけで実家の価値が判明!不動産会社が教えない、家を売る時にNGな行動とはHOME4U不動産売却 |

SponsoredSponsored

詳細

Undo

バイヤーおすすめ360度アクションカメラプロ厳選のマストアイテムAmazon.co.jp |

SponsoredSponsored

Undo

自宅がこんな金額で売れるなんてホームズの売却査定なら【無料】で最大10社までまとめて比較可能LIFULL HOME’S |

SponsoredSponsored

詳細

Undo

Learn 80+ Languages Easily with AI [Join]Discover how personalized, real conversations transform language learning experiencesTalkpal - AI Language Learning |

SponsoredSponsored

Sign Up

Undo

The best used cars to buy right now: Consumer ReportsQuartz

Undo

The 10 best used cars for less than $20,000 right now, according to Consumer ReportsQuartz

Undo

AI is changing the way you learn a languageDiscover how personalized, real conversations transform language learning experiencesTalkpal - AI Language Learning |

SponsoredSponsored

Sign Up

Undo

Play this game for 3 minutes, if you own a mouseNo Installation. Play for free.planetcapture.io |

SponsoredSponsored

Undo

「窓」が体験を生み出すインターフェースに変わる!裸眼のままで車外が3D仮想空間に日経BP × ひとと世界をつなぐ技術の未来 |

SponsoredSponsored

Undo

認知度0%が半年で20%向上、採用ブランディングに取り組む意義自社の魅力発信強化によって採用力を高める「採用ブランディング」が注目されている。電通が2024年に立ち上げた採用支援専門チームに話を聞いた電通グループ |

SponsoredSponsored

続きを読む

Undo

Play this game for 1 minute and see why everyone is crazy about itNo Installation. Play for free.planetcapture.io |

SponsoredSponsored

Undo

MOST POPULAR

What are the best cheap cars to buy right now?Quartz

Undo

MOST POPULAR

The 9 safest cars to buy right now: Consumer ReportsQuartz

Undo

Get started today!Use ForecastEx contracts to trade your view on economic and climate indicators at IBKR!Interactive Brokers |

SponsoredSponsored

Learn More

Undo

不動産売却で損する人のNG行動NTTデータG会社運営、住所入力するだけで実家の価値が判明!不動産会社が教えない、家を売る時にNGな行動とはHOME4U不動産売却 |

SponsoredSponsored

詳細

Undo

バイヤーおすすめカメラ特集プロ厳選のマストアイテムAmazon.co.jp |

SponsoredSponsored

Undo

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments