The rise of generative AI has been powered by Nvidia and its advanced GPUs. As demand far outstrips supply, the H100 has become highly sought after and extremely expensive, making Nvidia a trillion-dollar company for the first time.
It’s also prompting customers, like Microsoft, Meta, OpenAI, Amazon, and Google to start working on their own AI processors. Meanwhile, Nvidia and other chip makers like AMD and Intel are now locked in an arms race to release newer, more efficient, and more powerful AI chips.
As demand for generative AI services continues to grow, it’s evident that chips will be the next big battleground for AI supremacy.
Aug 15
Geekbench has an AI benchmark now
Photo by Tom Warren / The VergeThe popular benchmarking utility Geekbench has launched a new cross-platform tool to evaluate the performance of devices under AI-heavy workloads. Geekbench AI measures a device’s CPU, GPU, and NPU (neural processing unit) to determine how well it can handle machine learning applications.
Read Article >Geekbench developer Primate Labs has been working on the software using the name Geekbench ML, which launched in preview in 2021, but shifted the name to AI for reasons that seem obvious. To explore how different hardware responds to different AI-related tasks, it evaluates performance based on both accuracy and speed, with support for different frameworks, including ONNX, CoreML, TensorFlow Lite, and OpenVino.
Aug 6
Some good news from Intel.The company says it was able to boot up and load operating systems on Intel 18A-based processors, including Panther Lake AI PC chips. 18A is part of Intel’s roadmap to regain its footing in the processor market.
The good news comes not a moment too soon, as the company recently confirmed that crashing 13th- and- 14th-gen processors are unfixable, then laid off 15,000 employees last week.
Aug 5
The terror machines at Elliot Management view Nvidia as overvalued and say AI isn’t going to live up to the hype.Elliott Management, famous for targeting underperforming companies such as Twitter, says Nvidia is in a bubble, in a new letter to investors.
Many of AI’s supposed uses are “never going to be cost-efficient, are never going to actually work right, will take up too much energy, or will prove to be untrustworthy”, it said.
Elliott says Nvidia is in a ‘bubble’ and AI is ‘overhyped’[Financial Times]
Jul 30
AMD is becoming an AI chip company, just like Nvidia
Illustration by Alex Castro / The VergeAMD just announced its second quarter 2024 earnings today, and the highlight was this: nearly half the company’s sales are now data center products — not chips for personal computers, not game consoles, not embedded chips for industry or vehicles.
Read Article >The company’s data center business has doubled in a single year, and this quarter’s growth was primarily due to a single chip: the AMD Instinct MI300 accelerator, which competes with Nvidia’s infamously influential H100 AI chip. The AMD chip just did over $1 billion in sales in a single quarter, according to CEO Lisa Su, up from its previous milestone of $1 billion cumulatively since its December 2023 debut. (AMD says its Epyc server CPUs also contributed.)
Jul 19
OpenAI wants in on the AI chip business.According to The Information, OpenAI is in discussion with Broadcom and other semiconductor designers about developing its own artificial intelligence chip to address shortages in its supply chain and reduce dependency on Nvidia. OpenAI has apparently also hired former Google chip staffers.
Bloomberg previously reported in January that OpenAI CEO Sam Altman was planning to raise billions of dollars to set up a network of chip factories.
OpenAI Has Talked to Broadcom About Developing New AI Chip[The Information]
Jul 10
AMD will acquire an AI startup for $665 million.The Finland-based Silo AI is described as the “largest private AI lab in Europe” and has provided AI solutions for companies like Phillps, Rolls-Royce, and Unilever. In addition to Silo AI, AMD also acquired the AI startup Nod.ai last year as it aims to keep up with the likes of Nvidia.
Jul 9
a16z is trying to keep AI alive with Oxygen initiative.According to The Information, VC firm Andreessen Horowitz has secured thousands of AI chips, including Nvidia H100 GPUs, to dole out to its AI portfolio companies in exchange for equity. The initiative is aptly named Oxygen, because these chips are that integral to AI companies. The chips are almost impossible to secure for small startups too, because Big Tech companies hoover up all the supply.
Jul 4
Softbank is trying to borrow $10 billion for AI-related projects.Hey, remember the guy who’s responsible for funding WeWork’s delusional business plan? Softbank CEO Masayoshi Son is really into AI and he’s aiming to flood the area with money. His clearest targets are Nvidia chips and energy startups.
Jun 28
Apple Silicon exec joins Rain AI to develop new hardware.Bloomberg reports that Rain AI, which has OpenAI CEO Sam Altman as one of its backers, has hired Apple chip exec Jean-Didier Allegrucci to oversee the development of new AI processors that are supposed to reduce power consumption with “in-memory compute.”
[Allegrucci] has worked and led silicon teams across a broad range of applications, including CPUs, GPUs, NPUs, ISPs, SoCs, and many others....At Apple, he oversaw the development of more than 30 SoCs used for flagship products, including iPhones, Macs, iPads, Apple Watch, and many more.
Jun 18
Nvidia overtakes Microsoft as the world’s most valuable company
Cath Virginia / The VergeLess than two weeks after Nvidia jumped Apple in terms of its overall valuation, the GPU maker has now passed Microsoft to stand as the world’s most valuable company based on the chips it makes that are key to powering a boom in generative AI technology.
Read Article >At the close of trading on Tuesday, its share price stood at $135.58, up $4.60 from the previous day and pushing its market cap to $3.335 trillion. That’s more than Microsoft ($3.32 trillion), Apple ($3.29 trillion), and Google ($2.17 trillion). Nvidia’s shares split 10-for-1 after June 7th, lowering the overall share price, but the spike in the company’s value has been jarring. Its share price has gone up 160 percent in 2024, and the company only passed the $2 trillion mark in February.
Jun 18
Nvidia is the world’s most valuable company at the moment.Riding a valuation pumped up by generative AI and its chips that power many of the tools, Nvidia’s market cap has passed not only Apple but now Microsoft, too, at more than $3.3 trillion, as reported by Bloomberg.
The markets are still open, but the rise has been fast — Nvidia shares are up 160 percent in 2024, passing $2 trillion in February.
Image: BloombergJun 5
Nvidia is now more valuable than Apple at $3.01 trillion
Image: Cath Virginia / The VergeNvidia has become the second most valuable company in the world. On Wednesday afternoon, the chipmaking giant’s market capitalization hit $3.01 trillion, putting it just ahead of Apple at $3 trillion.
Read Article >As Nvidia dominates the AI race with its flagship H100 chip, the company’s market cap has only continued to rise. Nvidia became a $1 trillion company in May 2023, then skyrocketed past $2 trillion in February of this year, making it more valuable than both Amazon and Alphabet.
Jun 4
Even the Raspberry Pi is getting in on AI
Illustration by Cath Virginia / The Verge | Photos by Getty ImagesAs the AI craze continues, even the microcomputer company Raspberry Pi plans to sell an AI chip. It’s integrated with Raspberry Pi’s camera software and can run AI-based applications like chatbots natively on the tiny computer.
Read Article >Raspberry Pi partnered with chipmaker Hailo for its AI Kit, which is an add-on for its Raspberry Pi 5 microcomputer that will run Hailo’s Hailo-8L M.2 accelerator. The kits will be available “soon from the worldwide network of Raspberry Pi-approved resellers” for $70.
May 30
Intel, Google, Microsoft, Meta, and more want to standardize the tech used in AI data centers.The Ultra Accelerator Link (UALink) Promoter Group, will work to create an open standard to help AI accelerators “communicate more effectively” within data centers and boost performance. Other members include AMD, HP, Broadcom, and Cisco — but not Nvidia, which has AI chip-linking tech of its own.
May 22
Nvidia will now make new AI chips every year
Illustration by Alex Castro / The VergeNvidia just made $14 billion worth of profit in a single quarter thanks to AI chips, and it’s hitting the gas from here on out: Nvidia will now design new chips every year instead of once every two years, according to Nvidia CEO Jensen Huang.
Read Article >“I can announce that after Blackwell, there’s another chip. We’re on a one-year rhythm,” Huang just said on the company’s Q1 2025 earnings call.
May 22
Nvidia just made $14 billion of profit in a single quarter thanks to AI chips.Sales jumped 262 percent in Q1 2025 to hit a record $26B in revenue, of which nearly three-quarters ($19.4B) was data center compute — especially its Hopper GPUs for training LLMs and generative AI apps, says Nvidia. Gaming only accounted for $2.6 billion revenue this quarter.
Nvidia’s expecting record revenue again next quarter — $28B. Shovels in a gold rush, people.
Image: NvidiaMay 14
Google announced Trillium, its sixth generation of Tensor processors.CEO Sundar Pichai just announced new Trillium chips, coming later this year, that are 4.7 times faster than their predecessors, as Google competes with everyone else building new AI chips. Pichai also highlighted Axion, Google’s first ARM-based CPU, which the company announced last month.
Google will also be “one of the first” cloud companies to offer Nvidia’s Blackwell GPU starting in 2025.
Correction: Axion was announced last month, not last year. Also, corrected the spelling of Axion.
Image: GoogleMay 9
Apple plans to use M2 Ultra chips in the cloud for AI
Illustration: The VergeApple plans to start its foray into generative AI by offloading complex queries to M2 Ultra chips running in data centers before moving to its more advanced M4 chips.
Read Article >Bloomberg reports that Apple plans to put its M2 Ultra on cloud servers to run more complex AI queries, while simple tasks are processed on devices. The Wall Street Journal previously reported that Apple wanted to make custom chips to bring to data centers to ensure security and privacy in a project the publication says is called Project ACDC, or Apple Chips in Data Center. But the company now believes its existing processors already have sufficient security and privacy components.
May 7
Apple’s ‘Project ACDC’ is creating AI chips for data centers.Apple — like Google, Meta, Microsoft, OpenAI and everyone else this side of Nvidia — is reportedly working on custom server hardware to power AI models as it prepares to introduce a slew of new features.
Over the past decade, Apple has emerged as a leading player designing chips for iPhones, iPads, Apple Watch and Mac computers. The server project, which is internally code-named Project ACDC—for Apple Chips in Data Center—will bring this talent to bear for the company’s servers, according to people familiar with the matter.
Apple watcher Mark Gurman followed up saying a similar-sounding project was canceled and it doesn’t make sense anyway: it would be too expensive, lack differentiation, and Apple prefers on-device AI.
Update: Added Gurman’s rebuttal.
May 6
US plans $285 million in funding for ‘digital twin’ chips research
Illustration by Alex Castro / The VergeThe Biden administration is taking applications for $285 million in federal funding — allotted from the $280 billion CHIPS and Science Act — seeking companies to “establish and operate a CHIPS Manufacturing USA institute focused on digital twins for the semiconductor industry.” The plan for the CHIPS Manufacturing USA institute to establish a “regionally diverse” network to share resources with companies developing and manufacturing both physical semiconductors and digital twins.
Read Article >Digital twins are virtual representations of physical chips that mimic the real version and make it easier to test new processors before they’re put into production to find out how they might react to a boost in power or a different data configuration. According to the press release, digital twin-based research can also leverage tech like AI to speed up chip development and manufacturing in the US.
Apr 30
With $1B in sales, AMD’s MI300 AI chip is its fastest selling product ever.AMD also says an AI PC refresh cycle will help PCs return to growth in 2024, and that 150 software vendors will be developing for AMD AI PCs by year’s end. The company’s top priority is ramping AI data center GPUs, though, which are “tight on supply.” New AI chips are coming “later this year into 2025,” too.
AMD’s Q1 2024 earnings summary. Image: AMDApr 15
OpenAI will give you a 50 percent discount for off-peak GPT use.OpenAI’s Batch API now lets users upload a file of bulk queries to the AI model, like categorizing data or tagging images, with the understanding that they won’t need immediate attention. Promising results within 24 hours lets them run when there is unused compute power, and keeps those pricey GPUs humming around the clock.
Apr 10
Meta’s new AI chips run faster than before
Illustration by Nick Barclay / The VergeMeta promises the next generation of its custom AI chips will be more powerful and able to train its ranking models much faster.
Read Article >The Meta Training and Inference Accelerator (MTIA) is designed to work best with Meta’s ranking and recommendation models. The chips can help make training more efficient and inference — aka the actual reasoning task — easier.
Apr 9
Intel launches new AI accelerator to take on Nvidia’s H100.Intel first introduced its Gaudi 3 AI accelerator last year, but now the company has revealed more details on performance. When compared to the H100 GPU, Intel says its Gaudi 3 accelerator can deliver “50% faster time-to-train on average across the Llama2 models” with better efficiency.
The company also says the Gaudi 3 AI Accelerator will be a “fraction of the cost” of Nvidia’s pricey H100. It will become available to companies like Dell, HPE, and Lenovo in the second quarter of this year.
Image: IntelMar 29
The US is reportedly working on a list of restricted Chinese chipmaking factories.Reuters reports the list could strengthen the Commerce Department’s existing restrictions on US tech shipments to Chinese chip factories. The US government has voiced national security concerns about letting China access US technology to grow its own capabilities.
US companies have complained it’s difficult to know which Chinese factories produce advance chips and are subject to the restrictions, Reuters says.