Nvidia fixes Blackwell chip flaw with help from TSMC, mass production back on schedule

Trending 4 weeks ago

Serving tech enthusiasts for complete 25 years.
TechSpot intends tech study and proposal you can trust.

What conscionable happened? Nvidia has successfully fixed a creation flaw successful its latest Blackwell AI chips, according to CEO Jensen Huang. The issue, which caused accumulation delays, has been solved pinch nan assistance of TSMC, Nvidia's long-standing manufacturing partner. In fact, it was TSMC that primitively spotted nan problem.

Overcoming this issue was important for Nvidia, arsenic it intends to support its ascendant position successful nan AI spot market. As request for high-performance AI computing solutions continues to surge, nan successful motorboat of Blackwell will play a pivotal domiciled successful providing nan basal hardware.

Huang candidly admitted nan company's work for nan setback. "We had a creation flaw successful Blackwell," he said. "It was functional, but nan creation flaw caused nan output to beryllium low. It was 100 percent Nvidia's fault."

The Blackwell chips, unveiled successful March, were primitively slated for second-quarter shipping. However, nan creation flaw led to delays, perchance affecting awesome customers specified arsenic Meta, Google, and Microsoft.

The Blackwell task was unusually complex, Huang said, which whitethorn person been a facet successful nan flaw. "In bid to make a Blackwell machine work, 7 different types of chips were designed from scratch and had to beryllium ramped into accumulation astatine nan aforesaid time."

The method rumor stemmed from nan intricate packaging exertion utilized successful nan Blackwell B100 and B200 GPUs. These chips employment TSMC's CoWoS-L packaging, which utilizes an RDL interposer pinch section silicon interconnect bridges to execute information transportation rates of astir 10 TB/s. The problem arose from a mismatch successful thermal description properties betwixt various components, causing strategy warping and failure.

To reside this, Nvidia modified nan apical metallic layers and bumps of nan GPU silicon, enhancing accumulation yields. While circumstantial specifications of nan hole stay undisclosed, nan institution confirmed that caller masks were required.

The velocity of nan solution is noteworthy. Typically, addressing specified issues successful nan semiconductor manufacture involves modifying metallic layers and creating caller steppings, a process that tin return astir 3 months. "What TSMC did was to thief america retrieve from that output trouble and resume nan manufacturing of Blackwell astatine an unthinkable pace," Huang said.

With nan creation flaw now resolved, wide accumulation of nan fixed Blackwell GPUs is group to statesman successful precocious October. Shipments are expected to commencement successful early 2025, aligning pinch Nvidia's fiscal year.

Despite nan setback, request for Blackwell chips remains high. Huang had antecedently described nan request arsenic "insane," pinch customers eager to beryllium first successful statement for nan caller technology.

Google has ordered complete 400,000 GB200 chips successful a woody exceeding $10 billion. Similarly, Meta has placed a $10 cardinal order, while Microsoft is group to person 55,000 to 65,000 GB200 GPUs fresh for OpenAI by nan first 4th of 2025.

More
Source Tech Spot
Tech Spot