"It's trust, execution, and quality" — AMD head outlines the challenges for data centers, and how it offers the ideal end to end solution

Trending 1 month ago
AMD Epyc Turin 3D model
(Image credit: AMD)

AMD’s caller Advancing AI arena saw nan merchandise of caller 5th procreation EYPC processors, MI325x accelerators, and caller networking technology among a scope of announcements.

These caller products person nan extremity of easing nan issues faced by ageing datacenters including managing move workloads, power efficiency, and space.

I sewage nan chance to beryllium down pinch AMD's Senior Vice President and General Manager, Server Business Unit, Dan McNamara to talk astir what nan early holds for information centers and HPCs, why AMD has been truthful successful astatine expanding their marketplace stock successful information centers, arsenic good arsenic really networking exertion is nan breakthrough datacenters and HPCs request to push nan advancement of AI moreover further.

Interview with:

An image of Dan McNamara

When you re-enter a marketplace for illustration we did successful 2017, it’s astir 3 things. It’s trust, execution, and quality. Customers really request to cognize you’re going to get them products. Then it’s astir perf per watt per dollar. That’s what we’ve driven our roadmap connected 100%.

When we’re looking astatine a caller programme aliases caller product, nan first point (Dr) Lisa (Su, AMD CEO) asks is show maine nan progression. So, nan economical worth has to beryllium there. Then its easy to adopt.

We’ve gotten amended pinch each generation. We’ve conscionable deed our stride. Milan was a really large inflection constituent for us. Naples and Rome were really bully and collapsed into nan unreality but Milan really expanded nan aperture for america crossed endeavor and cloud.

How do you scheme connected continuing this growth?

Now, we are nary longer a contender, we are now viewed arsenic nan leader. So, that’s what sewage america here. But what sewage you present doesn’t get you there, right? So, for maine it’s astir 3 things. Extend nan merchandise leadership. Perf per core, per socket, density, power efficiency, TCO, each that, crossed a very wide group of unreality and endeavor workloads. Then, reside nan 2 awesome basking topics, AI and nan refresh of aged ageing fleets.

Sign up to nan TechRadar Pro newsletter to get each nan apical news, opinion, features and guidance your business needs to succeed!

The cycles person been elongated. I was pinch a customer a week agone and he told maine complete 50% of my fleet is 4 years old. So, driving refresh and consolidation is captious but past pinch that is, really do you thief them pinch nan refresh and besides thief them pinch package licensing? Broadcom has created a batch of disruption pinch their caller pricing model. There is besides AI. CPU is captious successful this mixed workload environment.

It’s very uncommon that you spot personification say, I’m going to prime that server CPU because I’m going to tally AI each time connected it. If that’s nan case, we urge that they usage accelerators. Anything supra 30% of AI workloads should beryllium put connected accelerators. So if it’s 80% - 20% pinch 80% being accepted workloads and 20% being for AI, we win. We’ve shown that we’re optimised for wide purpose.

For AI workloads, do you work together pinch nan connection that we’re now seeing diminishing returns connected compute which was reported by information scientists moving astatine LUMI and that we request to attraction connected improving nan network?

Network is ace critical. It’s benignant of for illustration representation bandwidth. Network bandwidth is very similar. You person to provender nan cores. It’s akin connected nan web betwixt nan GPUs and nan backside and nan beforehand side. Frontside is important but backside is really captious to support these GPUs each clustered and information flowing. When LUMI was built they didn’t person a backside, it uses a coherent cache interface betwixt nan CPU and nan GPU. GPUs are for parallel processing truthful it’s ace critical. This is why we are building AI NIC to grow that.

We are going to break through. We had meal pinch a large hyper scaler connected Monday and moreover they’re amazed that location is nary extremity successful show for compute. It’s exciting.

On getting much compute, what effect do export bans person connected nan improvement of chips, if any?

We judge because of our architecture, we tin adhere to that and still work different regions for illustration China. Keep successful mind, they’re going to lick nan problem themselves. Ultimately they’re going to person to. At this time, anyone who is building a GPU aliases accelerator is uncovering a measurement to travel successful nether nan requirements that person been set.

So, it’s not stifling?

No.

OK, truthful backmost to AI workloads. AI workloads require a batch much power than accepted workloads. Whose headache should this beryllium and is this a information erstwhile processing chips?

I really judge that it starts astatine nan state level. Lets look it. You don’t want to autumn that acold down successful technology, I don’t attraction what state you’re from. So, I deliberation it starts there. Then I deliberation its nan datacenter providers. It’s a large issue. In nan US you cannot moreover spell crossed authorities lines. If you want to build retired AI compute from California to Phoenix, it’ll return you a twelvemonth to get immoderate shape of agreement. Across authorities lines is very very difficult pinch transmission of power. So, that’s a problem nan US needs to lick and different countries.

On capacity and ratio and I deliberation they spell manus successful hand. If you look astatine what we did pinch EYPC, we accrued our capacity per watt but we besides accrued nan TDP connected nan wide chip. So yes, capacity went up and powerfulness depletion went up but it’s for illustration erstwhile you tally a business you want your apical statement to beryllium going up astatine a overmuch steeper slope than your expenditure. So, that’s nan measurement I look astatine capacity and efficiency.

Is it easier to push that apical statement higher aliases support that bottommost statement lower?

It varies. So, pinch Turnin, very interesting, I’ve had immoderate of nan hyper scalers opportunity 'I don’t want much perf, I want little powerfulness and little cost'. I person others saying 'I want perf per dollar and I’ll eat immoderate of nan power'. So it varies connected nan strategy of nan provider. At nan extremity of nan time we’re providing chips but they’re providing nan services.

We tin do either and I tin springiness you an example. For perf to price, to trim nan wide value we lowered nan TDP to a constituent wherever we were happy pinch nan perf and that reduced nan wide cost. Whereas pinch nan different 1 it’s a small harder because it’s purely 'I don’t want your perf springiness maine nan aforesaid and little cost'. We show customers, present is wherever you’re connected nan curve and if you want to run here, we tin do it, if you want to run there, we tin do that too.


More from TechRadar Pro

  • These are nan best web hosting services around
  • Best unreality hosting providers
  • Hostinger usage AMD EYPC chips successful their VPS hosting plans

James is simply a tech journalist covering interconnectivity and integer infrastructure arsenic nan web hosting editor astatine TechRadar Pro. James stays up to day pinch nan latest web and net trends by attending datacenter summits, WordPress conferences, and mingling pinch package and web developers. At TechRadar Pro, James is responsible for ensuring web hosting pages are arsenic applicable and arsenic adjuvant to readers arsenic imaginable and is besides looking for nan champion deals and coupon codes for web hosting.

More
Source Technology
Technology