GPU and CPU news and discussions

Tadasuke

AMD Zen CPUs "IPC" uplifts (2017 - 2024)

Post by Tadasuke »

The so called "instructions per clock" uplifts (assuming all run on 4 GHz):

Zen + (2018)
Image

Zen 2 (2019)
Image

Zen 3 (2020)
Image

Zen 4 (2022)
Image

Zen 5 (2024)
Image

Calculating from official AMD slides over the years:
1.15 × 1.19 × 1.13 × 1.16 = 1.794x or 79.4% between Zen and Zen 5

However, as newer programs tend to facilitate using newer chip designs and over the years performance differences tend to increase, real "IPC" improvement is probably close to 2x or +100% between March 2017 and March 2025 (8 years).
User avatar
wjfox
Site Admin
Posts: 13580
Joined: Sat May 15, 2021 6:09 pm
Location: Essex, UK
Contact:

Re: GPU and CPU news and discussions

Post by wjfox »

Nvidia RTX 5000 specs are leaked, and PC gamers are already unhappy with the way next-gen GPUs look to be shaping up

By Darren Allan
published yesterday

Except for the RTX 5090 maybe, but that comes with its own (obvious) concern: pricing

Nvidia’s next-gen GeForce graphics cards remain a hot topic on the rumor mill, and a new post on X is attracting a lot of attention, having spilled some of the potential specs of these Blackwell GPUs.

This comes from regular leaker Kopite7kimi (via VideoCardz), who gives us the rumored – add seasoning – loadout for Streaming Multiprocessors (SMs, which directly reflect the all-important core count), as well as memory bus and type down the product range.

As you can see, the leaker states that GB202 – the top-end chip for Blackwell that’ll power the RTX 5090 – will purportedly offer a maximum count of 192 SMs.

As the leaker notes, that’s worked out using the sum of 12 x 8 – which is the Graphics Processing Clusters or GPCs times the Texture Processing Clusters or TPCs inside – which is 96, and then doubling (which assumes Nvidia sticks with 2 SMs for every TPC) to get the total of 192 SMs.

https://www.techradar.com/computing/gpu ... shaping-up


weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Chinese Chipmaker SMIC Rises to Third Biggest Contract Fab Globally
The company has now joined Global Foundries at the third spot in global market share for contract foundries.
By Josh Norem June 14, 2024
China-based Semiconductor Manufacturing International Corporation (SMIC) has become the world's third-largest contract chip manufacturer, according to an industry research firm. SMIC's ascension has seen it join US-based GlobalFoundries in the competition for market share in the collection of firms that make chips for "fabless" companies such as AMD, Nvidia, Apple, and others.

The reordering of the global rankings of contract foundries such as SMIC comes from Counterpoint Research, which just released its industry report for the first quarter of 2024. In the most recent quarter, SMIC's global market share rose one percentage point from the previous quarter to 6%, allowing it to join GlobalFoundries and topple Taiwan's UMC, according to CNBC. Although SMIC moved up on this list during the most recent quarter, it is still far behind industry heavyweights TSMC and Samsung Foundry.
Foundry market share
https://www.extremetech.com/computing/c ... b-globally
Tadasuke

Re: GPU and CPU news and discussions

Post by Tadasuke »

Regarding Nvidia:
Image

Image

Regarding Intel:
Q3 2014 $4115 145W Haswell Xeon E5-2699v3 (18C|36T, 2.3 GHz) vs Q3 2013 $2614 130W Ivy Bridge Xeon E5-2697v2 (12C|24T, 2.7 GHz) comparison:

Image
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

TSMC Starts 3nm Production for Upcoming Intel CPUs
Both Arrow Lake for desktop and Lunar Lake for mobile will be utilizing TSMC's 3nm process.
By Josh Norem June 19, 2024
Intel has already revealed its next-gen mobile architecture known as Lunar Lake, and in a first for Intel, its compute tile is made by TSMC on its 3nm process. Intel will also be tapping its biggest rival for its upcoming Arrow Lake desktop processors as well, and now a report from Taiwan states TSMC has begun production on the 3nm chips that will be used for both of Intel's upcoming platforms.

TSMC has been cranking out 3nm chips for a little over a year now, but its initial run was reserved for Apple's A17 Pro chip for the iPhone as well as its M3 and M4 processors. Now TSMC is turning its attention to Intel, which will be using its 3nm process for both Lunar Lake and Arrow Lake. As Techpowerup notes, although we know Lunar Lake's compute tile is made on TSMC's 3nm process, it's unclear at this time how Intel will be using these chips for Arrow Lake. It is a huge deal for Intel to turn to a competing foundry to make its CPUs, so it's unclear if Lunar Lake is just a one-off deal. Plus, Arrow Lake has been touted as the first Intel CPUs to use its cutting-edge Intel 20A process.
https://www.extremetech.com/computing/t ... intel-cpus
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Intel Is Using TSMC 4nm for Upcoming Battlemage GPUs: Report
The chances of Intel's next-gen GPUs being competitive are starting to increase.
By Josh Norem July 9, 2024
https://www.extremetech.com/gaming/inte ... pus-report
After many delays, Intel launched its first-generation Arc Alchemist GPUs in late 2022, and they were still somewhat behind the 8-ball even at launch. Intel launched its cards using the mature TSMC 6nm node, whereas rivals AMD and Nvidia were already one node ahead, using TSMC 5nm for Ada Lovelace and RDNA 3 GPUs. Intel appears to have learned a lesson for its follow-up architecture codenamed Battlemage. It will use TSMC 4nm, which is roughly the same node as Nvidia's next-gen graphics and likely the same as AMD's upcoming RDNA 4.

Word of AMD's plans comes from a paywalled article on DigiTimes and flagged by TechPowerUp. The article notes Intel has selected TSMC's 4nm node for Battlemage, which should provide a significant uplift from its previous generation GPUs while putting it in the ballpark of its biggest rivals. Intel will also move from deep ultraviolet lithography (DUV) on the 6nm node to extreme ultraviolet (EUV) at 4nm as part of the node upgrade. The move to 4nm could allow Intel to practically double the number of Xe cores in Battlemage due to the inherent advantage of the node switch. Intel is also expected to offer sizable gains in ray tracing and upscaling with Battlemage.
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Intel Arrow Lake Desktop CPUs to Feature Built-in NPU With 13 TOPS
Intel putting an NPU into its next-gen CPU is notable because AMD isn't doing the same.
By Josh Norem July 9, 2024
https://www.extremetech.com/computing/i ... th-13-tops
As you may have heard, the era of the "AI PC" is upon us (ugh). The first wave of these machines has already arrived in the form of Copilot+ PCs using Qualcomm CPUs, and now this phenomenon might also bleed into desktops. According to a new leak, Intel's upcoming Arrow Lake desktop CPUs will offer a neural processing unit (NPU) to improve its AI performance, which is notable for two reasons: First, Intel has never offered an NPU on a desktop chip before, and second, AMD has already announced four Zen 5 Ryzen 9000 CPUs, which lack NPUs.
Tadasuke

AMD Strix Point vs Hawk Point iGPU performance improvement

Post by Tadasuke »

Image
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

AMD Announces Second-Generation Fluid Motion Frames Technology
The company said it offers more controls for power users and additional AI-optimized enhancements.
By Josh Norem July 29, 2024
AMD has announced that the second generation of its frame-generation technology is now available via a new preview version of its Adrenalin software. The company says the latest version contains many improvements and is a "significant upgrade" over the first version. The biggest changes include more fine-grain controls for those who want to tweak their settings, support for more gaming APIs, and options for folks with older hardware. AMD is releasing it to gamers today in technical preview form, so proceed with caution.

The company calls this technology AMD Fluid Motion Frames 2, so when abbreviated it's now AFMF2—a bit clunky, but we'll allow it. A lot of changes are in version two, with AMD adding AI to the mix, support for more APIs, more controls for those who prefer to tweak their settings, and better overall performance, according to AMD. When the new version is enabled for a game, the optimal settings will be automatically applied, but so-called advanced users can now fine-tune some settings. This is not how the previous version worked, where it was like Nvidia's frame generation in that it was a binary switch; either it was on or off.
https://www.extremetech.com/gaming/amd- ... technology
Tadasuke

AMD STRIX POINT

Post by Tadasuke »

AMD Strix Point mobile APU silicon die pictured and annotated:
Image

It measures 12.06 mm x 18.71 mm (length x width), compared to the 9.06 mm x 15.01 mm of the current Phoenix APU. The process has been improved from TSMC N4 on Phoenix and its derivative Hawk Point to the newer TSMC N4P node.

The CPU now has 12 cores, spread across two CCX, one of which contains four Zen 5 cores sharing a 16 MB L3 cache, and the other eight Zen 5C cores sharing an 8 MB L3 cache. The two CCXs connect to the rest of the chip over Infinity Fabric. The rather large iGPU takes up the central region of the die. It is based on the RDNA 3.5 graphics architecture and features 8 workgroup processors (WGP) or 16 compute units (CU) worth 1024 stream processors (SP). Other key components include four render back-ends worth 16 ROPs and control logic. The GPU has its own 2 MB of L2 cache, that cushions transfers to the Infinity Fabric.

The NPU is the third major logic component of Strix Point. This second generation NPU by AMD is visibly larger than the one found in Phoenix. It is based on the more advanced XDNA 2 architecture and contains 32 AI engine tiles, talking to its own high-speed local memory and a control logic that interfaces with Infinity Fabric. This NPU is designed to meet and exceed the hardware requirements of Microsoft Copilot+ (40 TOPS) and provides a throughput of 50 TOPS.
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Intel Lunar Lake CPUs Set to Launch in Copilot+ PCs on Sept. 3
Intel will finally join the 'AI PC' party with its next-generation mobile platform.
By Josh Norem July 31, 2024
Intel unveiled its next-gen Lunar Lake mobile architecture at Computex in May, but it didn't get into specifics about pricing and specs. It was also unclear exactly when it would launch other than "later this year." Intel has now announced that everything will be revealed on Sept. 3, as the company will formally launch the entire range of Lunar Lake CPUs at the IFA 2024 conference in Berlin, Germany.

Intel said company executives will be on hand to unveil the next generation of what it now calls Core Ultra mobile processors. Since these are laptop CPUs, the presentation will include the Lunar Lake CPU models Intel is launching and an array of Lunar Lake laptops. These notebooks will theoretically be the first x86 "AI PCs" because the existing models certified by Microsoft as Copilot+ are all powered by Qualcomm's Arm-based Snapdragon X CPUs. AMD's Zen 5-based Ryzen 9 AI mobile CPUs are also awaiting certification and will join Intel in the arena of x86-based AI PCs shortly.
https://www.extremetech.com/computing/i ... -on-sept-3
Tadasuke

brief Intel Granite Rapids server CPUs table

Post by Tadasuke »

With Granite Rapids CPUs, Intel will soon match 2023 AMD Bergamo server CPUs in corecount (128c256t). Bergamo has 250 MHz higher clocks (2.25 GHz) and 140 watts lower TDP (360 watts) than GR. However, Intel's Redwood Cove P cores might have a slight performance per clock advantage to Zen 4 in Bergamo, so their average performance measured across many different tasks might be very similar. Their other server CPUs called Xeon 6 Sierra Forest fabricated on Intel 3 process use up to 288 Crestmont E-cores, but the 144 core variants will come out first.

Image
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Samsung Unveils LPDDR5X Chips That Are Just 0.65mm Thick
It's slimmed its memory modules down to allow for better heat dissipation in mobile devices.
By Josh Norem August 6, 2024
AI has begun encroaching on every tech world nook and cranny. A lot of the time, it's used as a buzzword without any identifiable benefit—like having AI in your thermal paste, for example. However, Samsung has unveiled ludicrously thin LPDDR5X memory that could provide a tangible benefit to all phones, not just those running some kind of AI application.

Samsung stated it had shrunk its 12nm LPDDR5X chips in both 12GB and 16GB capacities by 9%, so they're just 0.65mm thick. The advantages of this development are easy to understand: Thinner memory in a mobile device allows for more room inside a tightly packed phone, which could allow for more airflow inside the device.
Samsung LPDDR5X
https://www.extremetech.com/computing/s ... 65mm-thick
Tadasuke

Ryzen 9600X compared to multiple other CPUs

Post by Tadasuke »

Image

Image

Image

Image

As you can see, in some use cases the 9600X with 6 cores can match the 7700X with 8 cores, but actual performance heavily depends on the given task. RPCS3 runs demanding games slightly faster than on 7600X. It also has significantly better energy efficiency and lower temperatures which means easier and cheaper cooling and lower energy bills.

And here one comparison for the 9700X:
Image
Tadasuke

Ryzen 9700X performance in AI

Post by Tadasuke »

Ryzen 9700X seems to be between 30 and 50 % faster in TensorFlow workloads than Ryzen 7700 while consuming 2.5 - 3 % less energy. This is a substantial improvement in two years for about the same price and wattage.

Image

Image
Tadasuke

Zen4 Threadripper PRO vs Xeon-w9 for science and engineering

Post by Tadasuke »

Intel Sapphire Rapids Xeon-w9 3495X (launched Q1 2023) scales better with cores and threads in High Performance Linpack (HPL) than the highest-end AMD Zen 4 Threadripper Pro CPU, but very slightly worse than the 64C/128T TR 7985WX. Looks like Amdahl's Law doesn't work the same way for CPUs from different companies and foundries and even for particular CPUs!

Image

link to Puget Systems source article

Image

In HPCG benchmark, the 56-core Xeon matches the 96-core and the 64-core Threadripper Pro (the non-Pro is slower).

Image

In HPL-MxP(AI) benchmark, Xeon is 124% faster than the 96-core Threadripper Pro. Not a small difference. 38.4 TFlops according to the chart.

Image

However, in OpenFOAM, TR 7995WX is as much as 2.65x faster than w9 3495X, which is a very significant advantage.

Intel Sapphire Rapids server and workstation CPUs are decent for some particular workloads which suit them well. Sometimes faster than Zen 4. Or Granite Rapids faster than Zen 5. Threadripper Pro CPUs tend to be better than the cheaper non-Pro versions, especially for AI.
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

First Intel Arrow Lake Benchmarks: Mild Single-Core Gains, Multi-Core Regression
It's early hardware and a single benchmark, we know, but these results are starting to make people sweat a bit.
By Josh Norem August 16, 2024
Intel is preparing to launch its next-generation Arrow Lake desktop platform later this year, so the trickle of early benchmark results has begun. Given the company's current problems with Raptor Lake, hopes are high it can put all that nonsense behind it with Arrow Lake. It's on a new node with a new socket and an all-new, tile-based design. However, the first benchmark for Arrow Lake has been revealed, and it's close enough in performance to Raptor Lake to raise questions about Intel's upcoming platform.
https://www.extremetech.com/computing/f ... regression
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Samsung to Adopt High-NA Lithography Alongside Intel, Ahead of TSMC
The company will reportedly begin installing the advanced lithography tools later this year, or in early 2025.
By Josh Norem August 19, 2024
https://www.extremetech.com/computing/s ... ad-of-tsmc
High Numerical Aperture (NA) lithography machines just became available earlier this year. Intel was the first global foundry to purchase one in an attempt to one-up its rivals in the race to ever-shrinking nodes. Now, Samsung foundry has reportedly jumped on board the High-NA train; a new report states the company is joining Intel in the High-NA arms race. With Samsung now onboard, TSMC is the only major foundry still holding off on adopting the new technology.

Word of Samsung's plans comes from a site called Seoul Economic Daily, which includes a photo of Samsung executives alongside a High-NA lithography machine at ASML's headquarters in the Netherlands. It was previously reported that Intel had purchased all of ASML's High-NA machines for 2024, but perhaps that is no longer the case. The Korean reports say ASML made eight machines, and Intel purchased seven, leaving one for Samsung to put in its cart.
Tadasuke

8-core Zen 5 vs 8-core Zen 2 (5 years of AMD CPUs)

Post by Tadasuke »

If we look at AMD Ryzen 9700X max overclock at 6800 MHz, then we can see it achieves almost 61.4% higher multi-core score in Cinebench R15 benchmark (4781 cb) than AMD Ryzen 3700X max overclock at 5600 MHz (2963 cb). Both of these records used liquid nitrogen as the cooling agent, and the Zen 5 processor can operate at 1200 MHz higher maximum all-core all-thread clock frequency than the Zen 2 one.

My guess is, that Ryzen 7 9700X record will be beaten by 50-125 MHz in the coming months or years. Ryzen 7 3700X record might be beaten by 20-50 MHz. The MC score might rise to 4900 cb and 3000 cb, with Zen 5 performance advantage over Zen 2 [in that benchmark] rising to 63.33%. :-)
weatheriscool
Posts: 24486
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: GPU and CPU news and discussions

Post by weatheriscool »

Intel Launches Lunar Lake Core Ultra 200V CPUs for AI PCs
It's the first Intel CPU made with a TSMC node and the first to include embedded memory.
By Josh Norem September 3, 2024
Today, Intel is officially launching its Lunar Lake architecture at the Innovation For All (IFA) conference in Berlin. These CPUs announce the company's arrival to the AI PC market thanks to the CPUs' embedded NPU. They also mark a radical shift in design philosophy, with many notable firsts for the company. They're the first Intel CPUs with a compute tile made on a TSMC node, the first to include embedded memory like Apple's M-series chips, the first with its all-new P-core and E-core design, and the first CPU to feature its second-generation Battlemage GPU architecture. In other words, it's practically an entirely new CPU from the ground up.
https://www.extremetech.com/computing/i ... for-ai-pcs
Post Reply