What is “Zen4” (Ryzen 7000)?
AMD’s Zen4 (“Raphael”) is the 4rd generation ZEN core – aka the new 7000-series of CPUs from AMD – that brings brand new features like AVX512 ISA (instruction set support), DDR5 and PCIe5. These do require a brand new platform (AM5) almost a decade since the current AM4 platform was launched before even the 1st generation Ryzen. With any luck, it will remain for the next 4 or even more CPU generations, unlike the 2 generation support on competitor (Intel) platform.
Zen4 contains only big/P(erformance) cores and it is not a hybrid design. It remains to be seen if AMD will launch such hybrid (big/LITTLE) products that, in our opinion, are too problematic on desktop platforms for the benefits they bring. Even on mobile platforms where efficiency is a top priority – workloads do not easily lend to a hybrid design despite huge work done on the Windows scheduler for Windows 11. In this regard, a non-hybrid design like Zen4 is very much preferred.
AVX512 is a huge boost for compute performance as we’ve seen on Intel since SKL-X (Skylake-X). There is a reason it exists + all the extensions (IFMA, VNNI, VAES, etc.) and it is not unexpected that even basic usage can bring up to 100% (2x) performance improvement and even higher with specific instructions. While originally CPUs would reduce clocks due to the power generated – this has pretty much been mitigated in modern designs. Even Centaur (before Intel bought them) had AVX512-enabled (LITTLE) cores.
While here AMD has implemented it as 2x 256-bit ops (similar to previous AVX2/FMA3 in Zen1/1+/2 implemented as 2x 128-bit) – we still benefit from 2x more registers + 2x wider registers (4x overall), arguably better instruction specification, optimised extensions (IFMA, VNNI, VAES, etc.) that overall can still build up to a big improvement over old AVX2/FMA3.
- 5nm process (TSMC) for CCX (vs. 7nm on Zen3) for better efficiency and clocks
- 6nm process (TSMC) for I/O hub (vs. 12nm for Zen3) for better memory speeds
- claimed 13% IPC increase vs. Zen3 + clock increase uplift => ~29% total uplift vs. Zen 3
- AVX512 instruction support, with potential 100%+ improvement in optimised workloads
- Executed as 2x 256-bit (not true 512-bit like Intel) but still many benefits over AVX2/FMA3
- Specific AVX512 extensions (IFMA, VNNI, VAES, etc.) can bring well over 100% improvement
- DDR5 support up to 5200Mt/s (official) for much higher memory bandwidth vs. DDR4 Zen3
- Unofficial support for at least 6400Mt/s with XMP3/EXPO profiles
- AMD says 6000Mt/s is the “sweet-spot” for performance/value
- 1MB L2 per core (2x vs. 512kB on Zen3)
- Standard L3 is the same 32MB, V-Cache the same 96MB
- PCIe5 support, up to 24 lanes (2x bandwidth vs. PCIe4)
- Still up to 2 chiplets (at launch) thus up to 2x 8C big/P cores (16C/32T on 7950X)
- Much higher both base and turbo speeds in most variants, e.g. 7950X
- Higher base 4.5GHz of standard CCX (vs. 3.4GHz on 5950X +32% clock uplift)
- Higher base 4.2GHz of V-Cache CCX (vs. 3.4GHz on 5950X +24% clock uplift)
- Higher turbo 5.7GHz (vs. 4.9GHz on 5950X +17% clock uplift)
- TDP has increased to 120W (vs. 105W on 5950X) thus 14% higher
- Turbo (PPT aka PL2) around 160W (vs. 142W on 5950X) thus 14% higher
- Note that other models (e.g. 7700X) have kept the same TDP/Turbo
- Built-in Radeon Graphics (RDNA2) core
- 2CU / 128SP 400-2.2GHz cores for very basic graphics
What is the new Zen4-3D V-Cache (Ryzen 7000-3D)?
It is a version of Zen4+ chiplet/CCX with vertically stacked (thus the 3D(imensions) moniker) L3 cache that is 3x larger (thus 96MB). The latency is expected to be slightly higher (+4 clock) and bandwidth also slightly lower (~10% less).
Originally, AMD launched the asymmetric/hybrid (VCache CCX + Standard CCX) dual CCX processors (7950X-3D, 7900X-3D) – likely to benefit from early adopters. Now we finally have the cheaper, single-VCache CCX version (7800X-3D).
Similar to Zen3-3D – the clocks (Base) of the cores on the V-Cache CCX (5.25GHz) are lower than the standard CCX (5.7GHz).
To upgrade from standard Zen4 or not?
Except the new L3 3D/V-Cache cache, there are no other major changes:
- Minor stepping update (S2 vs. S0) with no major fixes
- Base and Turbo clocks of standard CCX are the same as original Zen4 (e.g. 7950X)
- Base clocks of V-Cache CCX are lower than original Zen4, thus raw compute power is lower
- AMD provided Windows driver to migrate threads to the “proper” CCX while parking other CCX
- Games scheduled on V-Cache/slow CCX
- Normal workloads scheduled on standard/fast CCX
- This assumes the workload uses 16-threads or less
It all depends on the data set(s) of the workload(s) you are running:
- Data sets that either entirely fit or can be significantly served in the 96MB L3 cache – will see significant uplift
- Inter-core/thread data transfers that can entirely fit in the 3D L3 cache – will see significant uplift
- Streaming workloads or with very large data sets may not show uplift but be slower due to lower base/turbo clocks
- Compute heavy algorithms with small data sets will be slower due to lower base/turbo clocks
Review
In this article we test CPU core performance; please see our other articles on:
- AMD Ryzen 9 7950X (Zen4 Raphael) Review & Benchmarks – AVX512 Top-End Domination
- AMD Ryzen 7 7700X (Zen4 Raphael) Review & Benchmarks – AVX512 Mainstream Performance
- AMD Ryzen 5 7600X (Zen4 Raphael) Review & Benchmarks – Value AVX512 Performance
Hardware Specifications
We are comparing the top-range Ryzen 9 7000-series (Zen4 3D) with standard Ryzen 9 and competing architectures with a view to upgrading to a top-range, high performance design.
CPU Specifications | AMD Ryzen 7 7800X-3D 8C/16T (Raphael-3D) |
AMD Ryzen 7 5800X-3D 8C/16T (Vermeer-3D) | AMD Ryzen 7 7800X 8C/16T (Raphael) | Intel Core i7 13700K 8C+8c/24T (Raptor Lake) | Comments | |
Cores (CU) / Threads (SP) | 8C / 16T | 8C / 16T | 8C / 16T | 8C+8c / 24T | Core counts remain the same. | |
Topology | 3D/CCX 8C + I/O hub | 3D/CCX 8C + I/O hub | CCX 8C + I/O hub | Monolithic die | Same topology but asymmetric | |
Speed (Min / Max / Turbo) (GHz) |
4.2 [+23%] – 5.0GHz [+11%] | 3.4 – 4.5GHz | 4.5 – 5.7GHz | 3.4 – 5.4GHz / 2.5 – 4.2GHz | Base up 23%, turbo 11% | |
Power (TDP / Turbo) (W) |
120 – 253W [+14%] | 105 – 135W | 105 – 142W | 125 – 253W | TDP is 14% higher | |
L1D / L1I Caches (kB) |
8x 32kB 8-way / 8x 32kB 8-way | 8x 32kB 8-way / 8x 32kB 8-way | 8x 32kB 8-way / 8x 32kB 8-way | 8x 64kB + 8x 32kB / 8x 32kB + 8x 48kB | No changes to L1 | |
L2 Caches (MB) |
8x 1MB (8MB) 8-way | 8x 512kB (4MB) 8-way | 8x 1MB (8MB) 8-way | 8x 2MB + 2x 4MB [24MB] | L2 is 2x larger | |
L3 Caches (MB) |
96MB 16-way exclusive |
96MB 16-way exclusive | 32MB 16-way exclusive | 20MB 16-way | L3 is the same | |
Mitigations for Vulnerabilities | BTI/”Spectre”, SSB/”Spectre v4″ hardware | BTI/”Spectre”, SSB/”Spectre v4″ hardware | BTI/”Spectre”, SSB/”Spectre v4″ hardware | BTI/”Spectre”, SSB/”Spectre v4″ hardware | No new fixes required… yet! | |
Microcode (MU) |
A60F12-1203 | A20F12-1205 | A60F12-03 | 0B0671-10E | The latest microcodes have been loaded. | |
SIMD Units | 2x 256-bit (512-bit total) AVX512+ | 256-bit AVX/FMA3/AVX2 | 2x 256-bit (512-bit total) AVX512+ | 256-bit AVX/FMA3/AVX2 | Same SIMD widths | |
Price/RRP (USD) |
$449 |
$449 | $399 |
$419 | Same price as old 3D |
Disclaimer
This is an independent review (critical appraisal) that has not been endorsed nor sponsored by any entity (e.g. AMD, etc.). All trademarks acknowledged and used for identification only under fair use.
Native Performance
We are testing native arithmetic, SIMD and cryptography performance using the highest performing instruction sets. Zen4 supports all modern instruction sets including AVX2/FMA3 and crypto SHA HWA but also AVX-512 and extensions (IFMA, VNNI, VAES, etc.)
Results Interpretation: Higher values (GOPS, MB/s, etc.) mean better performance.
Environment: Windows 11 x64 (21H2), latest AMD and Intel drivers. 2MB “large pages” were enabled and in use. Turbo / Boost was enabled on all configurations. All mitigations for vulnerabilities (Meltdown, Spectre, L1TF, MDS, etc.) were enabled as per Windows default where applicable.
SiSoftware Official Ranker Scores
- AMD Ryzen 9 7950X-3D 16-Core/32-Thread
- AMD Ryzen 9 7900X-3D 12-Core/24-Thread
- AMD Ryzen 7 5800X-3D 8-Core/16-Thread
Final Thoughts / Conclusions
Summary: The 8-Core King Returns: 9/10
Even with the original 3D V-Cache Zen3 (5800X-3D) – the biggest issue was that the standard Zen3 was too good/performant and the huge L3 cache only made a difference in some workloads (notably games!). The standard 32MB L3 CCX cache is already large enough and fast enough especially considering the competition (Intel). Still, the 3D model had 3x (three times) larger L3 that can be a big asset.
Unlike the multi-CCX designs with asymmetric/hybrid L3 cache of different sizes – the 7800X-3D brings back “normality” with a single, unified 3D-VCache. No need for special drivers for games and other applications to schedule threads on the “right” CCX for best performance and turn off other cores/CCX for best power efficiency.
Due to much higher bandwidth on AM5/DDR5 platform (e.g. standard DDR5-6500 memory) vs. old AM4/DDR4 (e.g. common DDR4-3200 memory) – Zen4-3D takes less of a hit going to main memory than Zen3-3D though L3 bandwidth is still 10x higher than DDR5.
In the end – it all depends on your workloads: if you game regularly and thus want a 3D/V-Cache Zen4 but also regularly need more cores/threads for other tasks than a (future) 7800X-3D can provide, then these 7950X-3D/7900X-3D could work for you.
Otherwise you’re better off with the standard Zen4 (7700X), or if you are still on AM4 platform, the Zen3-3D (5800X-3D) has come down in price and is still very much competitive.
Please see the other reviews on other Ryzen variants:
- AMD Ryzen 9 7950X (Zen4 Raphael) Review & Benchmarks – AVX512 Top-End Domination
- AMD Ryzen 7 7700X (Zen4 Raphael) Review & Benchmarks – AVX512 Mainstream Performance
- AMD Ryzen 5 7600X (Zen4 Raphael) Review & Benchmarks – Value AVX512 Performance
Disclaimer
This is an independent review (critical appraisal) that has not been endorsed nor sponsored by any entity (e.g. AMD, etc.). All trademarks acknowledged and used for identification only under fair use.