What is “Navi2”?
It is the code-name of the new AMD GPU, the 2nd generation RDNA (Radeon DNA) GPU architecture, which itself replaced “Vega”, the last of the GCN architecture. Unlike the original Navi, which was a mid-range GPU, this is the long-awaited “big Navi” top-end GPU designed to battle nVidia’s finest 3000-series GPUs.
The Navi/RDNA architecture brought big changes from Vega/GCN, and Navi2 has been enhanced and optimised from Navi1, adding more features:
- Ray-Tracing (RT) Cores – similar to nVidia’s Turing/Ampere cards
- Infinity Cache – 128MB
- Smart-Access Memory – PCIe BAR resizing
- 6800 has Navi21 XL mid-range chip with 3/4 of the CUs enabled (60 out of 80)
Unlike the Vega1/2 GPUs, which were very much compute focused, Navi1/2 seem to be more gaming focused, with the few compute features already introduced: reduced wavefront size matching nVidia (32) and increased work-group sizes (1024). It is likely that AMD will launch HBM2 professional cards and, hopefully, Navi versions with tensor units (TSX) or matrix multipliers (tuMMA).
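To make the two sizes quoted above concrete, here is a minimal sketch of how many wavefronts one work-group occupies: the work-group size divided by the wave size, rounded up. The function name is hypothetical, and the 64-wide figure for older GCN/Vega hardware is an assumption added only for comparison; it does not come from this article.

```python
import math

def waves_per_work_group(work_group_size: int, wave_size: int) -> int:
    """Number of wavefronts needed to cover one work-group (rounded up)."""
    return math.ceil(work_group_size / wave_size)

# Navi1/2: 32-wide waves, work-groups of up to 1024 work-items (from the article).
navi_waves = waves_per_work_group(1024, 32)

# Assumption for comparison only: GCN/Vega with 64-wide wavefronts.
gcn_waves = waves_per_work_group(1024, 64)

print(f"Navi: {navi_waves} waves per work-group, GCN (assumed wave64): {gcn_waves}")
```

A narrower wave means more, smaller scheduling units per work-group, which is the same granularity nVidia uses for its warps.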
See these other articles on GP-GPU performance:
- ExtremeTech
- SiSoftware
Hardware Specifications
We are comparing the mid-range Radeon with previous generation cards and competing architectures with a view to upgrading to a mid-range high performance design. We have included the top-end previous generation cards that may be cheaper to obtain today.
GP-GPU Specifications | AMD Radeon RX 6800 (Navi2L) | AMD Radeon 5700XT (Navi1) | nVidia 3070 (Ampere) | nVidia 2080TI (Turing) | Comments
Arch / Chipset | RDNA2 / Navi 21 XL | RDNA1 / Navi 10 | Ampere / GA104 / SM8.6 | Turing / TU102 / SM7.5 | The 2nd of the Navi cores.
Cores (CU) / Threads (SP) | 60 / 3,840 [+50%] | 40 / 2,560 | 46 / 5,888 | 68 / 4,352 | The XL version has 50% more cores than Navi1.
Wave/Warp Size | 32 | 32 | 32 | 32 | Wave size matches nVidia.
Speed (Min-Turbo) (GHz) | 1.7 (1.815) | 1.6 (1.755) | 1.5 (1.725) | 1.35 (1.635) | About 6% faster base and 3% faster turbo than Navi1.
Power (TDP) | 250W [+11%] | 225W | 220W | 260W | Power has increased by only 11%.
ROP / TMU | 96 / 240 [+50%] | 64 / 160 | 96 / 184 | 88 / 272 | 50% more ROPs and TMUs than Navi1.
Ray-Tracing (RT) Cores | 60 | none | 46 | 68 | Navi2 brings 60 RT cores like nVidia.
Shared Memory (kB) | 64kB | 64kB | 48kB / 96kB per SM | 48kB / 96kB per SM | No change in shared memory.
Constant Memory | 8GB [2x] | 4GB | 64kB dedicated | 64kB dedicated | No dedicated constant memory but large.
Global Memory | 16GB GDDR6 16Gbps 256-bit | 8GB GDDR6 14Gbps 256-bit | 8GB GDDR6 14Gbps 256-bit | 11GB GDDR6 14Gbps 352-bit | No HBM at this level.
Memory Bandwidth (GB/s) | 512GB/s [+14%] | 448GB/s | 448GB/s | 616GB/s | Bandwidth is 14% higher than Navi1.
L1 Caches (kB) | 32kB / WG + 128kB/Array | 64kB/Array | 46x 128kB/SM | 68x 96kB/SM | L1 has been doubled (2x).
L2 Cache (MB) | 4MB | 4MB | 4MB | 5.5MB | L2 has not changed.
Maximum Work-group Size | 1024 / 1024 | 1024 / 1024 | 1024 / 2048 per SM | 1024 / 2048 per SM | Unchanged from Navi1.
FP64/double ratio | 1/16x | 1/16x | 1/32x | 1/32x | Ratio is 2x nVidia.
FP16/half ratio | 2x | 2x | 2x | 2x | Ratios are the same.
Price/RRP (USD) | 579 [+29%] | 450 | 499 | 1,000 | Price is 29% higher than Navi1.
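The bandwidth row can be cross-checked against the memory row: peak bandwidth is the per-pin data rate multiplied by the bus width, divided by 8 bits per byte. The sketch below uses only the GDDR6 figures listed for the two Radeon cards; the function and variable names are made up for illustration.

```python
def bandwidth_gb_s(data_rate_gbps: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s: data rate per pin x bus width / 8 bits per byte."""
    return data_rate_gbps * bus_width_bits / 8

# GDDR6 figures taken from the specification table.
rx6800_bw = bandwidth_gb_s(16, 256)     # RX 6800: 16Gbps, 256-bit
rx5700xt_bw = bandwidth_gb_s(14, 256)   # 5700XT: 14Gbps, 256-bit

# Relative uplift of the RX 6800 over the 5700XT, in percent.
uplift_pct = (rx6800_bw - rx5700xt_bw) / rx5700xt_bw * 100

print(f"RX 6800: {rx6800_bw:.0f} GB/s, 5700XT: {rx5700xt_bw:.0f} GB/s, +{uplift_pct:.0f}%")
```

Since both cards share a 256-bit bus, the whole gain comes from the faster 16Gbps memory.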
Disclaimer
This is an independent article that has not been endorsed nor sponsored by any entity (e.g. AMD). All trademarks acknowledged and used for identification only under fair use.
The article contains only public information (available elsewhere on the Internet) that was neither provided under NDA nor embargoed. At publication time, the products have not been directly tested by SiSoftware and thus the accuracy of the benchmark scores cannot be verified; however, they appear consistent and do not appear to be false/fake.
Processing Performance
We are testing OpenCL performance using the latest SDK / libraries / drivers from both AMD and the competition.
Results Interpretation: Higher values (GOPS, MB/s, etc.) mean better performance.
Environment: Windows 10 x64, latest AMD and nVidia drivers. Turbo / Boost was enabled on all configurations.
Memory Performance
We are testing OpenCL performance using the latest SDK / libraries / drivers from both AMD and the competition.
Results Interpretation: For bandwidth tests (MB/s, etc.) higher values mean better performance; for latency tests (ns, etc.) lower values mean better performance.
Environment: Windows 10 x64, latest AMD and nVidia drivers. Turbo / Boost was enabled on all configurations.
SiSoftware Official Ranker Scores
Final Thoughts / Conclusions
Summary: Much, much faster than old Navi1 and stellar FP64 performance: 9.5/10
Ever since the release of Navi1 (“little-Navi”) and its revolutionary new architecture, AMD fans everywhere have eagerly awaited the release of “big-Navi” that can bring the fight to nVidia. A year or so later, we have Navi2: pretty much twice Navi1, plus ray-tracing but without tensor/tile units. It is everything that was expected, but in some ways perhaps we expected more?
With 50% more CU/SP and faster clocks, the Navi2L chip in the 6800, the little brother of “big-Navi”, does not disappoint: in compute FP16/32 tasks it is at least 60% faster than Navi1 and in many cases much, much faster. Unlike its bigger brother, it has no problem keeping up with nVidia’s latest competition (Ampere 3070).
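For context, the gain expected on paper from the extra CUs and higher clock can be estimated from the specification table. This is a rough sketch that assumes each stream processor retires one fused multiply-add (2 FLOPs) per clock at the listed turbo clock; that assumption and the function name are not from the article, and real kernels will behave differently.

```python
def peak_fp32_tflops(stream_processors: int, clock_ghz: float) -> float:
    """Theoretical peak FP32 throughput, assuming 1 FMA (2 FLOPs) per SP per clock."""
    return stream_processors * 2 * clock_ghz / 1000

# SP counts and turbo clocks taken from the specification table.
rx6800 = peak_fp32_tflops(3840, 1.815)
rx5700xt = peak_fp32_tflops(2560, 1.755)

# Expected uplift on paper, in percent.
speedup_pct = (rx6800 / rx5700xt - 1) * 100

print(f"RX 6800: {rx6800:.2f} TFLOPS, 5700XT: {rx5700xt:.2f} TFLOPS, +{speedup_pct:.0f}%")
```

On these paper numbers the expected gain is about 55%, so a measured gain of 60% or more would suggest that something beyond CU count and clock speed is contributing, though this sketch cannot say what.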
Compute FP64 performance is much better thanks to a more favourable FP64:FP32 ratio (1/16x vs. nVidia’s 1/32x) that allows it to really put the boot into nVidia, and if those are the kinds of algorithms you run, Navi2 is your choice.
Memory-wise, it seems the meager bandwidth improvement over Navi1 is OK for Navi2L, which seems to have sufficient resources; it does not seem OK for Navi2X, as we saw in the 6900XT review. Having 2x more memory (16GB) allows much bigger kernels to run, and again shows that the 6900XT should have had more.
The price (USD 579) is 29% higher than the old 5700XT (USD 450), but that is perhaps a sign of the times. Power/TDP is just 11% higher (250W), which shows how much Navi2 has improved efficiency over Navi1.
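The efficiency and value argument can be put into rough numbers by dividing the relative theoretical compute (SP ratio times turbo clock ratio) by the TDP and price ratios from the specification table. This is only a paper estimate under the assumption that throughput scales with SP count and clock; it is not based on measured scores, and the variable names are illustrative.

```python
# Figures from the specification table: (SPs, turbo GHz, TDP in W, price in USD).
rx6800 = (3840, 1.815, 250, 579)
rx5700xt = (2560, 1.755, 225, 450)

# Relative theoretical compute, assuming throughput scales with SP count x clock.
relative_compute = (rx6800[0] / rx5700xt[0]) * (rx6800[1] / rx5700xt[1])

# Normalise by the power and price increases.
per_watt = relative_compute / (rx6800[2] / rx5700xt[2])
per_usd = relative_compute / (rx6800[3] / rx5700xt[3])

print(f"Compute per watt: {per_watt:.2f}x, compute per USD: {per_usd:.2f}x vs 5700XT")
```

Under these assumptions the RX 6800 offers roughly 1.4x the theoretical compute per watt and 1.2x per dollar of the 5700XT.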
In summary, Navi2L in the mid-range performs much better (compute-wise) for the money, and also against its competition (3070). We hear that it does even better in games! If you have Navi1, it is time to upgrade. AMD has done well!
To see how its “big-brother” Navi2X performs, please see our AMD Radeon RX 6900XT (RDNA2, Navi2X) Review & Benchmarks – GPGPU Performance article.