What is “Alchemist”?
Is is the 2nd generation Intel discrete (DG2) graphics – based on the Xe arch that originally powered the integrated graphics of mobile (Xe-LP) processors from Gen 10 (“TigerLake” TGL) onwards. Intel did launch a limited desktop DG1 product as well as a limited server/workstation Xe-HP compute version.
With DG2 – Intel is back with a full stack, from low-end, middle and high-end – as well as workstation/server (data as per Tom’s Hardware : Intel Arc Alchemist: Release Date, Specs, Everything We Know):
- Up to 512CU / 4,096 SP
- A300 series 128CU / 1,024SP
- A500 series 384CU / 3,072SP
- A700 series 512CU / 4,096SP
- Up to 16GB GDDR6
- A300? series 6GB GDDR6 96-bit 14Gbit/s ~192GB/s
- A500? series 12GB GDDR6 192-bit 16Gbit/s ~384GB/s
- A700? series 16GB GDDR6 256-bit 16Gbit/s ~512GB/s
- 7nm TSMC process
- OpenCL 3.0, DirectX 12 Ultimate support
- FP16 half-floating point support
- NO FP64 double floating-point support (!)
- Tensors (“Matrix Engines” aka XMX) for Deep Learning
- FP16 and Int8 support (but no FP32/FP64 support)
- OpenCL extension – will need to be coded specifically to use it
The specification look good – we have competitive specs, although we seem to be missing native FP64 support, just as on the mobile/desktop previous Xe. We do gain tensors (matrix multipliers) and support for FP16/Int8 that should help low-precision neural networks. For high-precision workloads (FP64), it seems AMD is your only option – unless paying nVidia for professional graphics…
ARC “Alchemist” A380 GP-GPU Performance Benchmarking
In this article we test GP-GPU core performance; please see our other articles on:
- GP-GPU
Hardware Specifications
We are comparing the “entry-level” range A380 GP-GPU with the competition with a view to upgrading an inexpensive system.
Specifications | Intel Arc A380 (DG2) |
nVidia GeForce GTX 1660 Ti | nVidia GeForce RTX 3050 | AMD Radeon RX 6500 XT | Comments | |
Arch / Chipset | Xe2 (EV12+) | TU116 (Turing) | GA106 (Ampere) | Navi II 24 XT | Gen 12+ graphics – the latest. | |
Cores (CU) / Threads (SP) | 128 / 1,024 | 12 / 1,536 | 20 / 1,280 | 16 / 1,024 | Similar SPs count as competition | |
Tensor (TU) Cores | 16 | – | 20 | – | Now with extra tensors on top! | |
Speed (Min-Turbo) |
~2.45GHz |
1.5-1.78GHz | 1.55-1.78GHz | 2.3-2.6GHz | Turbo speed is pretty high. | |
Power (TDP) | ~75W | ~120W | ~130W | ~107W | TDP is lowest… on paper at least | |
ROP / TMU | 32 / 64 | 48 / 96 | 32 / 80 | 32 / 64 | ROPs and TMUs also increased. | |
Shared Memory |
64kB |
48kB | 96kB | 64kB | Standard shared memory size. | |
Constant Memory |
2.4GB | 64kB | 64kB | 2GB | Using global as const sadly like AMD. | |
Global Memory | 6GB GDDR6 96-bit |
6GB GDDR6 192-bit | 8GB GDDR6 128-bit | 4GB GDDR6 64-bit | Quite narrow bus but decent memory | |
Memory Bandwidth |
~192GB/s | ~288GB/s | ~224GB/s | ~144GB/s | Somewhat low bandwidth | |
L1 Caches | 64kB | 64kB | 128kB | 128kB | L1 could be larger | |
L2 / L3 Cache | 1MB | 1.5MB | 2MB | 1MB / 16MB | L2 seems quite small | |
Maximum Work-group Size |
1,024 x 1,024 | 1,024 x 1,024 | 1,024 x 1,024 | 1,024 x 1,024 | Same workgroup size | |
FP64/double ratio |
No! | Yes 1/32x | Yes 1/64x | Yes 1/16x | No FP64 support? | |
FP16/half ratio |
Yes, 2x | Yes, 2x | Yes, 2x | Yes, 2x | Same 2x ratio | |
Price / RRP (USD) |
~$199? |
~$280 | ~$249 | ~$199 | Keen price |
Disclaimer
This is an independent review (critical appraisal) that has not been endorsed nor sponsored by any entity (e.g. Intel, etc.). All trademarks acknowledged and used for identification only under fair use.
The review contains only public information and not provided under NDA nor embargoed. At publication time, the products have not been directly tested by SiSoftware but submitted to the public Benchmark Ranker; thus the accuracy of the benchmark scores cannot be verified, however, they appear consistent and pass current validation checks.
And please, don’t forget small ISVs like ourselves in these very challenging times. Please buy a copy of Sandra if you find our software useful. Your custom means everything to us!
Native OpenCL Performance
We are testing both OpenCL performance using the latest SDK / libraries / drivers from both Intel and competition.
Results Interpretation: Higher values (GOPS, MB/s, etc.) mean better performance.
Environment: Windows 10 x64, latest Intel graphics drivers. Turbo / Boost was enabled on all configurations.
SiSoftware Official Ranker Scores
- Intel Arc (Alchemist) A380 Graphics
- Intel Arc “Alchemist” A370M Mobile Graphics + Intel Iris Xe Internal Graphics
Final Thoughts / Conclusions
Summary: OK for the price, nothing special: 7/10
Intel is back doing discrete graphics – yay! There are more options at the low end – yay! No need to pair discrete AMD or nVidia GP-GPU with Intel CPU/Mainboard – err… not so fast…
nVidia has nothing to worry about – here we’ve only tested the low-end – and despite being more expensive it is definitely worth it. CUDA performance is the most consistent and even the “old” Turing has no problem dispatching both Intel ARC and AMD “Navi II” competition. Again, OpenCL support still leaves a lot to be desired (both Intel and AMD) and DirectX is really for games, not optimised for compute. Still, the A380 is competitive and should improve with drivers and optimisations – should Intel decide to support ISVs like ourselves – but your 2nd choice should be AMD as Navi has been around much longer and the drivers are mature.
As with integrated graphics versions, we lack FP64 support though at this (entry) level it does not matter – though for the top-end (Xe-HP) lack of it would be quite disastrous. Fortunately, nVidia has hobbled its consumer cards’ FP64 at 1/64x rate which makes it pretty much unusable… AMD is still your choice here if you don’t have the money (who does these days?) and need FP64 native support.
Tensor (matrix multiplier) support should improve performance (GEMM, CNN/RNN neural-networks, image processing) – but is limited to FP16/Int8 and won’t help standard FP32 performance. We will likely have to wait for updated tensors in future versions – just as nVidia has done with “Ampere”.
Perhaps it was too much to expect a nVidia/AMD killer – but all in all it is a decent effort. YMMV (Your Mileage May Vary).
Summary: OK for the price, nothing special: 7/10
Please see our other articles on:
- GP-GPU
Disclaimer
This is an independent review (critical appraisal) that has not been endorsed nor sponsored by any entity (e.g. Intel, etc.). All trademarks acknowledged and used for identification only under fair use.
The review contains only public information and not provided under NDA nor embargoed. At publication time, the products have not been directly tested by SiSoftware but submitted to the public Benchmark Ranker; thus the accuracy of the benchmark scores cannot be verified, however, they appear consistent and pass current validation checks.
And please, don’t forget small ISVs like ourselves in these very challenging times. Please buy a copy of Sandra if you find our software useful. Your custom means everything to us!