Who is “Zhaoxin”?
It is a joint venture of VIA Technologies that now effectively owns Centaur Technologies. Centaur/IDT used to make the “WinChip” – one of the few x86 compatible designs. After VIA aquisition, Centaur/VIA then produced the C3, C7 and finally Nano. After Zhaoxin formation, the designs continued with minor modifications of the “Nano” core with the ZX family of CPUs – however Zhaoxin now has brought out new designs:
- ZX-C “Zhangiang” (2016): 4-8C 2GHz 28nm ~45W
- KX-5000 “Wudaokou” (2017): 4-8C 2GHz 28nm ~45W
- KX-6000 “Lujiazui” (2019): 4-8C 3GHz 16nm ~70W
- KX-7000 due to launch this year (2021)
What is “ZHAOXIN KaiXian”?
It is 5th generation CPU/SoC released by Zhaoxin/Centaur released in 2019 that includes modern new features and functionality common in competitive designs e.g. AMD/Intel. Designed for the internal Chinese market, it reduces dependencies on foreign designs and also includes custom local features, e.g. Chinese National Standard cryptographic functions (SM3/hash, SM4/block ciphers, etc.). Its relatively simplicity and cost are also advantages against much more expensive imported western CPUs from AMD/Intel.
The latest KX-6000 arch is made on a relatively modern process and includes many modern features:
- 5th Gen micro-arch (derived of VIA/Centaur Nano) 16nm FinFET
- AVX and SSE 4.2/4.1, SSSE3, SSE2 instruction sets
- AES, SHA + SM3/SM4 HWA (hardware acceleration)
- 8C/8T “little” core design (no SMT) up to 3GHz (no turbo)
- 32kB L1D/L1I 8-way set caches (similar to competition)
- 8MB L2 16-way set cache (similar to L3 cache in competition)
- PCIe 3.0
- 2x channel DDR4 memory controller up to 2667MHzm 64MB total memory
- Virtualization support (VT-d)
- TDP / Power ~70W
- Price inc. motherboard ~$500 (internal China price)
Unfortunately for Zhaoxin, while 8-cores (even little ones) was quite something in 2016, with Intel keeping all desktop-class CPUs at 4C/8T (with Skylake and friends) – the recent AMD/Intel “war” has seen the number of CPU cores more than double – with Intel at 10C/20T and AMD at an incredible 16C/32T! Intel Atom “Denverton” (DVT) even goes as high as 16-cores (C3858 12C, C3958 16C) thus more cores will be needed in the future to stay competitive.
At 16nm the TDP for those 8 “little” cores is a somewhat large 70W that will need to reduce significantly if number of cores doubles or clock speed increases. Current designs have no dynamic over-clocking functionality (Turbo) which means they cannot increase speed over rated (3GHz top) that while decent – will also need to increase significantly for it to remain competitive.
AES & SHA hardware-acceleration makes it a good fit for a network appliance (though TDP is somewhat large) and the large number of cores allows a greater number of (network card) queues to be used (RSS). Virtualization support means it could serve as high-number of VM/low-compute/utilisation host similar to Intel’s Atom “Denverton” family.
On the instruction front, while we do have AVX (i.e. 4th Gen Core “Haswell”-class), we are missing further AVX2, FMA3 updates that have become pretty widely used since then. The next core (KX-7000) is meant to support even AVX512 which is not impossible considering Intel’s own modified Atom cores of “Phi” (Larabee) accelerators have done so. However, lets remember Intel has canned the project so it has not exactly been successful. However, Zhaoxin has different priorities and a 32-core/AVX512 KX-7000 may be a winner.
CPU (Core) Performance Benchmarking
In this article we test CPU core performance; please see our other articles on:
- CPU
Hardware Specifications
We are comparing the KaiXian with somewhat older competing architectures with a view of whether it has caught up in performance.
CPU Specifications | ZHAOXIN KaiXian KX-U6780A (8C/8T) | Intel Core i5-7500 (KBL) 4C/4T | AMD Ryzen 3 PRO 1300 (Zen) 4C/4T | Intel Atom C3758 (DVT) 8C/8T | Comments | |
Cores (CU) / Threads (SP) | 8C / 8T | 4C/4T | 4C/4T | 8C/8T | More cores for your money | |
Speed Rated/Turbo (GHz) |
2.7GHz | 3.4-3.8GHz | 3.5-3.7GHz | 2.2GHz | No turbo, clock could be higher | |
Power TDP/Turbo (W) |
70W | 65W | 65W | 25W | TDP somewhat high. | |
L1D / L1I Caches | 8x 32kB / 8x 32kB | 4x 32kB / 4x 32kB | 4x 32kB / 4x 64kB | 8x 32kB / 8x 24kB | L1 caches are competitive | |
L2 Caches | 8MB 16-way | 4x 256kB | 4x 512kB | 16MB 16-way | L2 is good size but could be larger. | |
L3 Caches | n/a | 6MB | 8MB | n/a | No L3 like Atom | |
Microcode (Firmware) | MU069E09-B4 | MU8F0101-38 | No indication of microcode | |||
Special Instruction Sets |
AVX?, SSE4/3/2 | AVX2/FMA, SHA | AVX2/FMA | SSE4/3/2 | Meant to support AVX | |
SIMD Width / Units |
128-bit | 256-bit | 256-bit | 128-bit | Widest SIMD units ever |
Disclaimer
This is an independent article that has not been endorsed or sponsored by any entity (e.g. Zhaoxin/Centaur). All trademarks acknowledged and used for identification only under fair use.
The article contains only public information (available elsewhere on the Internet) and not provided under NDA nor embargoed. At publication time, the products have not been directly tested by SiSoftware and thus the accuracy of the benchmark scores cannot be verified; however, they appear consistent and do not appear to be false/fake.
Native Performance
We are testing native arithmetic, SIMD and cryptography performance using the highest performing instruction sets. While the KaiXian does not support the very latest instruction sets (e.g. AVX512, VAES, AVX2/FMA) it does support older instruction sets (AVX, SSE4, SSSE3, SSE2).
Results Interpretation: Higher values (GOPS, MB/s, etc.) mean better performance.
Environment: Windows 10 x64, latest drivers. 2MB “large pages” were enabled and in use. Turbo / Boost was enabled on all configurations.
SiSoftware Official Ranker Scores
Final Thoughts / Conclusions
Summary: An interesting SoC, hoping for better: 7/10
Centaur/VIA will always have a special place for us (SiSoftware) as we have been working with them for almost 20 years (!) Despite their size they added their own mark on the x86 world (e.g. crypto acceleration with PadLock, etc.) and it is great to see Zhaoxin continuing the proud tradition. Here’s to the next 20 years!
The soon to be replaced KX-6000 series is an interesting SoC that – at launch (2017) – was likely the only real 8-core CPU you could get at non-ridiculous prices (e.g. Intel’s HEDT platform). But now desktop CPUs from AMD/Intel with 10-16C 20-32T are available and even Atom “Denverton” DVT goes up to 16C – thus many more cores are needed.
In testing the KX cores perform reasonably well, and can beat Atom DVT with the same number of cores; considering in how many network appliances (firewalls, routers, gateways, etc.), storage (NAS, low-cost SAN) and low-cost servers Atom is used – there is a market there for the KX if the price is right. No, it cannot beat modern AMD/Intel cores (even many years older) – and systems may still be more cost effective.
About the only issue is the somewhat high TDP (70W) which does not compare well with Atom DVT (25W) – especially as the latter includes 2.5/10Gbe LAN that are known to use significant power. Thus it may not be possible to build passive network/storage devices without fan cooling (e.g. for homes/offices) while for rack usage the higher power usage results in higher costs.
If Zhaoxin/Centaur can improve SIMD performance in the future KX and add more cores (16? 24? 32?) it can pretty much finish Intel’s Atom which would be a big achievement in itself. Here’s to the new low-cost king!
In summary – recommended!
Please see our other articles on:
- Intel Iris Plus G7 Gen 12 XE TigerLake ULV (i7-1165G7) Review & Benchmarks – GPGPU Performance
- AVX512-IFMA(52) Improvement for IceLake and TigerLake
- Intel Core Gen 10 IceLake ULV (i7-1065G7) Review & Benchmarks – Cache & Memory Performance
Disclaimer
This is an independent article that has not been endorsed or sponsored by any entity (e.g. Zhaoxin/Centaur). All trademarks acknowledged and used for identification only under fair use.
The article contains only public information (available elsewhere on the Internet) and not provided under NDA nor embargoed. At publication time, the products have not been directly tested by SiSoftware and thus the accuracy of the benchmark scores cannot be verified; however, they appear consistent and do not appear to be false/fake.