环境部署
背景
因nvidia 受限 公司转华为推理服务器 AT800(3000) + 昇腾 ,将推出一系列文章 ,记录过程。
服务器 硬件资源
系统:
shell
lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 20.04.6 LTS
Release: 20.04
Codename: focal
shell
sudo dmidecode -t system
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.2.0 present.
Handle 0x0001, DMI type 1, 27 bytes
System Information
Manufacturer: RCSIT
Product Name: AT800 (Model 3000)
Version: To be filled by O.E.M.
Serial Number: 2102313NMSP0N6100250
UUID: 3fe02c62-f82e-a139-ec11-afedc88ff166
Wake-up Type: Power Switch
SKU Number: To be filled by O.E.M.
Family: To be filled by O.E.M.
Handle 0x0005, DMI type 32, 11 bytes
System Boot Information
Status: No errors detected
CPU:
查看CPU信息
shell
lscpu
Architecture: aarch64
CPU op-mode(s): 64-bit
Byte Order: Little Endian
CPU(s): 64
On-line CPU(s) list: 0-63
Thread(s) per core: 1
Core(s) per socket: 32
Socket(s): 2
NUMA node(s): 2
Vendor ID: 0x48
Model: 0
Stepping: 0x1
CPU max MHz: 2600.0000
CPU min MHz: 200.0000
BogoMIPS: 200.00
L1d cache: 4 MiB
L1i cache: 4 MiB
L2 cache: 32 MiB
L3 cache: 64 MiB
NUMA node0 CPU(s): 0-31
NUMA node1 CPU(s): 32-63
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Spec store bypass: Not affected
Vulnerability Spectre v1: Mitigation; __user pointer sanitization
Vulnerability Spectre v2: Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected
Flags: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm
Vendor ID:0x48 // 鲲鹏 920
双核
查看具体国产CPU信息
shell
sudo dmidecode -t processor
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.2.0 present.
Handle 0x001B, DMI type 4, 48 bytes
Processor Information
Socket Designation: CPU01
Type: Central Processor
Family: ARM
Manufacturer: HiSilicon
ID: 10 D0 1F 48 00 00 00 00
Signature: Implementor 0x48, Variant 0x1, Architecture 15, Part 0xd01, Revision 0
Version: HUAWEI Kunpeng 920 5220
Voltage: 0.9 V
External Clock: 100 MHz
Max Speed: 2600 MHz
Current Speed: 2600 MHz
Status: Populated, Enabled
Upgrade: Unknown
L1 Cache Handle: 0x0018
L2 Cache Handle: 0x0019
L3 Cache Handle: 0x001A
Serial Number: E4E03DD500702C0C
Asset Tag: To be filled by O.E.M.
Part Number: To be filled by O.E.M.
Core Count: 32
Core Enabled: 32
Thread Count: 32
Characteristics:
64-bit capable
Multi-Core
Execute Protection
Enhanced Virtualization
Power/Performance Control
Handle 0x001F, DMI type 4, 48 bytes
Processor Information
Socket Designation: CPU02
Type: Central Processor
Family: ARM
Manufacturer: HiSilicon
ID: 10 D0 1F 48 00 00 00 00
Signature: Implementor 0x48, Variant 0x1, Architecture 15, Part 0xd01, Revision 0
Version: HUAWEI Kunpeng 920 5220
Voltage: 0.9 V
External Clock: 100 MHz
Max Speed: 2600 MHz
Current Speed: 2600 MHz
Status: Populated, Enabled
Upgrade: Unknown
L1 Cache Handle: 0x001C
L2 Cache Handle: 0x001D
L3 Cache Handle: 0x001E
Serial Number: 20803DD500702414
Asset Tag: To be filled by O.E.M.
Part Number: To be filled by O.E.M.
Core Count: 32
Core Enabled: 32
Thread Count: 32
Characteristics:
64-bit capable
Multi-Core
Execute Protection
Enhanced Virtualization
Power/Performance Control
信息显示 HUAWEI Kunpeng 920 5220 双核
NPU:
进入昇腾 社区
300V
固件与驱动
下载后安装
查看版本与设备
shell
npu-smi -v
npu-smi version: 23.0.rc2
npu-smi info -l
Total Count : 4
NPU ID : 1
Product Name : IT21PDDD
Serial Number : 2106030675ZENC000814
Chip Count : 1
NPU ID : 2
Product Name : IT21PDDD
Serial Number : 2106030675ZENC000922
Chip Count : 1
NPU ID : 5
Product Name : IT21PDDD
Serial Number : 2106030675ZENC000947
Chip Count : 1
NPU ID : 6
Product Name : IT21PDDD
Serial Number : 2106030675ZENC000843
Chip Count : 1
CANN
按照指导安装
shell
/usr/local/Ascend$ ls
ascend-toolkit driver host_servers_remove.sh host_services_exit.sh host_sys_init.sh
develop firmware host_servers_setup.sh host_services_setup.sh version.info