AT800(3000) +昇腾300V 之一 环境部署

环境部署

背景

因nvidia 受限 公司转华为推理服务器 AT800(3000) + 昇腾 ,将推出一系列文章 ,记录过程。

服务器 硬件资源

系统:

shell 复制代码
lsb_release -a
No LSB modules are available.
Distributor ID:	Ubuntu
Description:	Ubuntu 20.04.6 LTS
Release:	20.04
Codename:	focal
shell 复制代码
sudo dmidecode -t system
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.2.0 present.

Handle 0x0001, DMI type 1, 27 bytes
System Information
	Manufacturer: RCSIT
	Product Name: AT800 (Model 3000)
	Version: To be filled by O.E.M.
	Serial Number: 2102313NMSP0N6100250
	UUID: 3fe02c62-f82e-a139-ec11-afedc88ff166
	Wake-up Type: Power Switch
	SKU Number: To be filled by O.E.M.
	Family: To be filled by O.E.M.

Handle 0x0005, DMI type 32, 11 bytes
System Boot Information
	Status: No errors detected

CPU:

查看CPU信息

shell 复制代码
lscpu
Architecture:                    aarch64
CPU op-mode(s):                  64-bit
Byte Order:                      Little Endian
CPU(s):                          64
On-line CPU(s) list:             0-63
Thread(s) per core:              1
Core(s) per socket:              32
Socket(s):                       2
NUMA node(s):                    2
Vendor ID:                       0x48
Model:                           0
Stepping:                        0x1
CPU max MHz:                     2600.0000
CPU min MHz:                     200.0000
BogoMIPS:                        200.00
L1d cache:                       4 MiB
L1i cache:                       4 MiB
L2 cache:                        32 MiB
L3 cache:                        64 MiB
NUMA node0 CPU(s):               0-31
NUMA node1 CPU(s):               32-63
Vulnerability Itlb multihit:     Not affected
Vulnerability L1tf:              Not affected
Vulnerability Mds:               Not affected
Vulnerability Meltdown:          Not affected
Vulnerability Spec store bypass: Not affected
Vulnerability Spectre v1:        Mitigation; __user pointer sanitization
Vulnerability Spectre v2:        Not affected
Vulnerability Srbds:             Not affected
Vulnerability Tsx async abort:   Not affected
Flags:                           fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma dcpop asimddp asimdfhm

Vendor ID:0x48 // 鲲鹏 920

双核

查看具体国产CPU信息

shell 复制代码
 sudo dmidecode -t processor 
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 3.2.0 present.

Handle 0x001B, DMI type 4, 48 bytes
Processor Information
	Socket Designation: CPU01
	Type: Central Processor
	Family: ARM
	Manufacturer: HiSilicon
	ID: 10 D0 1F 48 00 00 00 00
	Signature: Implementor 0x48, Variant 0x1, Architecture 15, Part 0xd01, Revision 0
	Version: HUAWEI Kunpeng 920 5220
	Voltage: 0.9 V
	External Clock: 100 MHz
	Max Speed: 2600 MHz
	Current Speed: 2600 MHz
	Status: Populated, Enabled
	Upgrade: Unknown
	L1 Cache Handle: 0x0018
	L2 Cache Handle: 0x0019
	L3 Cache Handle: 0x001A
	Serial Number: E4E03DD500702C0C
	Asset Tag: To be filled by O.E.M.
	Part Number: To be filled by O.E.M.
	Core Count: 32
	Core Enabled: 32
	Thread Count: 32
	Characteristics:
		64-bit capable
		Multi-Core
		Execute Protection
		Enhanced Virtualization
		Power/Performance Control

Handle 0x001F, DMI type 4, 48 bytes
Processor Information
	Socket Designation: CPU02
	Type: Central Processor
	Family: ARM
	Manufacturer: HiSilicon
	ID: 10 D0 1F 48 00 00 00 00
	Signature: Implementor 0x48, Variant 0x1, Architecture 15, Part 0xd01, Revision 0
	Version: HUAWEI Kunpeng 920 5220
	Voltage: 0.9 V
	External Clock: 100 MHz
	Max Speed: 2600 MHz
	Current Speed: 2600 MHz
	Status: Populated, Enabled
	Upgrade: Unknown
	L1 Cache Handle: 0x001C
	L2 Cache Handle: 0x001D
	L3 Cache Handle: 0x001E
	Serial Number: 20803DD500702414
	Asset Tag: To be filled by O.E.M.
	Part Number: To be filled by O.E.M.
	Core Count: 32
	Core Enabled: 32
	Thread Count: 32
	Characteristics:
		64-bit capable
		Multi-Core
		Execute Protection
		Enhanced Virtualization
		Power/Performance Control

信息显示 HUAWEI Kunpeng 920 5220 双核

NPU:

进入昇腾 社区
300V

固件与驱动

驱动固件

下载后安装

查看版本与设备

shell 复制代码
npu-smi -v
npu-smi version: 23.0.rc2
npu-smi info -l
	Total Count                    : 4

	NPU ID                         : 1
	Product Name                   : IT21PDDD
	Serial Number                  : 2106030675ZENC000814
	Chip Count                     : 1

	NPU ID                         : 2
	Product Name                   : IT21PDDD
	Serial Number                  : 2106030675ZENC000922
	Chip Count                     : 1

	NPU ID                         : 5
	Product Name                   : IT21PDDD
	Serial Number                  : 2106030675ZENC000947
	Chip Count                     : 1

	NPU ID                         : 6
	Product Name                   : IT21PDDD
	Serial Number                  : 2106030675ZENC000843
	Chip Count                     : 1
CANN

cann6.2

按照指导安装

shell 复制代码
/usr/local/Ascend$ ls
ascend-toolkit  driver    host_servers_remove.sh  host_services_exit.sh   host_sys_init.sh
develop         firmware  host_servers_setup.sh   host_services_setup.sh  version.info

鲲鹏列表
昇腾列表