Watch out for in-place operations in PyTorch

Nastu_Ho-小何同学 2024-02-22 10:29

A common in-place operation: nn.ReLU(inplace=True)

import torch
import torch.nn as nn

a = torch.randn(2)  # e.g. tensor([-0.3690,  0.0626])
b = a.clone()       # independent copy: tensor([-0.3690,  0.0626])
c = a               # alias of a, no copy: tensor([-0.3690,  0.0626])
relu = nn.ReLU(inplace=True)

Case 1

out = relu(a)  # tensor([0.0000, 0.0626]); out shares a's storage
a  # tensor([0.0000, 0.0626]), overwritten in place
b  # tensor([-0.3690,  0.0626]), the clone is unaffected
c  # tensor([0.0000, 0.0626]), changed along with a

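Without clone(), c changes as well: c = a only binds a second name to the same tensor, whereas b = a.clone() has its own copy of the data.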
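One way to confirm which names share memory is to compare data_ptr() values. The following is a minimal sketch that uses fixed values in place of torch.randn (an assumption made only so the output is reproducible):

```python
import torch
import torch.nn as nn

# Fixed values instead of torch.randn, so the result is reproducible
a = torch.tensor([-0.3690, 0.0626])
b = a.clone()  # independent copy with its own storage
c = a          # second name for the same tensor

relu = nn.ReLU(inplace=True)
out = relu(a)  # overwrites a's storage and returns a tensor backed by it

# data_ptr() is the address of the first element in the underlying storage
print(out.data_ptr() == a.data_ptr())  # True: out is backed by a's memory
print(c.data_ptr() == a.data_ptr())    # True: c is an alias of a
print(b.data_ptr() == a.data_ptr())    # False: b is a separate copy
print(a, b, c)
```

Only b keeps the negative entry, because it is the only tensor that does not share storage with a.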

Case 2

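# a is reset to tensor([-0.3690,  0.0626]) before each line
out1 = a + relu(a)  # tensor([0.0000, 0.1252])
out2 = relu(a) + a  # tensor([0.0000, 0.1252])

Both orderings give 2 * relu(a), not the tensor([-0.3690, 0.1252]) that a + relu(a) suggests. Python evaluates both operands before it adds them, and by then relu(a) has already overwritten a, so the original values are lost whichever side relu(a) is on. With inplace=False, or with a.clone() + relu(a), the result is tensor([-0.3690, 0.1252]).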
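A minimal sketch for checking how such an expression behaves, using fixed values rather than torch.randn so the numbers are reproducible. The helper fresh() is hypothetical and exists only to rebuild the example tensor before each experiment:

```python
import torch
import torch.nn as nn

def fresh():
    # Hypothetical helper: rebuild the example tensor for each experiment
    return torch.tensor([-0.3690, 0.0626])

relu_inplace = nn.ReLU(inplace=True)
relu_copy = nn.ReLU(inplace=False)

a = fresh()
out_inplace = a + relu_inplace(a)  # a is overwritten before the addition runs

a = fresh()
out_copy = a + relu_copy(a)        # a is left untouched

a = fresh()
out_clone = a.clone() + relu_inplace(a)  # snapshot a first, then overwrite it

print(out_inplace)  # tensor([0.0000, 0.1252])
print(out_copy)     # tensor([-0.3690,  0.1252])
print(out_clone)    # tensor([-0.3690,  0.1252])
```

The clone variant works because Python evaluates the left operand first, so the copy is taken before relu_inplace touches a. When the original values are needed later, the simpler choice is usually inplace=False.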
