【论文阅读】DaST: Data-free Substitute Training for Adversarial Attacks(2020)

摘要

Machine learning models(机器学习模型) are vulnerable(容易受到) to adversarial examples(对抗样本). For the black-box setting(对于黑盒设置), current substitute attacks(目前替代攻击) need pre-trained models(预训练模型) to generate adversarial examples(生成对抗样本). However, pre-trained models(预训练模型) are hard to obtain(很难获得) in real-world tasks(在实际任务中). In this paper, we propose a data-free substitute training method(无数据替代训练方法) (DaST) to obtain substitute models(获得替代模型) for adversarial black-box attacks(对抗性黑盒攻击) without the requirement of any real data(不需要任何真实数据). To achieve this, DaST utilizes specially designed(专门设计) generative adversarial networks(生成对抗网络) (GANs) to train the substitute models. In particular(特别地), we design a multi-branch architecture(多分支架构) and label-control loss(标签控制损失) for the generative model to deal with(处理) the uneven distribution(不均匀分布) of synthetic samples(生成样本). The substitute model is then trained by the synthetic samples(合成样本) generated by the generative model(生成模型), which are labeled by the attacked model subsequently(随后). The experiments demonstrate(实验表明) the substitute models produced by DaST can achieve competitive performance(达到有竞争力的性能) compared with the baseline models(基线模型) which are trained by the same train set(相同的训练集) with attacked models(攻击模型). Additionally(此外), to evaluate the practicability(评估实用性) of the proposed method(所提出的方法) on the real-world task(在现实世界任务), we attack an online machine learning model(在线机器学习模型) on the Microsoft Azure platform. The remote model(远程模型) misclassifies(错误分类) 98.35% of the adversarial examples crafted(制作) by our method. To the best of our knowledge(据我们所知), we are the first(第一个) to train a substitute model for adversarial attacks(对抗样本) without any real data(没有任何真实数据).

方法

总结

We have presented(提出) a data-free method DaST to train substitute models(替代模型) for adversarial attacks(对抗性攻击). DaST reduces(降低) the prerequisites(先决条件) of adversarial substitute attacks(对抗性攻击) by utilizing(利用) GANs to generate synthetic samples(生成合成样本). This is the first method that can train substitute models without the requirement of any real data(不需要任何真实数据). The experiments showed(实验表明) the effectiveness(有效性) of our method. It presented(表明) that machine learning systems have significant risks(存在重大风险), attackers can train substitute models even when the real input data is hard to collect(即使难以手机真实的输入数据).

The proposed DaST cannot generate adversarial examples alone(不能单独生成对抗性样本), it should be used with other gradient-based attack methods(应该与其他基于梯度的攻击方法一起使用). In future work, we will design a new substitute training method, which can generate attacks directly(直接). Furthermore, we will explore(探索) the defense for DaST.

论文链接

DaST: Data-free Substitute Training for Adversarial Attacks

相关推荐
大模型最新论文速读1 分钟前
字节跳动 Seed: 用“分子结构”对思维建模
论文阅读·人工智能·深度学习·机器学习·自然语言处理
njsgcs6 小时前
MG-Nav: 基于稀疏空间记忆的双尺度视觉导航 论文阅读
论文阅读
大模型最新论文速读7 小时前
「图文讲解」Profit:用概率挑选重要 token 解决 SFT 过拟合问题
论文阅读·人工智能·深度学习·机器学习·自然语言处理
有Li20 小时前
DACG:用于放射学报告生成的双重注意力和上下文引导模型/文献速递-基于人工智能的医学影像技术
论文阅读·人工智能·文献·医学生
AustinCyy1 天前
【论文笔记】ADL: A Declarative Language for Agent-Based Chatbots
论文阅读
墨绿色的摆渡人2 天前
论文笔记(一百一十八)One2Any: One-Reference 6D Pose Estimation for Any Object
论文阅读
崔高杰2 天前
【论文阅读笔记】Agent Memory相关文献追踪——异构存储和经验记忆相关
论文阅读·笔记
李加号pluuuus2 天前
【论文阅读】ColorFlow: Retrieval-Augmented Image Sequence Colorization
论文阅读
DuHz2 天前
自动驾驶雷达干扰缓解:探索主动策略论文精读
论文阅读·人工智能·算法·机器学习·自动驾驶·汽车·信号处理
m0_650108242 天前
Alpamayo-R1:打通推理与动作预测,迈向稳健 L4 级自动驾驶
论文阅读·端到端自动驾驶·融合结构化因果推理与车辆控制·长尾场景稳健性·开环轨迹预测·闭环驾驶安全