[动手学深度学习]生成对抗网络GAN学习笔记

论文原文:Generative Adversarial Nets (neurips.cc)

李沐GAN论文逐段精读:GAN论文逐段精读【论文精读】_哔哩哔哩_bilibili

论文代码:http://www.github.com/goodfeli/adversarial

Ian, J. et al. (2014) 'Generative adversarial network', NIPS'14: Proceedings of the 27th International Conference on Neural Information Processing Systems , Vol. 2, pp 2672--2680. doi: https://doi.org/10.48550/arXiv.1406.2661

未完待续

  1. GAN

1.1. 整体实现步骤

1.2. GAN理念

1.3. GAN弊端和局限

2. GAN论文原文学习

2.1. Abstract

①They combined generative model and discriminative model together, which forms a new model. is the "cheating" part which focus on imitating and is the "distinguishing" part which focus on distinguishing where the data comes from.

②This model is rely on a "minmax" function

③GAN does not need Markov chains or unrolled approximate inference nets

④They designed qualitative and quantitative evaluation to analyse the feasibility of GAN

2.2. Introduction

①The authors praised deep learning and briefly mentioned its prospects

②Due to the difficulty of fitting or approximating the distribution of the ground truth, the designed a new generative model

③They compare the generated model to the person who makes counterfeit money, and the discriminative model to the police. Both parties will mutually promote and grow. The authors ultimately hope that the ability of the counterfeiter can be indistinguishable from the genuine product

④Both and are MLP, and passes random noise

⑤They just adopt backpropagation and dropout in training

corpora 全集;corpus 的复数

counterfeiter n.伪造者;制假者;仿造者

①Recent works are concentrated on approximating function, such as succesful deep Boltzmann machine. However, their likelihood functions are too complex to process.

②Therefore, here comes generative model, which only generates samples but does not approximates function. Generative stochastic networks are an classic generative model.

③Their backpropagation:

④Variational autoencoders (VAEs) in Kingma and Welling and Rezende et al. do the similar work. However, VAEs are modeled by differentiate hidden units, which is contrary to GANs.

2.4. Adversarial nets

2.5. Theoretical Results

2.5.1. Global Optimality of p_g = p_data

2.5.2. Convergence of Algorithm 1

2.6. Experiments

2.7. Advantages and disadvantages

2.8. Conclusions and future work

  1. 知识补充

3.1. 散度

(1)KL散度

(2)JS散度

相关推荐
泠不丁3 分钟前
React/Next.js 前端开发与治愈系 UI 设计
人工智能
码语智行4 分钟前
Claude Code 免费白嫖 Qwen3.6,Token 无限量
人工智能
阿文的代码库7 分钟前
机器学习之精确率和召回率的关系
人工智能·算法·机器学习
Raink老师7 分钟前
【AI面试临阵磨枪-100】Harness 与 MCP/A2A 协议、Skill 体系如何集成?
人工智能·面试·职场和发展
我爱cope11 分钟前
【Agent智能体21 | 构建AI工作流的技巧-优化组件的常用方法】
人工智能·设计模式·语言模型·职场和发展
x_lrong11 分钟前
AMD 7800xt + WSL2 + ROCm7.2.1 配置AI开发环境
人工智能
逐梦苍穹13 分钟前
我开源了一个Claude Code历史可视化工具:本地Prompt一键浏览、搜索、导出
人工智能·开源·prompt·codex·claudecode
刘国华-平价IT运维课堂16 分钟前
Ubuntu 26.04 LTS 发布,研发与运维需要关注什么?
linux·运维·服务器·人工智能·ubuntu
专注搞钱17 分钟前
半导体行业中基于 LSTM 神经网络的 SPC 异常预测实战
人工智能·rnn·lstm
糖果店的幽灵18 分钟前
Spring AI 从入门到精通-ChatClient你与 AI 对话的终极武器
人工智能·python·spring