计算机视觉——图像修复综述篇

目录

[1. Deterministic Image Inpainting 判别器图像修复](#1. Deterministic Image Inpainting 判别器图像修复)

[1.1. sigle-shot framework](#1.1. sigle-shot framework)

[(1) Generators](#(1) Generators)

[(2) training objects / Loss Functions](#(2) training objects / Loss Functions)

[1.2. two-stage framework](#1.2. two-stage framework)

[2. Stochastic Image Inpainting 随机图像修复](#2. Stochastic Image Inpainting 随机图像修复)

[2.1. VAE-based methods](#2.1. VAE-based methods)

[2.2. GAN-based methods](#2.2. GAN-based methods)

[2.3. Flow-based methods](#2.3. Flow-based methods)

[2.4. MLM-based methods](#2.4. MLM-based methods)

[2.5. Diffusion model-based methods](#2.5. Diffusion model-based methods)

[3. text-guided image inpainting ⽂本引导的图像修复](#3. text-guided image inpainting ⽂本引导的图像修复)

[4. Inpainting Mask 掩码机制](#4. Inpainting Mask 掩码机制)

[(1) regular mask](#(1) regular mask)

[(2) irregular mask](#(2) irregular mask)

[5. Loss Function 损失函数](#5. Loss Function 损失函数)

[6. Dataset 图像修复领域数据集](#6. Dataset 图像修复领域数据集)

[(1) faces(CelebA & CelebA-HQ)](#(1) faces(CelebA & CelebA-HQ))

[(2) real-world encountered scenes(Places2)](#(2) real-world encountered scenes(Places2))

[(3) street scenes(Paris)](#(3) street scenes(Paris))

[(4) texture(DTD)](#(4) texture(DTD))

[(5) objects (ImageNet)](#(5) objects (ImageNet))

[7. Evaluation Protocol 评估指标](#7. Evaluation Protocol 评估指标)

[7.1. pixel-aware metrics](#7.1. pixel-aware metrics)

[7.2. (human) perception-aware metriics](#7.2. (human) perception-aware metriics)

[8. Performance Evaluation 表现评估](#8. Performance Evaluation 表现评估)

[8.1 Representative Image Inpainting Methods](#8.1 Representative Image Inpainting Methods)

[8.2 Loss Functions](#8.2 Loss Functions)

[9. Inpainting-based Application 基于图像修复的领域应⽤](#9. Inpainting-based Application 基于图像修复的领域应⽤)

[(1) Object Removal](#(1) Object Removal)

[(2) Text Editing](#(2) Text Editing)

[(3) Old Photo Restoration](#(3) Old Photo Restoration)

[(4) Image Compression](#(4) Image Compression)

[(5) Text-guided image editing](#(5) Text-guided image editing)

Reference


1. Deterministic Image Inpainting 判别器图像修复

1.1. sigle-shot framework
(1) Generators
  1. mask-aware design
  2. attention mechanism
  3. multi-scale aggregation
  4. transform domain
  5. encoder-decoder connection
  6. deep prior guidance
(2) training objects / Loss Functions
  1. Pixel-wise reconstruction loss
  2. perceptual loss
  3. style loss
  4. adversarial loss
  5. prevalent training objectives
1.2. two-stage framework

(1) coarse-to-fiine methods
(2) structure-then-texture methods

2. Stochastic Image Inpainting 随机图像修复

2.1. VAE-based methods
2.2. GAN-based methods
2.3. Flow-based methods
2.4. MLM-based methods
2.5. Diffusion model-based methods

(1) sample stratage design
(2) computational cost reduction

3. text-guided image inpainting ⽂本引导的图像修复

4. Inpainting Mask 掩码机制

(1) regular mask
(2) irregular mask

5. Loss Function 损失函数

同1-1.1-(2) training objects

6. Dataset 图像修复领域数据集

(1) faces(CelebA & CelebA-HQ)
(2) real-world encountered scenes(Places2)
(3) street scenes(Paris)
(4) texture(DTD)
(5) objects (ImageNet)

7. Evaluation Protocol 评估指标

7.1. pixel-aware metrics

focus on the precision of reconstructed pixels
(1) l1 error
(1) l2 error
(3) PSNR(peak signal-to-noise ratio)
(4) SSIM(the structure similarity index)
(5) MS-SSIM(muti-scale SSIM)

7.2. (human) perception-aware metriics

the visual perception quality
(1) FID(Frechet Inception diistance)
(2) LPIPS(learned perceptual image patch similarity)
(3) P/U-IDS(pair-unpair Inception discriminative score)

8. Performance Evaluation 表现评估

8.1 Representative Image Inpainting Methods

(1) Models: RFR, MADF, DSI, CR-Fill, CoModGAN, LGNet, RePaint
(2) Dataset: CeleBA-HQ, Places2
(3) Mask: M1, M2, M3, M4, M5, M6
(4) Metrics: l1, PSNR, SSIM, MS-SSIM, FID, LP-IPS
(5) Loss: pixes reconstruction loss, perceptual loss, resnetpl loss, style loss, stylemeanstd,
percept-style loss, lsgan

8.2 Loss Functions

同1-1.1-(2) training objects

9. Inpainting-based Application 基于图像修复的领域应⽤

(1) Object Removal
(2) Text Editing
(3) Old Photo Restoration
(4) Image Compression
(5) Text-guided image editing

Reference

  1. Deep Learning-based Image and Video Inpainting: A Survey
相关推荐
大模型任我行11 分钟前
英伟达:解耦训练与推演的服务架构
人工智能·语言模型·自然语言处理·论文笔记
newsxun13 分钟前
中创汇联双城峰会圆满举办 多维赋能实体高质量发展
大数据·人工智能
人工智能AI技术13 分钟前
Karpathy开源第二大脑方案,有望替代向量数据库,让AI永不失忆
人工智能
之歆27 分钟前
打造你的 AI 浏览器助手:从零到一的完整实践
人工智能
小陈工29 分钟前
Python Web开发入门(十一):RESTful API设计原则与最佳实践——让你的API既优雅又好用
开发语言·前端·人工智能·后端·python·安全·restful
humors22135 分钟前
AI工具合集,不定期更新
人工智能·windows·ai·工具·powershell·deepseek
做个文艺程序员37 分钟前
2026 年开源大模型选型指南:Qwen3.5 / DeepSeek V3.2 / Llama 4 横向对比
人工智能·开源·llama
LabVIEW开发40 分钟前
LabVIEW控制阀性能测试评估系统
人工智能·labview·labview知识·labview功能·labview程序
测试_AI_一辰42 分钟前
AI 如何参与 Playwright 自动化维护:一次自动修复闭环实践
人工智能·算法·ai·自动化·ai编程
chenglin01642 分钟前
AI服务的可观测性与运维
运维·人工智能