计算机视觉——图像修复综述篇

目录

[1. Deterministic Image Inpainting 判别器图像修复](#1. Deterministic Image Inpainting 判别器图像修复)

[1.1. sigle-shot framework](#1.1. sigle-shot framework)

[(1) Generators](#(1) Generators)

[(2) training objects / Loss Functions](#(2) training objects / Loss Functions)

[1.2. two-stage framework](#1.2. two-stage framework)

[2. Stochastic Image Inpainting 随机图像修复](#2. Stochastic Image Inpainting 随机图像修复)

[2.1. VAE-based methods](#2.1. VAE-based methods)

[2.2. GAN-based methods](#2.2. GAN-based methods)

[2.3. Flow-based methods](#2.3. Flow-based methods)

[2.4. MLM-based methods](#2.4. MLM-based methods)

[2.5. Diffusion model-based methods](#2.5. Diffusion model-based methods)

[3. text-guided image inpainting ⽂本引导的图像修复](#3. text-guided image inpainting ⽂本引导的图像修复)

[4. Inpainting Mask 掩码机制](#4. Inpainting Mask 掩码机制)

[(1) regular mask](#(1) regular mask)

[(2) irregular mask](#(2) irregular mask)

[5. Loss Function 损失函数](#5. Loss Function 损失函数)

[6. Dataset 图像修复领域数据集](#6. Dataset 图像修复领域数据集)

[(1) faces(CelebA & CelebA-HQ)](#(1) faces(CelebA & CelebA-HQ))

[(2) real-world encountered scenes(Places2)](#(2) real-world encountered scenes(Places2))

[(3) street scenes(Paris)](#(3) street scenes(Paris))

[(4) texture(DTD)](#(4) texture(DTD))

[(5) objects (ImageNet)](#(5) objects (ImageNet))

[7. Evaluation Protocol 评估指标](#7. Evaluation Protocol 评估指标)

[7.1. pixel-aware metrics](#7.1. pixel-aware metrics)

[7.2. (human) perception-aware metriics](#7.2. (human) perception-aware metriics)

[8. Performance Evaluation 表现评估](#8. Performance Evaluation 表现评估)

[8.1 Representative Image Inpainting Methods](#8.1 Representative Image Inpainting Methods)

[8.2 Loss Functions](#8.2 Loss Functions)

[9. Inpainting-based Application 基于图像修复的领域应⽤](#9. Inpainting-based Application 基于图像修复的领域应⽤)

[(1) Object Removal](#(1) Object Removal)

[(2) Text Editing](#(2) Text Editing)

[(3) Old Photo Restoration](#(3) Old Photo Restoration)

[(4) Image Compression](#(4) Image Compression)

[(5) Text-guided image editing](#(5) Text-guided image editing)

Reference


1. Deterministic Image Inpainting 判别器图像修复

1.1. sigle-shot framework
(1) Generators
  1. mask-aware design
  2. attention mechanism
  3. multi-scale aggregation
  4. transform domain
  5. encoder-decoder connection
  6. deep prior guidance
(2) training objects / Loss Functions
  1. Pixel-wise reconstruction loss
  2. perceptual loss
  3. style loss
  4. adversarial loss
  5. prevalent training objectives
1.2. two-stage framework

(1) coarse-to-fiine methods
(2) structure-then-texture methods

2. Stochastic Image Inpainting 随机图像修复

2.1. VAE-based methods
2.2. GAN-based methods
2.3. Flow-based methods
2.4. MLM-based methods
2.5. Diffusion model-based methods

(1) sample stratage design
(2) computational cost reduction

3. text-guided image inpainting ⽂本引导的图像修复

4. Inpainting Mask 掩码机制

(1) regular mask
(2) irregular mask

5. Loss Function 损失函数

同1-1.1-(2) training objects

6. Dataset 图像修复领域数据集

(1) faces(CelebA & CelebA-HQ)
(2) real-world encountered scenes(Places2)
(3) street scenes(Paris)
(4) texture(DTD)
(5) objects (ImageNet)

7. Evaluation Protocol 评估指标

7.1. pixel-aware metrics

focus on the precision of reconstructed pixels
(1) l1 error
(1) l2 error
(3) PSNR(peak signal-to-noise ratio)
(4) SSIM(the structure similarity index)
(5) MS-SSIM(muti-scale SSIM)

7.2. (human) perception-aware metriics

the visual perception quality
(1) FID(Frechet Inception diistance)
(2) LPIPS(learned perceptual image patch similarity)
(3) P/U-IDS(pair-unpair Inception discriminative score)

8. Performance Evaluation 表现评估

8.1 Representative Image Inpainting Methods

(1) Models: RFR, MADF, DSI, CR-Fill, CoModGAN, LGNet, RePaint
(2) Dataset: CeleBA-HQ, Places2
(3) Mask: M1, M2, M3, M4, M5, M6
(4) Metrics: l1, PSNR, SSIM, MS-SSIM, FID, LP-IPS
(5) Loss: pixes reconstruction loss, perceptual loss, resnetpl loss, style loss, stylemeanstd,
percept-style loss, lsgan

8.2 Loss Functions

同1-1.1-(2) training objects

9. Inpainting-based Application 基于图像修复的领域应⽤

(1) Object Removal
(2) Text Editing
(3) Old Photo Restoration
(4) Image Compression
(5) Text-guided image editing

Reference

  1. Deep Learning-based Image and Video Inpainting: A Survey
相关推荐
Liue612312313 小时前
基于YOLOv26的口罩佩戴检测与识别系统实现与优化
人工智能·yolo·目标跟踪
小二·5 小时前
Python Web 开发进阶实战 :AI 原生数字孪生 —— 在 Flask + Three.js 中构建物理世界实时仿真与优化平台
前端·人工智能·python
chinesegf5 小时前
文本嵌入模型的比较(一)
人工智能·算法·机器学习
珠海西格电力5 小时前
零碳园区的能源结构优化需要哪些技术支持?
大数据·人工智能·物联网·架构·能源
Black蜡笔小新5 小时前
视频汇聚平台EasyCVR打造校园消防智能监管新防线
网络·人工智能·音视频
珠海西格电力科技5 小时前
双碳目标下,微电网为何成为能源转型核心载体?
网络·人工智能·物联网·云计算·智慧城市·能源
2501_941837265 小时前
【计算机视觉】基于YOLOv26的交通事故检测与交通状况分析系统详解_1
人工智能·yolo·计算机视觉
HyperAI超神经6 小时前
加州大学构建基于全连接神经网络的片上光谱仪,在芯片级尺寸上实现8纳米的光谱分辨率
人工智能·深度学习·神经网络·机器学习·ai编程
badfl6 小时前
AI漫剧技术方案拆解:NanoBanana+Sora视频生成全流程
人工智能·ai·ai作画
杭州杭州杭州6 小时前
李沐动手学深度学习笔记(4)---物体检测基础
人工智能·笔记·深度学习