CVPR2024|AIGC(图像生成,视频生成等)相关论文汇总(附论文链接/开源代码/解析)【持续更新】

CVPR2024|AIGC相关论文汇总(如果觉得有帮助,欢迎点赞和收藏)

  • Awesome-CVPR2024-AIGC
  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
      • [ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations](#ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations)
      • [InstanceDiffusion: Instance-level Control for Image Generation](#InstanceDiffusion: Instance-level Control for Image Generation)
      • [Instruct-Imagen: Image Generation with Multi-modal Instruction](#Instruct-Imagen: Image Generation with Multi-modal Instruction)
      • [MACE: Mass Concept Erasure in Diffusion Models](#MACE: Mass Concept Erasure in Diffusion Models)
      • [PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models](#PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models)
      • [Residual Denoising Diffusion Models](#Residual Denoising Diffusion Models)
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
      • [PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models](#PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models)
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
      • [Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners](#Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners)
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
      • [EscherNet: A Generative Model for Scalable View Synthesis](#EscherNet: A Generative Model for Scalable View Synthesis)
  • 6.其他多任务(Others)
      • [InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks](#InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks)
      • [Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models](#Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models)
  • 参考
  • 相关整理

Awesome-CVPR2024-AIGC

A Collection of Papers and Codes for CVPR2024 AIGC

整理汇总下今年CVPR AIGC相关的论文和代码,具体如下。

欢迎star,fork和PR~
优先在Github更新Awesome-CVPR2024-AIGC,欢迎star~
知乎https://zhuanlan.zhihu.com/p/684325134

参考或转载请注明出处

CVPR2024官网:https://cvpr.thecvf.com/Conferences/2024

CVPR完整论文列表:

开会时间:2024年6月17日-6月21日

论文接收公布时间:

【Contents】

  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
  • 6.其他多任务(Others)

1.图像生成(Image Generation/Image Synthesis)

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

InstanceDiffusion: Instance-level Control for Image Generation

Instruct-Imagen: Image Generation with Multi-modal Instruction

MACE: Mass Concept Erasure in Diffusion Models

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Residual Denoising Diffusion Models

2.图像编辑(Image Editing)

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

3.视频生成(Video Generation/Image Synthesis)

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

4.视频编辑(Video Editing)

5.3D生成(3D Generation/3D Synthesis)

EscherNet: A Generative Model for Scalable View Synthesis

6.其他多任务(Others)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

参考

CVPR 2024 论文和开源项目合集(Papers with Code)

相关整理

相关推荐
sali-tec1 天前
C# 基于halcon的视觉工作流-章56-彩图转云图
人工智能·算法·计算机视觉·c#
m0_650108241 天前
【论文精读】MotionEditor:基于内容感知扩散模型的视频运动编辑
aigc·论文精读·视频运动编辑·潜在扩散模型(ldm)·注意力注入·时空一致性
墨风如雪1 天前
OAK:打破壁垒,共绘智能体生态新蓝图
aigc
笑脸惹桃花1 天前
目标检测数据集——路面裂缝检测数据集
人工智能·深度学习·yolo·目标检测·计算机视觉·数据集
算家计算1 天前
一张白纸,无限画布:SkyReels刚刚重新定义了AI视频创作
人工智能·aigc·资讯
有为少年1 天前
告别乱码:OpenCV 中文路径(Unicode)读写的解决方案
人工智能·opencv·计算机视觉
清风与日月1 天前
halcon分类器使用标准流程
深度学习·目标检测·计算机视觉
初学小刘1 天前
基于 U-Net 的医学图像分割
python·opencv·计算机视觉
极客BIM工作室1 天前
U-Net 的输入与输出:通用场景与扩散模型场景解析
人工智能·深度学习·计算机视觉
格林威1 天前
AOI在化学药剂检测领域中的应用
人工智能·数码相机·计算机视觉·目标跟踪·视觉检测·制造·机器视觉