CVPR 2024 | AIGC (Image Generation, Video Generation, etc.) Paper Roundup (with Paper Links, Open-Source Code, and Analyses) [Continuously Updated]

CVPR 2024 | AIGC Paper Roundup (if you find it helpful, likes and bookmarks are welcome)

  • Awesome-CVPR2024-AIGC
  • 1. Image Generation (Image Synthesis)
      • ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
      • InstanceDiffusion: Instance-level Control for Image Generation
      • Instruct-Imagen: Image Generation with Multi-modal Instruction
      • MACE: Mass Concept Erasure in Diffusion Models
      • PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models
      • Residual Denoising Diffusion Models
  • 2. Image Editing
      • PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
  • 3. Video Generation (Video Synthesis)
      • Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
  • 4. Video Editing
  • 5. 3D Generation (3D Synthesis)
      • EscherNet: A Generative Model for Scalable View Synthesis
  • 6. Others (Multi-Task)
      • InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
      • Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
  • References
  • Related Collections

Awesome-CVPR2024-AIGC

A Collection of Papers and Codes for CVPR2024 AIGC

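This is a collection of this year's CVPR papers and code related to AIGC, organized as follows.

Stars, forks, and PRs are welcome.
Updates are posted first to the GitHub repository Awesome-CVPR2024-AIGC.
Zhihu: https://zhuanlan.zhihu.com/p/684325134

Please credit the source when citing or reposting.

CVPR 2024 official website: https://cvpr.thecvf.com/Conferences/2024

Full CVPR paper list:

Conference dates: June 17-21, 2024

Paper acceptance announcement date: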

1. Image Generation (Image Synthesis)

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

InstanceDiffusion: Instance-level Control for Image Generation

Instruct-Imagen: Image Generation with Multi-modal Instruction

MACE: Mass Concept Erasure in Diffusion Models

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Residual Denoising Diffusion Models

2. Image Editing

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

3. Video Generation (Video Synthesis)

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

4. Video Editing

5. 3D Generation (3D Synthesis)

EscherNet: A Generative Model for Scalable View Synthesis

6. Others (Multi-Task)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

References

CVPR 2024 Papers and Open-Source Projects Collection (Papers with Code)

Related Collections
