CVPR2024|AIGC(图像生成,视频生成等)相关论文汇总(附论文链接/开源代码/解析)【持续更新】

CVPR2024|AIGC相关论文汇总(如果觉得有帮助,欢迎点赞和收藏)

  • Awesome-CVPR2024-AIGC
  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
      • [ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations](#ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations)
      • [InstanceDiffusion: Instance-level Control for Image Generation](#InstanceDiffusion: Instance-level Control for Image Generation)
      • [Instruct-Imagen: Image Generation with Multi-modal Instruction](#Instruct-Imagen: Image Generation with Multi-modal Instruction)
      • [MACE: Mass Concept Erasure in Diffusion Models](#MACE: Mass Concept Erasure in Diffusion Models)
      • [PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models](#PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models)
      • [Residual Denoising Diffusion Models](#Residual Denoising Diffusion Models)
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
      • [PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models](#PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models)
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
      • [Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners](#Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners)
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
      • [EscherNet: A Generative Model for Scalable View Synthesis](#EscherNet: A Generative Model for Scalable View Synthesis)
  • 6.其他多任务(Others)
      • [InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks](#InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks)
      • [Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models](#Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models)
  • 参考
  • 相关整理

Awesome-CVPR2024-AIGC

A Collection of Papers and Codes for CVPR2024 AIGC

整理汇总下今年CVPR AIGC相关的论文和代码,具体如下。

欢迎star,fork和PR~
优先在Github更新Awesome-CVPR2024-AIGC,欢迎star~
知乎https://zhuanlan.zhihu.com/p/684325134

参考或转载请注明出处

CVPR2024官网:https://cvpr.thecvf.com/Conferences/2024

CVPR完整论文列表:

开会时间:2024年6月17日-6月21日

论文接收公布时间:

【Contents】

  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
  • 6.其他多任务(Others)

1.图像生成(Image Generation/Image Synthesis)

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

InstanceDiffusion: Instance-level Control for Image Generation

Instruct-Imagen: Image Generation with Multi-modal Instruction

MACE: Mass Concept Erasure in Diffusion Models

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Residual Denoising Diffusion Models

2.图像编辑(Image Editing)

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

3.视频生成(Video Generation/Image Synthesis)

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

4.视频编辑(Video Editing)

5.3D生成(3D Generation/3D Synthesis)

EscherNet: A Generative Model for Scalable View Synthesis

6.其他多任务(Others)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

参考

CVPR 2024 论文和开源项目合集(Papers with Code)

相关整理

相关推荐
Blossom.1181 小时前
量子通信:从科幻走向现实的未来通信技术
人工智能·深度学习·目标检测·机器学习·计算机视觉·语音识别·量子计算
杂雾无尘1 小时前
用 Trae 打造全栈项目魔法师 - 让项目初始化不再是噩梦
aigc·openai·ai编程
bj32814 小时前
机器学习实验八--基于pca的人脸识别
人工智能·机器学习·计算机视觉
程序员X小鹿4 小时前
全球首个能无限跑的AI来了!AI Agents的下一站?这才是真的颠覆式革新!(附10个邀请码)
aigc
清醒的兰5 小时前
OpenCV 图像像素的逻辑操作
人工智能·opencv·计算机视觉
刘维克5 小时前
(预发布)[阿维笔记]分析优化CloudStudio高性能工作空间的GPU训练速度和效果
深度学习·计算机视觉
CoovallyAIHub7 小时前
AI+无人机如何守护濒危物种?YOLOv8实现95%精准识别
深度学习·算法·计算机视觉
掘我的金10 小时前
深入解析Stream函数与生成器本质
llm·aigc
掘我的金10 小时前
Prompt Cache 与 Streaming:核心机制与优化实践
llm·aigc
新知图书11 小时前
OpenCV在图像上绘制文字示例
人工智能·opencv·计算机视觉