CVPR2024|AIGC(图像生成,视频生成等)相关论文汇总(附论文链接/开源代码/解析)【持续更新】

CVPR2024|AIGC相关论文汇总(如果觉得有帮助,欢迎点赞和收藏)

  • Awesome-CVPR2024-AIGC
  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
      • [ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations](#ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations)
      • [InstanceDiffusion: Instance-level Control for Image Generation](#InstanceDiffusion: Instance-level Control for Image Generation)
      • [Instruct-Imagen: Image Generation with Multi-modal Instruction](#Instruct-Imagen: Image Generation with Multi-modal Instruction)
      • [MACE: Mass Concept Erasure in Diffusion Models](#MACE: Mass Concept Erasure in Diffusion Models)
      • [PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models](#PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models)
      • [Residual Denoising Diffusion Models](#Residual Denoising Diffusion Models)
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
      • [PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models](#PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models)
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
      • [Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners](#Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners)
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
      • [EscherNet: A Generative Model for Scalable View Synthesis](#EscherNet: A Generative Model for Scalable View Synthesis)
  • 6.其他多任务(Others)
      • [InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks](#InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks)
      • [Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models](#Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models)
  • 参考
  • 相关整理

Awesome-CVPR2024-AIGC

A Collection of Papers and Codes for CVPR2024 AIGC

整理汇总下今年CVPR AIGC相关的论文和代码,具体如下。

欢迎star,fork和PR~
优先在Github更新Awesome-CVPR2024-AIGC,欢迎star~
知乎https://zhuanlan.zhihu.com/p/684325134

参考或转载请注明出处

CVPR2024官网:https://cvpr.thecvf.com/Conferences/2024

CVPR完整论文列表:

开会时间:2024年6月17日-6月21日

论文接收公布时间:

【Contents】

  • [1.图像生成(Image Generation/Image Synthesis)](#1.图像生成(Image Generation/Image Synthesis))
  • [2.图像编辑(Image Editing)](#2.图像编辑(Image Editing))
  • [3.视频生成(Video Generation/Image Synthesis)](#3.视频生成(Video Generation/Image Synthesis))
  • [4.视频编辑(Video Editing)](#4.视频编辑(Video Editing))
  • [5.3D生成(3D Generation/3D Synthesis)](#5.3D生成(3D Generation/3D Synthesis))
  • 6.其他多任务(Others)

1.图像生成(Image Generation/Image Synthesis)

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

InstanceDiffusion: Instance-level Control for Image Generation

Instruct-Imagen: Image Generation with Multi-modal Instruction

MACE: Mass Concept Erasure in Diffusion Models

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Residual Denoising Diffusion Models

2.图像编辑(Image Editing)

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

3.视频生成(Video Generation/Image Synthesis)

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

4.视频编辑(Video Editing)

5.3D生成(3D Generation/3D Synthesis)

EscherNet: A Generative Model for Scalable View Synthesis

6.其他多任务(Others)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

参考

CVPR 2024 论文和开源项目合集(Papers with Code)

相关整理

相关推荐
墨风如雪15 小时前
阿里Qwen3-VL双子星开源:图文视频混合检索的“降维打击”
aigc
格林威17 小时前
传送带上运动模糊图像复原:提升动态成像清晰度的 6 个核心方案,附 OpenCV+Halcon 实战代码!
人工智能·opencv·机器学习·计算机视觉·ai·halcon·工业相机
棒棒的皮皮18 小时前
【深度学习】YOLO模型速度优化Checklist
人工智能·深度学习·yolo·计算机视觉
EdisonZhou18 小时前
MAF快速入门(11)并行工作流
llm·aigc·agent·.net core
JQLvopkk19 小时前
智能AI“学习功能”在程序开发部分的逻辑
人工智能·机器学习·计算机视觉
职业码农NO.120 小时前
AI 技术栈完整解析,从 GPU 到应用的五层架构
人工智能·架构·系统架构·aigc·agent
TOPGUS20 小时前
黑帽GEO手法揭秘:AI搜索阴影下的新型搜索劫持与风险
人工智能·搜索引擎·chatgpt·aigc·谷歌·数字营销
狗狗学不会1 天前
视觉检测的新范式:从“像素感知”到“时序语义推理”—— 基于 Qwen3-VL 与时序拼图策略的通用事件检测系统
人工智能·计算机视觉·视觉检测
桂花饼1 天前
量化双雄争霸:九坤 IQuest-Coder-V1 的技术突破
人工智能·aigc·nano banana 2·openai兼容接口·claude opus 4.5·sora2 pro
undsky_1 天前
【n8n教程】:RSS Feed Trigger节点,玩转RSS订阅自动化
人工智能·ai·aigc·ai编程