CVPR 2024 | AIGC (Image Generation, Video Generation, etc.) Paper Collection (with Paper Links / Open-Source Code / Analysis) [Continuously Updated]

CVPR 2024 | AIGC paper collection (if you find it helpful, likes and bookmarks are welcome)

  • Awesome-CVPR2024-AIGC
  • [1. Image Generation / Image Synthesis](#1.图像生成(Image Generation/Image Synthesis))
      • [ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations](#ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations)
      • [InstanceDiffusion: Instance-level Control for Image Generation](#InstanceDiffusion: Instance-level Control for Image Generation)
      • [Instruct-Imagen: Image Generation with Multi-modal Instruction](#Instruct-Imagen: Image Generation with Multi-modal Instruction)
      • [MACE: Mass Concept Erasure in Diffusion Models](#MACE: Mass Concept Erasure in Diffusion Models)
      • [PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models](#PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models)
      • [Residual Denoising Diffusion Models](#Residual Denoising Diffusion Models)
  • [2. Image Editing](#2.图像编辑(Image Editing))
      • [PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models](#PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models)
  • [3. Video Generation / Video Synthesis](#3.视频生成(Video Generation/Image Synthesis))
      • [Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners](#Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners)
  • [4. Video Editing](#4.视频编辑(Video Editing))
  • [5. 3D Generation / 3D Synthesis](#5.3D生成(3D Generation/3D Synthesis))
      • [EscherNet: A Generative Model for Scalable View Synthesis](#EscherNet: A Generative Model for Scalable View Synthesis)
  • 6. Other Multi-Task Work (Others)
      • [InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks](#InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks)
      • [Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models](#Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models)
  • References
  • Related Collections

Awesome-CVPR2024-AIGC

A Collection of Papers and Codes for CVPR2024 AIGC

This is a curated collection of this year's CVPR papers and code related to AIGC, listed below.

Stars, forks, and PRs are welcome.
Updates land first on GitHub at Awesome-CVPR2024-AIGC; stars are welcome.
Zhihu: https://zhuanlan.zhihu.com/p/684325134

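Please credit the source when referencing or reposting.

CVPR 2024 official website: https://cvpr.thecvf.com/Conferences/2024

Full list of CVPR papers:

Conference dates: June 17-21, 2024

Paper acceptance announcement date: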

【Contents】

  • [1. Image Generation / Image Synthesis](#1.图像生成(Image Generation/Image Synthesis))
  • [2. Image Editing](#2.图像编辑(Image Editing))
  • [3. Video Generation / Video Synthesis](#3.视频生成(Video Generation/Image Synthesis))
  • [4. Video Editing](#4.视频编辑(Video Editing))
  • [5. 3D Generation / 3D Synthesis](#5.3D生成(3D Generation/3D Synthesis))
  • 6. Other Multi-Task Work (Others)

1. Image Generation / Image Synthesis

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations

InstanceDiffusion: Instance-level Control for Image Generation

Instruct-Imagen: Image Generation with Multi-modal Instruction

MACE: Mass Concept Erasure in Diffusion Models

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Residual Denoising Diffusion Models

2. Image Editing

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models

3. Video Generation / Video Synthesis

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

4. Video Editing

5. 3D Generation / 3D Synthesis

EscherNet: A Generative Model for Scalable View Synthesis

6. Other Multi-Task Work (Others)

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

References

CVPR 2024 Papers and Open-Source Projects Collection (Papers with Code)

Related Collections
