1. 3D
- Rapid 3D Model Generation with Intuitive 3D Input
- Instantaneous Perception of Moving Objects in 3D
- NEAT: Distilling 3D Wireframes from Neural Attraction Fields
⭐code - Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
- LowRankOcc: Tensor Decomposition and Low-Rank Recovery for Vision-based 3D Semantic Occupancy Prediction
- TexOct: Generating Textures of 3D Models with Octree-based Diffusion
- Unsupervised 3D Structure Inference from Category-Specific Image Collections
- Garment Recovery with Shape and Deformation Priors
- ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
⭐code - CAGE: Controllable Articulation GEneration
⭐code
🏠project (3D) - Sparse views, Near light: A practical paradigm for uncalibrated point-light photometric stereo
- Dispersed Structured Light for Hyperspectral 3D Imaging
- G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping
⭐code - Wonder3D: Single Image to 3D using Cross-Domain Diffusion
🏠project - UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence
⭐code (garment manipulation) - GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
⭐code
⭐code - EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors
🏠project - MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation
🏠project - Digital Life Project: Autonomous 3D Characters with Social Intelligence
🏠project - Image Sculpting: Precise Object Editing with 3D Geometry Control
🏠project - TutteNet: Injective 3D Deformations by Composition of 2D Mesh Deformations
- Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
- GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
⭐code
🏠project - SHAP-EDITOR: Instruction-Guided Latent 3D Editing in Seconds
- ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images
- Differentiable Display Photometric Stereo
- ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
⭐code
🏠project - Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps
- REACTO: Reconstructing Articulated Objects from a Single Video
⭐code - Low-Latency Neural Stereo Streaming
- Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes
- Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation
🏠project - Wired Perspectives: Multi-View Wire Art Embraces Generative AI
⭐code
🏠project - Memory-based Adapters for Online 3D Scene Perception
⭐code - FastMAC: Stochastic Spectral Sampling of Correspondence Graph
⭐code - One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
⭐code
🏠project - PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
🏠project - CityDreamer: Compositional Generative Model of Unbounded 3D Cities
🏠project - EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
⭐code - Mosaic-SDF for 3D Generative Models
🏠project - Federated Online Adaptation for Deep Stereo
🏠project - ControlRoom3D: Room Generation using Semantic Proxy Rooms
- 3D Vision
- 3D Reconstruction
- 3D Neural Edge Reconstruction
- 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces
🏠project
📺video - PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video
- NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images
- NTO3D: Neural Target Object 3D Reconstruction with Segment Anything
- pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction
- ReconFusion: 3D Reconstruction with Diffusion Priors
- VGGSfM: Visual Geometry Grounded Deep Structure From Motion
⭐code
🏠project - Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction
🏠project - GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
- Coherence As Texture - Passive Textureless 3D Reconstruction by Self-interference
⭐code - Structure-Aware Sparse-View X-ray 3D Reconstruction
⭐code
👍How to give NeRF X-ray vision? - Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers
🏠project - Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
⭐code
🏠project (multi-view reconstruction) - PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar
🏠project - WonderJourney: Going from Anywhere to Everywhere
🏠project - Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments
⭐code
🏠project - DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans
- IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images
⭐code - Splatter Image: Ultra-Fast Single-View 3D Reconstruction
⭐code
🏠project - PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar
⭐code
🏠project - MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
⭐code - ZeroShape: Regression-based Zero-shot Shape Reconstruction
⭐code
🏠project - DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction
- G3DR: Generative 3D Reconstruction in ImageNet
⭐code
🏠project - 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces
⭐code - Bayesian Diffusion Models for 3D Shape Reconstruction
- RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction
- ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
🏠project (view 360° reconstruction)
- Surface Reconstruction
- SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration
- MVCPS-NeuS: Multi-view Constrained Photometric Stereo for Neural Surface Reconstruction
- MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video
⭐code
🏠project - UFORecon: Generalizable Sparse-View Surface Reconstruction from Arbitrary and UnFavOrable Data Sets
⭐code
- 3D Mesh Reconstruction
- 3D Shape
- GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors
- TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
⭐code - Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes
🏠project - ShapeWalk: Compositional Shape Editing Through Language-Guided Chains
⭐code
🏠project - Spectral Meets Spatial: Harmonising 3D Shape Matching and Interpolation
- Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
🏠project - FSC: Few-point Shape Completion
- 3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation
⭐code
🏠project (3D shape) - Category-Level Multi-Part Multi-Joint 3D Shape Assembly
- Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation
- Stereo Matching
- Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching
⭐code - LoS: Local Structure-Guided Stereo Matching
- Robust Synthetic-to-Real Transfer for Stereo Matching
- Adaptive Multi-Modal Cross-Entropy Loss for Stereo Matching
- Neural Markov Random Field for Stereo Matching
⭐code - Reusable Architecture Growth for Continual Stereo Matching
- MoCha-Stereo: Motif Channel Attention Network for Stereo Matching
⭐code
🏠project - Learning Intra-view and Cross-view Geometric Knowledge for Stereo Matching
- Surface Normal Estimation
- Feature Matching
- 3D Retrieval
- Depth Completion
- Flexible Depth Completion for Sparse and Varying Point Densities
- Improving Depth Completion via Depth Feature Upsampling
- Test-Time Adaptation for Depth Completion
- Bilateral Propagation Network for Depth Completion
- DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions
- Tri-Perspective View Decomposition for Geometry-Aware Depth Completion
⭐code
- Depth Estimation
- Cross-spectral Gated-RGB Stereo Depth Estimation
- Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation
- Depth Prompting for Sensor-Agnostic Depth Estimation
- Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion
⭐code - On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
🏠project - Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation
⭐code - PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
- Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
🏠project - Elite360D: Towards Efficient 360 Depth Estimation via Semantic- and Distance-Aware Bi-Projection Fusion
- ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
⭐code - From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior
- UniDepth: Universal Monocular Metric Depth Estimation
⭐code - WorDepth: Variational Language Prior for Monocular Depth Estimation
- SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing
- Snapshot Lidar: Fourier Embedding of Amplitude and Phase for Single-Image Depth Reconstruction
- Panoramic Localization
- 3D Keypoint Detection
- Layout Reconstruction
- CAD Reconstruction
- Shape Matching
- 3DGS
- COLMAP-Free 3D Gaussian Splatting
⭐code
🏠project - Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
- GS-IR: 3D Gaussian Splatting for Inverse Rendering
- FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization
🏠project - Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering
🏠project - GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
🏠project - Mip-Splatting: Alias-free 3D Gaussian Splatting
⭐code
🏠project - CoGS: Controllable Gaussian Splatting
⭐code
🏠project - LangSplat: 3D Language Gaussian Splatting
⭐code
🏠project - Compact 3D Gaussian Representation for Radiance Field
🏠project - 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
🏠project
- HUGS: Human Gaussian Splats
⭐code
🏠project - Multi-Scale 3D Gaussian Splatting for Anti-Aliased Rendering (3DGS)
- GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
- Scene Reconstruction
- Gated Fields: Learning Scene Reconstruction from Gated Videos
- Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses
- SuperPrimitive: Scene Reconstruction at a Primitive Level
🏠project - Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction
⭐code - Behind the Veil: Enhanced Indoor 3D Scene Reconstruction with Occluded Surfaces Completion
- OmniSDF: Scene Reconstruction using Omnidirectional Signed Distance Functions and Adaptive Binoctrees
- VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction
⭐code - Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction
⭐code
🏠project
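👍CVPR 2024 perfect-score paper: Zhejiang University proposes a new method for high-quality monocular dynamic reconstruction based on deformable 3D Gaussians - Polarization Wavefront Lidar: Learning Large Scene Reconstruction from Polarized Wavefronts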
🏠project
- 3D Scene Synthesis
- GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
🏠project - DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
⭐code - BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
🏠project (3D scene generation) - Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion
- Text-Driven 3D Scene Generation
- GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs
- 3D Scene Graphs
- 3D Scene Editing
- GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
🏠project - Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
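👍Precise 3D scene editing from text or image prompts: Meitu, the Institute of Information Engineering (CAS), Beihang University, and Sun Yat-sen University jointly propose the 3D editing method CustomNeRF - PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI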
⭐code - Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes
🏠project (3D scenes) - PAPR in Motion: Seamless Point-level 3D Scene Interpolation (3D scene interpolation)
- ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing
- Semantic Matching
- Indoor Lighting Estimation
- 3D Garment Generation
- 3D Shape Matching