1、Image Progress(图像处理)
- 去鬼影
- 去阴影
- 去模糊
- Unsupervised Blind Image Deblurring Based on Self-Enhancement
- Latency Correction for Event-guided Deblurring and Frame Interpolation
- LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network
- ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation
- Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains
⭐code - AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring
⭐code
⭐code - A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning
⭐code
- 去雾
- 去噪
- Real-World Mobile Image Denoising Dataset with Efficient Baselines
- GenesisTex: Adapting Image Denoising Diffusion to Texture Space
- Robust Image Denoising through Adversarial Frequency Mixup
- Exploring Efficient Asymmetric Blind-Spots for Self-Supervised Denoising in Real-World Scenarios
- Masked and Shuffled Blind Spot Denoising for Real-World Images
- LAN: Learning to Adapt Noise for Image Denoising
- Unmixing Diffusion for Self-Supervised Hyperspectral Image Denoising
- Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation
- Transfer CLIP for Generalizable Image Denoising
- Residual Denoising Diffusion Models
⭐code - Equivariant plug-and-play image reconstruction
⭐code - Patch2Self2: Self-supervised Denoising on Coresets via Matrix Sketching
- Hyper-MD: Mesh Denoising with Customized Parameters Aware of Noise Intensity and Geometric Characteristics
- Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images
👍中文简介
- 去雨
- 去反射
- 修图
- 图像增强
- Color Shift Estimation-and-Correction for Image Enhancement
- FlowIE:Efficient Image Enhancement via Rectified Flow
- Fourier Priors-Guided Diffusion for Zero-Shot Joint Low-Light Enhancement and Deblurring
- Specularity Factorization for Low-Light Enhancement
- Zero-Reference Low-Light Enhancement via Physical Quadruple Priors
⭐code - Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach
⭐code - Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance
- Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
- 图像恢复
- Learning Diffusion Texture Priors for Image Restoration
- CoDe: An Explicit Content Decoupling Framework for Image Restoration
- Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration
- Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
- Look-Up Table Compression for Efficient Image Restoration
- HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models
- DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
- Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance
⭐code - Deep Equilibrium Diffusion Restoration with Parallel Sampling
⭐code - Distilling Semantic Priors from SAM to Efficient Image Restoration Models
- Boosting Image Restoration via Priors from Pre-trained Models
- Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration
- Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
⭐code - Restoration by Generation with Constrained Priors
🏠project - Multimodal Prompt Perceiver: Empower Adaptiveness Generalizability and Fidelity for All-in-One Image Restoration
- Improving Image Restoration through Removing Degradations in Textual Representations
⭐code
- 图像修复
- Brush2Prompt: Contextual Prompt Generator for Object Inpainting
- Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
- NeRFiller: Completing Scenes via Generative 3D Inpainting
- MVIP-NeRF: Multi-view 3D Inpainting on NeRF Scenes via Diffusion Prior3D 修复
- Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
⭐code
- 图像超级补全
- 图像质量
- Blind Image Quality Assessment Based on Geometric Order Learning
- Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization
- Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment
- TextCraftor: Your Text Encoder Can be Image Quality Controller
- Boosting Image Quality Assessment through Efficient Transformer Adaptation with Local Feature Enhancement
- 恶劣天气消除
- 大气湍流去除
- Image Portrait Relighting(图像重照光)
- 图片缩小
- 图像校正
- 图像着色
- 运动(去)模糊
- Motion Blur Decomposition with Cross-shutter Guidance
- Spike-guided Motion Deblurring with Unknown Modal Spatiotemporal Alignment
⭐code - Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization
- Efficient Multi-scale Network with Learnable Discrete Wavelet Transform for Blind Motion Deblurring
- Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring
⭐code
- 视频修复
- 视频去雾
- 视频去渲染
- 视频去模糊
- Frequency-aware Event-based Video Deblurring for Real-World Motion Blur
- Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring
⭐code
🏠project - FMA-Net: Flow Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring
⭐code
🏠project - DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
🏠project
- 视频增强
- 视频质量评估
- PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
- Learned Scanpaths Aid Blind Panoramic Video Quality Assessment
- Modular Blind Video Quality Assessment
- KVQ: Kwai Video Quality Assessment for Short-form Videos
- CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement
⭐code
- 夜间颜色恒定
- 照明估计
2、Image Segmentation(图像分割)
- Matching Anything by Segmenting Anything
⭐code - Unsupervised Universal Image Segmentation
- MESA: Matching Everything by Segmenting Anything
- MRFS: Mutually Reinforcing Image Fusion and Segmentation
- RobustSAM: Segment Anything Robustly on Degraded Images
- Hierarchical Histogram Threshold Segmentation - Auto-terminating High-detail Oversegmentation
- Multi-Space Alignments Towards Universal LiDAR Segmentation
- CoralSCOP: Segment any COral Image on this Planet分割
- SANeRF-HQ: Segment Anything for NeRF in High Quality
🏠project - ASAM: Boosting Segment Anything Model with Adversarial Tuning
- ODIN: A Single Model for 2D and 3D Segmentation
⭐code - FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
👍摘要 - EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
- Universal Segmentation at Arbitrary Granularity with Language Instruction通用分割
- Segment and Caption Anything
🏠project - COCONut: Modernizing COCO Segmentation
⭐code - Multi-view Aggregation Network for Dichotomous Image Segmentation
⭐code - OMG-Seg: Is One Model Good Enough For All Segmentation?
🏠project - Unsegment Anything by Simulating Deformation
- BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model
⭐code - VRP-SAM: SAM with Visual Reference Prompt
- PEM: Prototype-based Efficient MaskFormer for Image Segmentation
- Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM
⭐code - CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
🏠project - Benchmarking Segmentation Models with Mask-Preserved Attribute Editing
⭐code - CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers
- Continual Segmentation with Disentangled Objectness Learning and Class Recognition
⭐code - Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms
⭐code - Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
⭐code
🏠project - Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
- A Simple Recipe for Language-guided Domain Generalized Segmentation
🏠project - Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts
⭐code - Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation
⭐code
👍分割一切模型SAM泛化能力差?域适应策略给解决了 - 开放词汇分割
- Transferable and Principled Efficiency for Open-Vocabulary Segmentation
- USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation
- Open-Vocabulary Segmentation with Semantic-Assisted Calibration
⭐code - OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation
- 视频分割
- UniVS: Unified and Universal Video Segmentation with Prompts as Queries
⭐code - Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence
🏠project视频分割 - Learning to Segment Referred Objects from Narrated Egocentric Videos
- Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
⭐code
- UniVS: Unified and Universal Video Segmentation with Prompts as Queries
- 语义分割
- Open Set Domain Adaptation for Semantic Segmentation
- ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention
- MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation
- TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
- ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers
- HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation
- Contextrast: Contextual Contrastive Learning for Semantic Segmentation
- Open-Set Domain Adaptation for Semantic Segmentation
- SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
⭐code - Frequency-Adaptive Dilated Convolution for Semantic Segmentation
⭐code - GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation
- Improving Bird's Eye View Semantic Segmentation by Task Decomposition
⭐code - UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
- Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
- Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball
- 3D 语义分割
- 点云语义分割
- 无监督语义分割
- 小样本语义分割
- 零样本语义分割
- 半监督语义分割
- Training Vision Transformers for Semi-Supervised Semantic Segmentation
- Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling
- AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation
⭐code - CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
⭐code - Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation
⭐code - RankMatch: Exploring the Better Consistency Regularization for Semi-supervised Semantic Segmentation
- 弱监督语义分割
- Class Tokens Infusion for Weakly Supervised Semantic Segmentation
- Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
- DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation
⭐code - Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation
⭐code - Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation
⭐code - PSDPM: Prototype-based Secondary Discriminative Pixels Mining for Weakly Supervised Semantic Segmentation
- From SAM to CAMs: Exploring Segment Anything Model for Weakly Supervised Semantic Segmentation
- 域泛化语义分割
- Collaborating Foundation Models for Domain Generalized Semantic Segmentation
⭐code - Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning
⭐code - Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
⭐code
- Collaborating Foundation Models for Domain Generalized Semantic Segmentation
- 文本监督语义分割
- 开放世界语义分割
- 开放词汇语义分割
- Open-Vocabulary 3D Semantic Segmentation with Foundation Models
- Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
- CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
- Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation
- SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
⭐code - Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
⭐code
- 全景分割
- Semantics Distortion and Style Matter: Towards Source-free UDA for Panoramic Segmentation
- ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
⭐code - PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
⭐code - Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
⭐code
- 实例分割
- Extreme Point Supervised Instance Segmentation
- Mudslide: A Universal Nuclear Instance Segmentation Method
- Semantic-aware SAM for Point-Prompted Instance Segmentation
- SAI3D: Segment Any Instance in 3D Scenes
🏠project - DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
- FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures
⭐code - Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge
- 开放词汇实例分割
- 3D 实例分割
- BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
⭐code - Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
- Edge-Aware 3D Instance Segmentation Network with Intelligent Semantic Prior
- UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes
- BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation
- 场景分割
- 动作分割
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos
- Coherent Temporal Synthesis for Incremental Action Segmentation
- Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment
- Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation
- FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation
⭐code
- 参考图像分割
- 指代表达式分割
- VOS
- Point-VOS: Pointing Up Video Object Segmentation
🏠project - Dual Prototype Attention for Unsupervised Video Object Segmentation
⭐code - Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
⭐code - Putting the Object Back into Video Object Segmentation
🏠project - Event-assisted Low-Light Video Object Segmentation
- Guided Slot Attention for Unsupervised Video Object Segmentation
- LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
⭐code - RMem: Restricted Memory Banks Improve Video Object Segmentation
- Point-VOS: Pointing Up Video Object Segmentation
- VSS
- VIS
- 抠图
- 少样本分割
- Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation
- Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation
⭐code - Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
- Adapt Before Comparison: A New Perspective on Cross-Domain Few-Shot Segmentation
- LLaFS: When Large Language Models Meet Few-Shot Segmentation
- Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
⭐code - Addressing Background Context Bias in Few-Shot Segmentation through Iterative Modulation
- 零样本分割
- 裂纹分割
- 交互式分割
- 无模态分割
- 3D 分割