Computer Vision Open-Source Code Roundup

1.【Backbone Architecture】Regularization of polynomial networks for image recognition

Paper: https://arxiv.org/pdf/2303.13896.pdf

Code: https://github.com/grigorisg9gr/regularized_polynomials

2.【Object Detection: Domain Adaptation】2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection

Paper: https://arxiv.org/pdf/2303.13853.pdf

Code: https://github.com/mecarill/2pcnet

4.【Object Tracking: Dataset】ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data

Paper: https://arxiv.org/pdf/2303.13885.pdf

* Project page: https://arkittrack.github.io/

* Code (coming soon): https://github.com/lawrence-cj/ARKitTrack

5.【Anomaly Detection】Anomaly Detection under Distribution Shift

* Paper: https://arxiv.org/pdf/2303.13845.pdf

* Code (coming soon): https://github.com/mala-lab/ADShift

7.【Vision-Based 3D Object Detection】MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation

* Paper: https://arxiv.org/pdf/2303.13561.pdf

* Code coming soon

8.【3D Object Detection】Collaboration Helps Camera Overtake LiDAR in 3D Detection

* Paper: https://arxiv.org/pdf/2303.13560.pdf

* Code (coming soon): https://github.com/MediaBrain-SJTU/CoCa3D


9.【Medical Image Segmentation: Semi-Supervised】Inherent Consistent Learning for Accurate Semi-supervised Medical Image Segmentation

* Paper: https://arxiv.org/pdf/2303.14175.pdf

* Code: https://github.com/zhuye98/ICL.git


10.【Medical Image Segmentation: Few-Shot】Few Shot Medical Image Segmentation with Cross Attention Transformer

* Paper: https://arxiv.org/pdf/2303.13867.pdf

* Code coming soon

11.【3D Reconstruction】BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

* Paper: https://arxiv.org/pdf/2303.14158.pdf

* Project page: https://bundlesdf.github.io/

* Code coming soon

12.【Face Reconstruction】NeuFace: Realistic 3D Neural Face Rendering from Multi-view Images

* Paper: https://arxiv.org/pdf/2303.14092.pdf

* Code: https://github.com/aejion/NeuFace

13.【Class-Incremental Learning】Class-Incremental Exemplar Compression for Class-Incremental Learning

* Paper: https://arxiv.org/pdf/2303.14042.pdf

* Code (coming soon): https://github.com/xfflzl/CIM-CIL

14.【Autonomous Driving: Scene Completion】StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion

* Paper: https://arxiv.org/pdf/2303.13959.pdf

* Code: https://github.com/Arlo0o/StereoScene

![image.png](https://upload-images.jianshu.io/upload_images/18639252-7055375cffd8f3ab.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

![image.png](https://upload-images.jianshu.io/upload_images/18639252-305205771b4acacb.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

15.【Class-Incremental Learning】Two-level Graph Network for Few-Shot Class-Incremental Learning

* Paper: https://arxiv.org/pdf/2303.13862.pdf

* Code (coming soon): https://github.com/sukechenhao/SCGN

![image.png](https://upload-images.jianshu.io/upload_images/18639252-32d38fe1cc32b4db.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

16.【Neural Network Quantization】Hard Sample Matters a Lot in Zero-Shot Quantization

* Paper: https://arxiv.org/pdf/2303.13826.pdf

* Code: https://github.com/lihuantong/HAST

![image.png](https://upload-images.jianshu.io/upload_images/18639252-f6af27639c82f5df.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

20.【Pose Estimation】NOPE: Novel Object Pose Estimation from a Single Image

* Paper: https://arxiv.org/pdf/2303.13612.pdf

* Code (coming soon): https://github.com/nv-nguyen/nope

![image.png](https://upload-images.jianshu.io/upload_images/18639252-1ecc94a9182edd54.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

21.【Medical Imaging】Leveraging Old Knowledge to Continually Learn New Classes in Medical Images

* Paper: https://arxiv.org/pdf/2303.13752.pdf

* Code: https://github.com/EvelynChee/LO2LN

![image.png](https://upload-images.jianshu.io/upload_images/18639252-b2998e6205619f2e.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

22.【Point Cloud Anomaly Detection】Complementary Pseudo Multimodal Feature for Point Cloud Anomaly Detection

* Paper: https://arxiv.org/ftp/arxiv/papers/2303/2303.13194.pdf

* Code (coming soon): https://github.com/caoyunkang/CPMF

![image.png](https://upload-images.jianshu.io/upload_images/18639252-f3cdcf9ced296900.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

18.【Point Cloud Segmentation】Position-Guided Point Cloud Panoptic Segmentation Transformer

* Paper: https://arxiv.org/pdf/2303.13509.pdf

* Code (coming soon): https://github.com/SmartBot-PJLab/P3Former

![image.png](https://upload-images.jianshu.io/upload_images/18639252-679138cbf3b9874a.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

17.【Point Cloud 3D Object Detection: Self-Supervised Pre-Training】MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training

* Paper: https://arxiv.org/pdf/2303.13510.pdf

* Code (coming soon): https://github.com/SmartBot-PJLab/MV-JAR

![image.png](https://upload-images.jianshu.io/upload_images/18639252-2ca7ba07786c9809.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

15.【Point Cloud: Self-Supervised Learning】PointGame: Geometrically and Adaptively Masked Auto-Encoder on Point Clouds

* Paper: https://arxiv.org/pdf/2303.13100.pdf

![image.png](https://upload-images.jianshu.io/upload_images/18639252-9af4031dcef1d9ca.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

7.【Anomaly Detection: Medical Imaging】Confidence-Aware and Self-Supervised Image Anomaly Localisation

* Paper: https://arxiv.org/pdf/2303.13227.pdf

* Code coming soon

![image.png](https://upload-images.jianshu.io/upload_images/18639252-c07081abdd2ea080.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)

2.【Action Recognition】A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition

* Paper: https://arxiv.org/pdf/2303.13505.pdf

* Code: https://github.com/AndongDeng/BEAR

![image.png](https://upload-images.jianshu.io/upload_images/18639252-b82c91d967467220.png?imageMogr2/auto-orient/strip|imageView2/2/w/1240)
