1 Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks . In Advances in neural information processing systems, pages 1097--1105, 2012. 1
2 Jia Li, Yafei Song, Jianfeng Zhu, Lele Cheng, Ying Su, Lin Ye, Pengcheng Yuan, and Shumin Han. Learning from large scale noisy web data with ubiquitous reweighting for image classification . IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019. 1
3 Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster r-cnn: Towards real-time object detection with region proposal networks . In Advances in neural information processing systems, pages 91--99, 2015. 1
4 Ramprasaath R Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. Grad-cam: Visual explanations from deep networks via gradient-based localization . In Proceedings of the IEEE international conference on computer vision, pages 618--626, 2017. 1
5 Tianzhu Zhang, Changsheng Xu, and Ming-Hsuan Yang. Multi-task correlation particle filter for robust object tracking . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4335--4343, 2017. 1
6 Karen Simonyan and Andrew Zisserman. Two-stream convolutional networks for action recognition in videos . In Advances in neural information processing systems, pages 568--576, 2014. 1
7 Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs . IEEE transactions on pattern analysis and machine intelligence, 40(4):834--848, 2017. 1
8 Liang-Chieh Chen, George Papandreou, Florian Schroff, and Hartwig Adam. Rethinking atrous convolution for semantic image segmentation . arXiv preprint arXiv:1706.05587, \2017. 1, 4, 5
9 Ali Borji, Ming-Ming Cheng, Qibin Hou, Huaizu Jiang, and Jia Li. Salient object detection: A survey . Computational visual media, pages 1--34, 2019. 1
10 Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, and Xiang Bai. Richer convolutional features for edge detection . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3000--3009, 2017. 1
11 Barret Zoph and Quoc V Le. Neural architecture search with reinforcement learning . arXiv preprint arXiv:1611.01578, \2016. 2
12 Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition . arXiv preprint arXiv:1409.1556, 2014. 2
13 Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1--9, 2015. 2
14 Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco An dreetto, and Hartwig Adam. Mobilenets: Efficient convolutional neural networks for mobile vision applications . arXiv preprint arXiv:1704.04861, 2017. 2, 3, 4
15 Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. Mobilenetv2: Inverted residuals and linear bottlenecks . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4510--4520, 2018. 2, 4
16 Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, and Jian Sun. Shufflenet: An extremely efficient convolutional neural network for mobile devices . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 6848--6856, 2018. 2
17 Ningning Ma, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. Shufflenet v2: Practical guidelines for efficient cnn architecture design . In Proceedings of the European conference on computer vision (ECCV), pages 116--131, 2018. 2, 4
18 Kai Han, Yunhe Wang, Qi Tian, Jianyuan Guo, Chunjing Xu, and Chang Xu. Ghostnet: More features from cheap operations . In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1580--1589, 2020. 2
19 Mingxing Tan and Quoc V Le. Efficientnet: Rethinking model scaling for convolutional neural networks . arXiv preprint arXiv:1905.11946, 2019. 2, 3
20 Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, et al. Searching for mobilenetv3 . In Proceedings of the IEEE International Conference on Computer Vision, pages 1314--1324, 2019. 2, 3, 4,5
21 Bichen Wu, Xiaoliang Dai, Peizhao Zhang, Yanghan Wang, Fei Sun, Yiming Wu, Yuandong Tian, Peter Vajda, Yangqing Jia, and Kurt Keutzer. Fbnet: Hardware-aware efficient convnet design via differentiable neural architecture search . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 10734--10742, 2019. 2
22 Changlin Li, Jiefeng Peng, Liuchun Yuan, Guangrun Wang, Xiaodan Liang, Liang Lin, and Xiaojun Chang. Blockwisely supervised neural architecture search with knowledge distillation . In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1989--1998, 2020. 2
23 Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. Once-for-all: Train one network and specialize it for efficient deployment . arXiv preprint arXiv:1908.09791, \2019. 2
24 Mingxing Tan and Quoc V Le. Mixconv: Mixed depthwise convolutional kernels . arXiv preprint arXiv:1907.09595, \2019. 2, 3
25 Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770--778, 2016. 2
26 Jie Hu, Li Shen, and Gang Sun. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7132--7141, 2018. 3,5, 6
27 Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248--255. Ieee, 2009. 3, 4
28 Cheng Cui, Ruoyu Guo, Yuning Du, Dongliang He, Fu Li, Zewu Wu, Qiwen Liu, Shilei Wen, Jizhou Huang, Xiaoguang Hu, et al. Beyond self-supervision: A simple yet effective network distillation alternative to improve backbones . arXiv preprint arXiv:2103.05959, 2021. 4, 5
29 PaddlePaddle Authors. Paddledetection, object detection and instance segmentation toolkit based on paddlepaddle . github.com/PaddlePaddl..., 2019. 4
30 Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll´ar, and C Lawrence Zitnick. Microsoft coco: Common objects in context . In European conference on computer vision, pages 740--755. Springer, 2014. 5
31 Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. The cityscapes dataset for semantic urban scene understanding . In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3213--3223, 2016. 5
实践
任务:分类图片中是否有人还是无人,先git clone paddleclas项目,然后进入项目;
环境安装
安装paddlepaddle
shell复制代码
# CPU only
python3 -m pip install paddlepaddle==2.5.2 -i https://pypi.tuna.tsinghua.edu.cn/simple
# CUDA 10.2
python3 -m pip install paddlepaddle-gpu==2.5.2 -i https://pypi.tuna.tsinghua.edu.cn/simple
# CUDA 11.2
python3 -m pip install paddlepaddle-gpu==2.5.2.post112 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# CUDA 11.6
python3 -m pip install paddlepaddle-gpu==2.5.2.post116 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# CUDA 11.7
python3 -m pip install paddlepaddle-gpu==2.5.2.post117 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html
# CUDA 12.0
python3 -m pip install paddlepaddle-gpu==2.5.2.post120 -f https://www.paddlepaddle.org.cn/whl/linux/mkl/avx/stable.html