8.16 Model Notes

Table of Contents

  • [Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation(ECCV2018)](#Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation(ECCV2018))
  • [Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning(2016)](#Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning(2016))
  • [Wide Residual Networks(2017)](#Wide Residual Networks(2017))
  • [mixup: Beyond Empirical Risk Minimization(ICLR2018)](#mixup: Beyond Empirical Risk Minimization(ICLR2018))
  • [Swin Transformer: Hierarchical Vision Transformer using Shifted Windows](#Swin Transformer: Hierarchical Vision Transformer using Shifted Windows)
  • [Pyramid Scene Parsing Network(2017)](#Pyramid Scene Parsing Network(2017))
  • [Searching for MobileNetV3(2019)](#Searching for MobileNetV3(2019))
  • [SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size(2016)](#SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size(2016))
  • [Identity Mappings in Deep Residual Networks(2016)](#Identity Mappings in Deep Residual Networks(2016))
  • [Aggregated Residual Transformations for Deep Neural Networks](#Aggregated Residual Transformations for Deep Neural Networks)
  • [MLP-Mixer: An all-MLP Architecture for Vision(2021)](#MLP-Mixer: An all-MLP Architecture for Vision(2021))
  • [MOCO:Momentum Contrast for Unsupervised Visual Representation Learning](#MOCO:Momentum Contrast for Unsupervised Visual Representation Learning)
  • [A ConvNet for the 2020s](#A ConvNet for the 2020s)
  • [MAE:Masked Autoencoders Are Scalable Vision Learners](#MAE:Masked Autoencoders Are Scalable Vision Learners)
  • [Xception: Deep Learning with Depthwise Separable Convolutions](#Xception: Deep Learning with Depthwise Separable Convolutions)
  • [CLIP:Learning Transferable Visual Models From Natural Language Supervision](#CLIP:Learning Transferable Visual Models From Natural Language Supervision)
  • [ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices](#ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices)
  • [ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design](#ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design)
  • [ResNeSt: Split-Attention Networks](#ResNeSt: Split-Attention Networks)

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation(ECCV2018)

Method

Code link

DeepLabV3+ architecture

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning(2016)

Method



















Wide Residual Networks(2017)

Method

Code link

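My impression is that little changes structurally: the residual blocks are kept and simply made wider (more channels per layer).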

mixup: Beyond Empirical Risk Minimization(ICLR2018)

Method

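The main things to look at in the code are `lam` and `alpha`: `alpha` is the parameter of the Beta distribution, and `lam` is the mixing coefficient sampled from it.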
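To make the roles of `lam` and `alpha` concrete, here is a minimal sketch in plain Python. It assumes one-hot labels stored as lists; the function name `mixup_pair` and the default `alpha=0.2` are illustrative choices, not taken from the reference code.

```python
import random

def mixup_pair(x_a, y_a, x_b, y_b, alpha=0.2, rng=random):
    """Blend two examples; y_a and y_b are one-hot label lists."""
    # lam ~ Beta(alpha, alpha); alpha <= 0 disables mixing (lam = 1).
    lam = rng.betavariate(alpha, alpha) if alpha > 0 else 1.0
    # The same coefficient mixes both the inputs and the labels.
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y, lam
```

A small `alpha` pushes `lam` toward 0 or 1, so most mixed samples stay close to one of the two originals; `alpha = 1` makes `lam` uniform on [0, 1].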

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Method



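Essentially a windowed version of ViT: self-attention is computed inside local windows, and the windows are shifted between consecutive layers.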
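The window idea can be sketched on a small grid of token ids. This is an illustration in plain Python, assuming a 4 x 4 grid and a window size of 2; the helper names are hypothetical, and the attention mask that the real model applies to wrapped-around windows is left out.

```python
def window_partition(grid, m):
    """Split an H x W grid (list of rows) into non-overlapping m x m windows."""
    h, w = len(grid), len(grid[0])
    assert h % m == 0 and w % m == 0, "grid must be divisible by window size"
    windows = []
    for top in range(0, h, m):
        for left in range(0, w, m):
            windows.append([grid[r][left:left + m] for r in range(top, top + m)])
    return windows

def cyclic_shift(grid, s):
    """Roll the grid up and left by s positions with wrap-around."""
    rows = grid[s:] + grid[:s]
    return [row[s:] + row[:s] for row in rows]

# 4 x 4 grid of token ids, window size 2, shift of m // 2 = 1.
grid = [[r * 4 + c for c in range(4)] for r in range(4)]
regular = window_partition(grid, 2)
shifted = window_partition(cyclic_shift(grid, 1), 2)
```

In `regular`, token 5 shares a window with tokens 0, 1 and 4; in `shifted`, it shares one with 6, 9 and 10. Alternating the two layouts is what lets information cross window borders.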

Pyramid Scene Parsing Network(2017)


Searching for MobileNetV3(2019)

Method

This paper is about neural architecture search (NAS).

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size(2016)

Method

Identity Mappings in Deep Residual Networks(2016)

Method

The paper analyzes a range of skip-connection variants.




Aggregated Residual Transformations for Deep Neural Networks

Method



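In effect, this reduces the parameter count: splitting a convolution into groups means each group only connects to its own slice of the channels.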
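The parameter saving is easy to check with arithmetic. The sketch below counts the weights of a dense 3 x 3 convolution versus a grouped one; the width of 128 and the 32 groups are illustrative numbers, and `conv_params` is a hypothetical helper.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Weight count of a k x k convolution (bias ignored)."""
    assert c_in % groups == 0 and c_out % groups == 0
    # Each output channel only sees c_in / groups input channels.
    return (c_in // groups) * c_out * k * k

dense = conv_params(128, 128, 3)              # 128 * 128 * 9
grouped = conv_params(128, 128, 3, groups=32)  # 4 * 128 * 9
```

At the same width, the grouped layer needs `groups` times fewer weights, which is the budget a grouped design can spend on a wider block.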

MLP-Mixer: An all-MLP Architecture for Vision(2021)

Two kinds of mixing: token mixing (across patches) and channel mixing (across features).
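A stripped-down sketch of one block shows how the two mixing steps differ only in which axis the MLP sees. It assumes the input is a (tokens, channels) table stored as nested lists; layer normalization is omitted for brevity, and all function and weight names are hypothetical.

```python
import math

def gelu(v):
    return 0.5 * v * (1.0 + math.erf(v / math.sqrt(2.0)))

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mlp(x, w1, w2):
    """Two-layer MLP applied to each row of x: gelu(x @ w1) @ w2."""
    hidden = [[gelu(v) for v in row] for row in matmul(x, w1)]
    return matmul(hidden, w2)

def mixer_block(x, tok_w1, tok_w2, ch_w1, ch_w2):
    """x has shape (tokens, channels); both steps use a residual connection."""
    # Token mixing: transpose so the MLP mixes across tokens, per channel.
    x = add(x, transpose(mlp(transpose(x), tok_w1, tok_w2)))
    # Channel mixing: the MLP mixes across channels, per token.
    return add(x, mlp(x, ch_w1, ch_w2))
```

With S tokens and C channels, `tok_w1` has S rows and `tok_w2` has S columns, while `ch_w1` has C rows and `ch_w2` has C columns.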

MOCO:Momentum Contrast for Unsupervised Visual Representation Learning

Contrastive methods differ in the structure they use to store keys; MoCo uses a queue.
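The queue, plus the momentum update that gives the method its name, can be sketched in a few lines. This is a toy illustration assuming keys are arbitrary Python objects and encoder parameters are flat lists of floats; `KeyQueue` and `momentum_update` are hypothetical names.

```python
from collections import deque

class KeyQueue:
    """FIFO store of encoded keys that serve as negatives."""

    def __init__(self, max_size):
        # With maxlen set, the oldest keys fall off as new ones arrive.
        self.keys = deque(maxlen=max_size)

    def enqueue(self, batch_keys):
        self.keys.extend(batch_keys)

def momentum_update(key_params, query_params, m=0.999):
    """theta_k <- m * theta_k + (1 - m) * theta_q, element-wise."""
    return [m * k + (1.0 - m) * q for k, q in zip(key_params, query_params)]
```

Because the queue is decoupled from the batch, the number of negatives can be much larger than the batch size, and the slowly moving key encoder keeps old and new keys comparable.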

A ConvNet for the 2020s

Convolution pushed to its limit.

MAE:Masked Autoencoders Are Scalable Vision Learners

Similar to BERT: a self-supervised method that masks part of the input and learns to predict the masked part.
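The masking step itself is simple to sketch. The snippet below assumes the image has already been split into `num_patches` patches and uses a mask ratio of 0.75; `random_masking` is a hypothetical helper that only returns patch indices.

```python
import random

def random_masking(num_patches, mask_ratio=0.75, rng=random):
    """Return (visible_ids, masked_ids), both sorted."""
    num_masked = int(num_patches * mask_ratio)
    masked = set(rng.sample(range(num_patches), num_masked))
    visible = [i for i in range(num_patches) if i not in masked]
    return visible, sorted(masked)
```

For example, `random_masking(196)` keeps 49 patches visible and hides 147. Only the visible ones go through the encoder, and the decoder is trained to reconstruct the hidden ones.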

Xception: Deep Learning with Depthwise Separable Convolutions

Method


CLIP:Learning Transferable Visual Models From Natural Language Supervision

Method

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

Method

Grouped convolution followed by a channel shuffle, which mixes information across the groups.
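The shuffle is just a reshape, transpose, and flatten over the channel axis. A minimal sketch on a flat list of channel ids, with `channel_shuffle` as a hypothetical helper name:

```python
def channel_shuffle(channels, groups):
    """Reorder a flat channel list: reshape to (groups, n), transpose, flatten."""
    assert len(channels) % groups == 0
    n = len(channels) // groups
    grouped = [channels[g * n:(g + 1) * n] for g in range(groups)]
    # Take the i-th channel of every group in turn.
    return [grouped[g][i] for i in range(n) for g in range(groups)]
```

With two groups, `channel_shuffle([0, 1, 2, 3, 4, 5], 2)` interleaves the halves, so the next grouped convolution sees channels from both groups in each of its groups.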

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Method

ResNeSt: Split-Attention Networks

Method


