【实例分割｜Detectron2】在主干网络 ResNet50 中添加 SE 注意力模块

2023-05-16

文章目录

在 Detectron2 中使用 SE 模块
在 Mask2Former 中使用 SE 模块

在 Detectron2 中使用 SE 模块

直接在 detectron2/detectron2/modeling/backbone/resnet.py 中添加 SELayer ，若想自己新建一个文件，重新写模型类，这样 fpn.py 等文件中的 build_resnet_backbone 等函数均要修改，担心自己漏了，所以直接在原文件上添加了。

其中 SENet-Pytorch 的代码参考 https://github.com/moskomule/senet.pytorch/blob/master/senet。

__all__ = [
    "SELayer",
    "ResNetBlockBase",
    "BasicBlock",
    "BottleneckBlock",
    "DeformBottleneckBlock",
    "BasicStem",
    "ResNet",
    "make_stage",
    "build_resnet_backbone",
]

class SELayer(nn.Module):
    def __init__(self, channel, reduction=16):
        super(SELayer, self).__init__()
        self.avg_pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(
            nn.Linear(channel, channel // reduction, bias=False),
            nn.ReLU(inplace=True),
            nn.Linear(channel // reduction, channel, bias=False),
            nn.Sigmoid()
        )

    def forward(self, x):
        b, c, _, _ = x.size()
        y = self.avg_pool(x).view(b, c)
        print(f"shape: {y.shape}")
        y = self.fc(y).view(b, c, 1, 1)
        return x * y.expand_as(x)

class BottleneckBlock(CNNBlockBase):
    """
    The standard bottleneck residual block used by ResNet-50, 101 and 152
    defined in :paper:`ResNet`.  It contains 3 conv layers with kernels
    1x1, 3x3, 1x1, and a projection shortcut if needed.
    """

    def __init__(
        self,
        in_channels,
        out_channels,
        *,
        bottleneck_channels,
        stride=1,
        num_groups=1,
        norm="BN",
        stride_in_1x1=False,
        dilation=1,
    ):
        """
        Args:
            bottleneck_channels (int): number of output channels for the 3x3
                "bottleneck" conv layers.
            num_groups (int): number of groups for the 3x3 conv layer.
            norm (str or callable): normalization for all conv layers.
                See :func:`layers.get_norm` for supported format.
            stride_in_1x1 (bool): when stride>1, whether to put stride in the
                first 1x1 convolution or the bottleneck 3x3 convolution.
            dilation (int): the dilation rate of the 3x3 conv layer.
        """
        super().__init__(in_channels, out_channels, stride)

        if in_channels != out_channels:
            self.shortcut = Conv2d(
                in_channels,
                out_channels,
                kernel_size=1,
                stride=stride,
                bias=False,
                norm=get_norm(norm, out_channels),
            )
        else:
            self.shortcut = None

        # The original MSRA ResNet models have stride in the first 1x1 conv
        # The subsequent fb.torch.resnet and Caffe2 ResNe[X]t implementations have
        # stride in the 3x3 conv
        stride_1x1, stride_3x3 = (stride, 1) if stride_in_1x1 else (1, stride)

        self.conv1 = Conv2d(
            in_channels,
            bottleneck_channels,
            kernel_size=1,
            stride=stride_1x1,
            bias=False,
            norm=get_norm(norm, bottleneck_channels),
        )

        self.conv2 = Conv2d(
            bottleneck_channels,
            bottleneck_channels,
            kernel_size=3,
            stride=stride_3x3,
            padding=1 * dilation,
            bias=False,
            groups=num_groups,
            dilation=dilation,
            norm=get_norm(norm, bottleneck_channels),
        )

        self.conv3 = Conv2d(
            bottleneck_channels,
            out_channels,
            kernel_size=1,
            bias=False,
            norm=get_norm(norm, out_channels),
        )

        for layer in [self.conv1, self.conv2, self.conv3, self.shortcut]:
            if layer is not None:  # shortcut can be None
                weight_init.c2_msra_fill(layer)
        
        self.se = SELayer(bottleneck_channels * 4, 16)

        # Zero-initialize the last normalization in each residual branch,
        # so that at the beginning, the residual branch starts with zeros,
        # and each residual block behaves like an identity.
        # See Sec 5.1 in "Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour":
        # "For BN layers, the learnable scaling coefficient γ is initialized
        # to be 1, except for each residual block's last BN
        # where γ is initialized to be 0."

        # nn.init.constant_(self.conv3.norm.weight, 0)
        # TODO this somehow hurts performance when training GN models from scratch.
        # Add it as an option when we need to use this code to train a backbone.

    def forward(self, x):
        out = self.conv1(x)
        out = F.relu_(out)

        out = self.conv2(out)
        out = F.relu_(out)

        out = self.conv3(out)
        out = self.se(out) # 添加了 SELayer

        if self.shortcut is not None:
            shortcut = self.shortcut(x)
        else:
            shortcut = x

        out += shortcut
        out = F.relu_(out)
        return out

在 Mask2Former 中使用 SE 模块

Mask2Former 调用的是 Detectron2 中 ResNet ，所以在 Detectron2 中添加 SE 模块后，在 Mask2Former 中也会有改动。要注意的是当本地存在多个 Detectron2 项目文件夹时，需要在虚拟环境安装过的 Detectron2 中进行修改。

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

【实例分割｜Detectron2】在主干网络 ResNet50 中添加 SE 注意力模块的相关文章

数据库三种故障恢复处理

三种故障类型如下 xff1a 故障类型事务故障系统故障介质故障 1 事务故障恢复由系统自动完成 xff0c 对用户是透明的 DBMS执行恢复操作的步骤如下 xff1a 反向扫描日志文件即从最后向前扫描日志文件 xff0c 查找该事务的更
各种语言字符串一定以‘\0‘结尾吗？

最近学校刚开python课 xff0c 以前大一的时候也简单接触过 xff0c 并未深入了解过 xff0c 也没有去总结因为我是计算机专业的 xff0c 学了c c 43 43 java xff0c 这次又新开了python xff0c
深度学习中的激活函数总结

激活函数饱和问题一个激活函数 h n h n h n xff0c 当n趋近于正无穷 xff0c 激活函数的导数趋近于0 xff0c 称之为右饱和 xff1b 当n趋近于负无穷 xff0c 激活函
2021年第十二届蓝桥杯省赛B组（C/C++）部分填空个人题解

1 试题 A xff1a 空间小蓝准备用256MB的内存空间开一个数组 xff0c 数组的每个元素都是32位二进制整数 xff0c 如果不考虑程序占用的空间和维护内存需要的辅助空间 xff0c 请问256MB的空间可以存储多少个32位二进
PyCharm中 “import torch“出现问题时的解决方法

1 安装中遇到的坑直接在PyCharm中点击file gt setting gt project gt project interpreter 结果 xff1a 安装失败在命令窗口输入pip install torch 结果 xff1a
python数据分析可视化大作业——对地铁数据的简单数据分析

一选题意义随着我国经济的快速发展 xff0c 我们国家的地铁事业正在快速发展 xff0c 很多城市都拥有了地铁自1969年北京开通第一条地铁线路建成通车 xff0c 到2021年全国总线路总长达7253 73公里 xff0c 我们只用
如何使用 Apache IoTDB 分布式系统监控模块

从 Apache IoTDB 0 13 0 版本开始 xff0c 我们引入了系统监控模块 xff0c 可以完成对 Apache IoTDB 的重要运行指标进行监控 xff0c 本文介绍了如何在 Apache IoTDB 分布式开启系统监控模
rosdep update 的解决

我是按照 ububtu20 04安装ROS zhangsxa的博客 CSDN博客这个安装的ros 但是初始化的时候遇到了各种各样的报错 xff0c 反正基本上是一直在百度 xff0c 一直在尝试各种方法最后的解决过程 xff08 其中也
ubuntu虚拟机roslaunch usb_cam usb_cam-test.launch报错

运行roslaunch usb cam usb cam test launch后出现警告Unable to open camera calibration file home hri ros camera info head camera
github下载老版本的项目

首先github是一种远程git仓库一般大型项目 xff08 如tensorflow xff09 每发行一个版本就是一个分支branch 而一般主分支master是最新发布的版本因为程序的兼容问题 xff0c 很多时候需要我们安装旧版本
FreeRTOS入门学

任务要求 xff1a 在STM32下完成一个基于FreeRTOS的多任务程序 xff0c 执行3个周期性task 目录一介绍FreeRTOS二 FreeRTOS的多任务程序实现一介绍FreeRTOS 1 简介 xff1a xff08
如何更改excel直线拟合有效数字的位数

如何更改excel直线拟合有效数字的位数在公式上单击鼠标右键 xff0c 选择设置趋势线标签格式在数据类别中选择数字 xff0c 小数位自行确定
【已解决】WARNING: Ignoring invalid distribution xxx

问题解决方案解释问题 WARNING Ignoring invalid distribution umpy c users xxx appdata roaming python python36 site packages 解决方案在报
transformer

简介 transformer最早于2017年google机器翻译团队提出 xff0c 也就是著名的 Attention Is All You Need xff0c transformer完全取代了以往的RNN和CNN结构 xff0c 改为由
单精度float与双精度double

单精度双精度 xff1a 单精度 xff0c 也即float xff0c 一般在计算机中存储占用4字节 xff0c 也32位 xff0c 有效位数为7位 xff1b 双精度 xff08 double xff09 在计算机中存储占用8字节
【完美解决】Github action报错remote: Write access to repository not granted.

报错及效果图报错代码效果图解决方案必要步骤可能有效的步骤报错及效果图本解决方案是笔者通过Github action运行项目时报错的解决方案 xff0c 如果是本地运行报此错 xff0c 未必有效果报错代码 remote Write
【已解决】error: failed to push some refs to ‘git@github.com:BATdalao/Github-green.git‘

文章目录报错及效果图报错代码最终效果图解决方案报错及效果图报错代码 git push To github com xxx xxx git rejected main gt main fetch first error failed
【已解决】winmm.dll被报病毒的解决方案

安装typora时的winmm dll被报病毒 xff0c 关闭防火墙可以 xff0c 但是重启电脑会再次报病毒在windows安全中心的如图路径 xff0c 找到该报错并设置允许即可安全性问题 xff1a winmm dll作者曾发文
【已解决】UnicodeDecodeError: ‘gbk‘ codec can‘t decode byte 0xad in position 10: illegalmultibytesequence

报错代码 xff1a f span class token operator 61 span span class token builtin open span span class token punctuation span span
【已解决】kex_exchange_identification: Connection closed by remote host fatal: Could not read from

文章目录报错及效果图报错代码成功效果图解决方案必要的解决方法可能有用的解决方法报错及效果图报错代码 kex exchange identification Connection closed by remote span class

随机推荐

【已解决】VMware Player 无法与 VMware Workstation 一起安装。请先卸载 VMware Workstation，再尝试安装VMware Player

文章目录报错本解决方案适用情境解决方案必要的解决方法可能有用的解决方法报错 VMware Player 无法与 VMware Workstation 一起安装请先卸载 VMware Workstation xff0c 再尝试安装VM
基于蓝牙智能家庭影音控制系统---粤嵌GEC6818嵌入式系统实训

版本介绍普通版完整版至尊版版本介绍分为普通版完整版至尊版三个版本普通版可以满足实训要求 xff0c 提供代码 xff0c 不提供技术指导实现功能 xff1a 1所有界面自行设计 xff0c 要求尽可能好看 2 执行程序 xff
【已解决】Flask当中render_template函数使用过程当中css文件无法正常渲染

文章目录报错可能原因解决方案必要的解决方法可能有用的解决方法报错 Flask当中render template函数使用过程当中css文件无法正常渲染 xff0c 直接显示的html 可能原因当在Flask应用程序中使用render
【已解决】License checkout failed. License Manager Error -8 Make sure the HostlD of the license

文章目录报错图解决方案报错图安装matlab2020b xff0c 双击matlab exe报错解决方案下载对应的破解包 xff0c 一般安装教程里面都有 1 将破解文件中 34 Crack R2020a bin win64 ma
【已解决】AttributeError: module ‘nmap‘ has no attribute ‘PortScanner‘

文章目录报错解决方案必要的解决方法下载安装nmap代码中添加exe路径可能有用的解决方法报错 AttributeError module nmap has no attribute PortScanner 解决方案必要的解决方法抛
Trajectory Forecasting：TrajNet++

概述由于自动驾驶和服务机器人等人工智能新兴应用的需求不断增长 xff0c 拥挤场景中的轨迹预测已成为近年来的一个重要话题轨迹预测的一项重要挑战是有效地建模社交互动在过去的几年中 xff0c 已经提出了几种新颖的方法然而 xff0c
【已解决】AttributeError: ‘Index‘ object has no attribute ‘to_list‘

文章目录报错及效果图报错代码效果图解决方案必要的解决方法报错及效果图报错代码 AttributeError span class token punctuation span span class token string 39 I
【代码】读取图像，计算面宽比，并保存至表格

计算面宽比读取某一文件夹下的图片并计算面宽比 xff0c 并保存至表格安装dlib报错怎么办计算面宽比此处计算 xff08 第一个点和第17个点之间的距离 xff09 xff08 第28个点和第52个点之间的距离 xff09 span
探究肺癌患者的CT图像的图像特征并构建一个诊断模型

目标效果图操作说明代码目标探究肺癌患者的CT图像的图像特征并构建一个诊断模型效果图操作说明代码中我以建立10张图为例 xff0c 多少你自己定准备工作 xff1a 1 准备肺癌或非肺癌每个各10张图 xff0c 在本地创建一个名
【已解决】Pygame无法显示中文

文章目录报错截图及效果图报错图效果图解决方案其他问题报错截图及效果图报错图效果图解决方案添加这行代码即可 font span class token operator 61 span pygame span class tok
【已解决】Resource wordnet not found. Please use the NLTK Downloader to obtain the resource

文章目录报错代码解决方案必要的解决方法可能有用的解决方法非常重要报错代码 Resource wordnet not found Please use the NLTK Downloader to obtain the resource
Launch启动文件的使用方法

Launch启动文件的使用方法案例一 xff1a 运行两个节点案例二 xff1a 加载参数与命名空间案例三 xff1a 小海龟跟随的launch启动方法案例四 xff1a remap修改节点名 Launch文件可以通过XML文件实现多节点
什么是死锁？死锁如何解决？

1 死锁是什么 xff1f 死锁是指两个或多个事务在同一资源上相互占用 xff0c 并请求锁定对方的资源 xff0c 从而导致恶性循环的现象当多个进程因竞争资源而造成的一种僵局 xff08 互相等待 xff09 xff0c 若无外力作用
Ubuntu20.04+ros+PX4学习第三天

激光slam学习 xff1a 激光slam所用到的传感器 xff1a 惯性测量单元 xff08 IMU xff09 43 轮式里程计 43 激光雷达轮式里程计算角度误差会很大 xff0c 一般用IMU计算角度 xff0c 轮式里程计用来算
C语言实现将彩色bmp图像转化为灰图、灰度图像反色

彩色图像转灰度图像彩色 xff08 24位 xff09 bmp图像结构 xff1a span class token keyword typedef span span class token keyword struct span sp
【实例分割｜Mask2Former】解决模型推理预测的代码中存在的一些问题

文章目录取消终端输出网络结构推理置信度设置预测实例存在多个轮廓预测模型返回筛选后实例取消终端输出网络结构在运行 demo py 时 xff0c 终端会输出大量网络结构信息 xff0c 影响调试代码需要在 Detectron2 中的
梯度消失与梯度爆炸

简介梯度消失问题和梯度爆炸问题 xff0c 总的来说可以称为梯度不稳定问题 ReLU激活函数 xff0c 用Batch Normal xff0c 用残差结构解决梯度消失问题正则化来限制梯度爆炸梯度消失梯度消失的原始是反向传播时的链式法
【Pytorch｜Bug】解决 RuntimeError: Error(s) in loading state_dict for Network: size mismatch

文章目录问题背景解决方法问题背景 Github开源项目 xff1a https github com zhang tao whu e2ec python train net py coco finetune bs span class
【数据集｜COCO】COCO格式数据集制作与数据集参数计算

文章目录 1 批量修改 JSON 文件中的参数1 1 问题背景1 2 代码实现 2 划分训练集和测试集2 1 问题背景2 2 环境配置2 3 代码实现 3 生成 JSON 标签文件3 1 环境配置3 2 代码实现 4 计算训练集三通道均值4
【实例分割｜Detectron2】在主干网络 ResNet50 中添加 SE 注意力模块

文章目录在 Detectron2 中使用 SE 模块在 Mask2Former 中使用 SE 模块在 Detectron2 中使用 SE 模块直接在 detectron2 detectron2 modeling backbone re