Details
基于无锚框分割网络改进的实例分割方法
Improved Instance Segmentation Method Based on Anchor-Free Segmentation Network
Document type: Journal article
Chinese title: 基于无锚框分割网络改进的实例分割方法
English title: Improved Instance Segmentation Method Based on Anchor-Free Segmentation Network
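Authors: Liu Teng [1,2]; Liu Hongzhe [1,2]; Li Xuewei [1,2]; Xu Cheng [1,2]
First author: Liu Teng
Affiliations: [1] Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101; [2] College of Robotics, Beijing Union University, Beijing 100101
First affiliation: Beijing Key Laboratory of Information Service Engineering, Beijing Union University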
Year: 2022
Volume: 48
Issue: 9
Pages: 239-247
Chinese journal name: 计算机工程
Foreign-language journal name: Computer Engineering
Indexed in: CSTPCD; PKU Core Journals (北大核心) [2020 edition]; CSCD [CSCD_E2021_2022]
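Funding: National Natural Science Foundation of China (61871039, 62102033, 62171042, 61906017); Beijing Municipal Education Commission projects (KM202111417001, KM201911417001); Collaborative Innovation Center for Visual Intelligence project (CYXC2011); Beijing Union University academic research projects (ZB10202003, ZK40202101, ZK120202104); Beijing Union University graduate research and innovation project (YZ2020K001).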
Language: Chinese
Chinese keywords: 无锚框实例分割; 深度学习; 编码-解码结构; 注意力机制; 空洞卷积
Foreign-language keywords: anchor-free instance segmentation; deep learning (DL); encoder-decoder structure; attention mechanism; dilated convolution
Abstract: In autonomous driving scenarios, existing anchor-free instance segmentation methods suffer from several problems: features of large targets overwhelm those of small targets, the Region of Interest (RoI) Align operation used in two-stage detectors is absent, and the position and spatial information that the category branch can provide to the mask branch is ignored. As a result, feature extraction is insufficient and target regions cannot be located accurately. An improved anchor-free instance segmentation method is proposed. An encoder-decoder feature extraction network that incorporates deformable convolution is designed to extract high-resolution features and strengthen the extraction of small-target features. Dilated convolution and merge connections are used to fuse features of multiple resolutions effectively without increasing the amount of computation. On this basis, an attention mechanism is introduced into the category branch, and an information enhancement module that combines spatial and channel information is designed to improve target detection. Experimental results show that the Average Precision (AP) and mean Intersection over Union (mIoU) of the proposed method on the COCO 2017 and Cityscapes datasets are 41.1% and 83.3%, respectively. Compared with Mask R-CNN, SOLO, Yolact, and other methods, the proposed method improves instance segmentation results and shows better robustness.