基于特征融合与自注意力机制的图像语义分割算法

刘丽婷; 高飞; 群诺

doi:10.19304/J.ISSN1000-7180.2023.0110

基于特征融合与自注意力机制的图像语义分割算法

An image semantic segmentation method based on feature fusion and self-attention mechanism

摘要

摘要: 提出了一种基于特征融合与自注意力机制的图像语义分割方法，设计了特征融合模块、自注意力模块、增强模块、全局空间信息融合模块和损失函数。特征融合模块融合多个图像的所有组件，通过自注意力机制来执行。自注意力模块从而有效地捕获远程上下文信息。增强模块旨在增强输入图像以获得更多样化的特征。全局空间信息注意模块相对于图像尺寸只有线性的复杂度，能够带来显著的提升效果。利用损失函数，对模型进行优化，将每个像素的分类结果优化到最接近真实值。实验结果表明，所提出的方法可以显著提高PASCAL VOC 2012数据集、COCO-Stuff 10K数据集和ISIC 2018数据集这3个数据集的性能，并在3个数据集上进行了验证，实验还通过对自注意力、推理速度和消融实验进行比较，验证了本文方法的优越性。

Abstract: This paper proposes an image semantic segmentation method based on feature fusion and self attention mechanism, and designs feature fusion module, self attention module, enhancement module, global spatial information fusion module and loss function. The feature fusion module fuses all components from multiple images and executes them through the self attention mechanism. Self attention module can effectively capture remote context information. The enhancement module aims to enhance the input image to obtain more diversified features. The global spatial information attention module has only linear complexity relative to the image size, which can bring significant improvement effect. The loss function is used to optimize the model, and the classification result of each pixel is optimized to the nearest real value. The experimental results show that the proposed method can significantly improve the performance of PASCAL VOC 2012 dataset, COCO Stuff 10K dataset and ISIC 2018 dataset, and has been verified on three datasets. The experiment also verifies the advantages of the method in this paper by comparing self attention, reasoning speed and ablation experiments.

HTML全文

参考文献(39)

施引文献

资源附件(0)