高级检索

基于AM-TransUNet的X射线图像中铜管钎焊缺陷分割算法

Defect segmentation algorithm for soldering defects in copper tube X-ray images based on AM-TransUNet

  • 摘要: 在小径铜管钎焊结构的X射线数字化图像中,由于钎焊缺陷影像的尺寸、形态及对比度差异大,导致缺陷难以实现自动识别.为解决这一问题,通过多尺度特征表示与Biformer注意力机制相结合,提出一种基于TransUNet改进的缺陷分割模型AM-TransUNet.使用结合SimAM注意力机制的Res2Net模块替换基线模型TransUNet编码器部分的ResNet模块,以提升模型的多尺度特征提取能力和对复杂背景下缺陷局部细节特征的捕捉能力;在编码器末端级联Biformer动态稀疏注意力机制,以提高模型的检测速度和对全局信息的捕捉能力.利用构建的小径铜管X射线图像数据集进行缺陷分割试验,研究结果表明:AM-TransUNet模型的准确率、召回率比基线模型分别提高了2.8%和3.2%,HD95比基线模型减少了13.2,检测灵敏度优于径向0.5 mm的未钎透缺陷,AM-TransUNet缺陷分割模型具有更高的可靠性;同时,AM-TransUNet模型的检测速度比基线模型提高了9帧每秒,模型参数量减小了66 M.

     

    Abstract: In the X-ray digital images of soldering copper pipes with small diameter, the significant variations in size, shape, and contrast of soldering defect images make it difficult to achieve automatic defect recognition. To address this issue, we propose an improved defect segmentation model, AM-TransUNet, based on TransUNet, which combines multi-scale feature representation with the Biformer attention mechanism. The Res2Net module combined with the SimAM attention mechanism replaces the ResNet module in the encoder of the baseline TransUNet model, enhancing the model's multi-scale feature extraction capabilities and ability to capture local defect details in complex backgrounds. Additionally, the Biformer dynamic sparse attention mechanism is cascaded at the end of the encoder to improve detection speed and enhance the model's ability to capture global information. Experiments conducted on a constructed X-ray image dataset of small-diameter copper tubes demonstrate that the AM-TransUNet model achieves 2.8% and 3.2% higher precision and recall, respectively, compared to the baseline model. Moreover, the HD95 metric is reduced by 13.2, and the model shows superior sensitivity in detecting unfused defects with a radial dimension of 0.5 mm. The results indicate that the AM-TransUNet model has higher reliability in defect segmentation. Meanwhile, the detection speed of the AM-TransUNet model is improved by 9 frames per second over the baseline model, and the number of model parameters is reduced by 66 M.

     

/

返回文章
返回