CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery

In the realm of remote sensing image analysis, the task of road extraction poses significant complexities, especially in the context of intricate scenes and diminutive targets. In response to these challenges, we have developed a novel deep learning network, christened CDAU-Net, designed to discern...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Remote Sensing
المؤلفون الرئيسيون: Anchao Yin, Chao Ren, Weiting Yue, Hongjuan Shao, Xiaoqin Xue
التنسيق: مقال
اللغة:الإنجليزية
منشور في: MDPI AG 2023-10-01
الموضوعات:
الوصول للمادة أونلاين:https://www.mdpi.com/2072-4292/15/20/4914
_version_ 1851875417412075520
author Anchao Yin
Chao Ren
Weiting Yue
Hongjuan Shao
Xiaoqin Xue
author_facet Anchao Yin
Chao Ren
Weiting Yue
Hongjuan Shao
Xiaoqin Xue
author_sort Anchao Yin
collection DOAJ
container_title Remote Sensing
description In the realm of remote sensing image analysis, the task of road extraction poses significant complexities, especially in the context of intricate scenes and diminutive targets. In response to these challenges, we have developed a novel deep learning network, christened CDAU-Net, designed to discern and delineate these features with enhanced precision. This network takes its structural inspiration from the fundamental architecture of U-Net while introducing innovative enhancements: we have integrated CoordConv convolutions into both the initial layer of the U-Net encoder and the terminal layer of the decoder, thereby facilitating a more efficacious processing of spatial information inherent in remote sensing images. Moreover, we have devised a unique mechanism termed the Deep Dual Cross Attention (DDCA), purposed to capture long-range dependencies within images—a critical factor in remote sensing image analysis. Our network replaces the skip-connection component of the U-Net with this newly designed mechanism, dealing with feature maps of the first four scales in the encoder and generating four corresponding outputs. These outputs are subsequently linked with the decoder stage to further capture the remote dependencies present within the remote sensing imagery. We have subjected CDAU-Net to extensive empirical validation, including testing on the Massachusetts Road Dataset and DeepGlobe Road Dataset. Both datasets encompass a diverse range of complex road scenes, making them ideal for evaluating the performance of road extraction algorithms. The experimental results showcase that whether in terms of accuracy, recall rate, or Intersection over Union (IoU) metrics, the CDAU-Net outperforms existing state-of-the-art methods in the task of road extraction. These findings substantiate the effectiveness and superiority of our approach in handling complex scenes and small targets, as well as in capturing long-range dependencies in remote sensing imagery. In sum, the design of CDAU-Net not only enhances the accuracy of road extraction but also presents new perspectives and possibilities for deep learning analysis of remote sensing imagery.
format Article
id doaj-art-4dc67dca6dac49898aaa0f3bd0fb3914
institution Directory of Open Access Journals
issn 2072-4292
language English
publishDate 2023-10-01
publisher MDPI AG
record_format Article
spelling doaj-art-4dc67dca6dac49898aaa0f3bd0fb39142025-08-19T22:15:14ZengMDPI AGRemote Sensing2072-42922023-10-011520491410.3390/rs15204914CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing ImageryAnchao Yin0Chao Ren1Weiting Yue2Hongjuan Shao3Xiaoqin Xue4College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541006, ChinaCollege of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541006, ChinaCollege of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541006, ChinaCollege of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541006, ChinaCollege of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541006, ChinaIn the realm of remote sensing image analysis, the task of road extraction poses significant complexities, especially in the context of intricate scenes and diminutive targets. In response to these challenges, we have developed a novel deep learning network, christened CDAU-Net, designed to discern and delineate these features with enhanced precision. This network takes its structural inspiration from the fundamental architecture of U-Net while introducing innovative enhancements: we have integrated CoordConv convolutions into both the initial layer of the U-Net encoder and the terminal layer of the decoder, thereby facilitating a more efficacious processing of spatial information inherent in remote sensing images. Moreover, we have devised a unique mechanism termed the Deep Dual Cross Attention (DDCA), purposed to capture long-range dependencies within images—a critical factor in remote sensing image analysis. Our network replaces the skip-connection component of the U-Net with this newly designed mechanism, dealing with feature maps of the first four scales in the encoder and generating four corresponding outputs. These outputs are subsequently linked with the decoder stage to further capture the remote dependencies present within the remote sensing imagery. We have subjected CDAU-Net to extensive empirical validation, including testing on the Massachusetts Road Dataset and DeepGlobe Road Dataset. Both datasets encompass a diverse range of complex road scenes, making them ideal for evaluating the performance of road extraction algorithms. The experimental results showcase that whether in terms of accuracy, recall rate, or Intersection over Union (IoU) metrics, the CDAU-Net outperforms existing state-of-the-art methods in the task of road extraction. These findings substantiate the effectiveness and superiority of our approach in handling complex scenes and small targets, as well as in capturing long-range dependencies in remote sensing imagery. In sum, the design of CDAU-Net not only enhances the accuracy of road extraction but also presents new perspectives and possibilities for deep learning analysis of remote sensing imagery.https://www.mdpi.com/2072-4292/15/20/4914remote sensing image analysisroad extractionCDAU-NetCoordConvconvolutionsDeep Dual Cross Attention (DDCA)
spellingShingle Anchao Yin
Chao Ren
Weiting Yue
Hongjuan Shao
Xiaoqin Xue
CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
remote sensing image analysis
road extraction
CDAU-Net
CoordConv
convolutions
Deep Dual Cross Attention (DDCA)
title CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
title_full CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
title_fullStr CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
title_full_unstemmed CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
title_short CDAU-Net: A Novel CoordConv-Integrated Deep Dual Cross Attention Mechanism for Enhanced Road Extraction in Remote Sensing Imagery
title_sort cdau net a novel coordconv integrated deep dual cross attention mechanism for enhanced road extraction in remote sensing imagery
topic remote sensing image analysis
road extraction
CDAU-Net
CoordConv
convolutions
Deep Dual Cross Attention (DDCA)
url https://www.mdpi.com/2072-4292/15/20/4914
work_keys_str_mv AT anchaoyin cdaunetanovelcoordconvintegrateddeepdualcrossattentionmechanismforenhancedroadextractioninremotesensingimagery
AT chaoren cdaunetanovelcoordconvintegrateddeepdualcrossattentionmechanismforenhancedroadextractioninremotesensingimagery
AT weitingyue cdaunetanovelcoordconvintegrateddeepdualcrossattentionmechanismforenhancedroadextractioninremotesensingimagery
AT hongjuanshao cdaunetanovelcoordconvintegrateddeepdualcrossattentionmechanismforenhancedroadextractioninremotesensingimagery
AT xiaoqinxue cdaunetanovelcoordconvintegrateddeepdualcrossattentionmechanismforenhancedroadextractioninremotesensingimagery