RoadFocusNet: road extraction from remote sensing imagery using focused transformer and focused masked image modeling

Bibliographic Details
Container / Database:	International Journal of Digital Earth
Main Authors:	Hao Chen, Liangzhe Yang, Qingren Jia, Wei Xiong
Format:	Article
Language:	English
Published:	Taylor & Francis Group, 2025-12-01
Subjects:	Road extraction; masked image modeling; data augmentation; attention mechanism; remote sensing image
Online Access:	https://www.tandfonline.com/doi/10.1080/17538947.2025.2549435

Description
Summary:	Road extraction from remote sensing (RS) imagery is crucial for urban management, traffic planning, and autonomous driving. However, extracting accurate and complete roads remains challenging due to occlusions and severe class imbalance, where non-road regions dominate. To address these challenges, we propose a novel road extraction method incorporating two key components. The first is a Focused Masked Image Modeling (FocusMIM) strategy for data augmentation, which randomly masks road-related regions to efficiently model the latent dependency between occluded and non-occluded road parts. With FocusMIM, the model's ability to infer occluded roads is obviously improved. The second is a Focused Transformer (FocusFormer), which enhances road-related feature interactions through a Transformer-based encoder with Channel Self-Attention (CSA) modules and a Transformer decoder that leverages masked attention. The CSA modules aggregate global features of RS images to enhance contextual inference and mitigate occlusions. Meanwhile, the Transformer decoder employs a single road query that attends exclusively to road features, alleviating the class imbalance issue. Comprehensive experiments on the DeepGlobe Road, Massachusetts Road, and CHN6-CUG datasets demonstrate that our method outperforms several state-of-the-art methods, achieving an IoU increase of 0.96–5.38%. These results confirm the effectiveness of FocusMIM and FocusFormer in improving road continuity and reducing background interference.
ISSN:	1753-8947 1753-8955
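
Illustrative note: the summary describes FocusMIM only at a high level, as randomly masking road-related regions for data augmentation. The sketch below is one possible reading of that idea and is not the authors' implementation. The function name `focus_mask`, the square patch grid, the masking ratio, and the zero fill value are all assumptions made for illustration; the record does not state how the paper selects or fills masked regions.

```python
import random


def focus_mask(image, road_mask, patch=2, ratio=0.5, seed=None):
    """Zero out a random subset of the patches that contain road pixels.

    Hypothetical helper: `image` and `road_mask` are equally sized 2D lists,
    and `road_mask` holds 1 for road pixels and 0 for background.
    """
    height, width = len(image), len(image[0])
    rng = random.Random(seed)

    # Candidate patches: top-left corners of grid cells with at least one road pixel.
    candidates = [
        (top, left)
        for top in range(0, height, patch)
        for left in range(0, width, patch)
        if any(
            road_mask[r][c]
            for r in range(top, min(top + patch, height))
            for c in range(left, min(left + patch, width))
        )
    ]

    # Mask a fixed fraction of the road-related patches (at least one if any exist).
    count = max(1, round(ratio * len(candidates))) if candidates else 0
    chosen = rng.sample(candidates, count)

    masked = [row[:] for row in image]  # copy so the input is left untouched
    for top, left in chosen:
        for r in range(top, min(top + patch, height)):
            for c in range(left, min(left + patch, width)):
                masked[r][c] = 0  # assumed fill value
    return masked, sorted(chosen)


# Toy example: a 4x4 image with a horizontal road along the second row.
image = [[1] * 4 for _ in range(4)]
road = [
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
masked, chosen = focus_mask(image, road, patch=2, ratio=0.5, seed=0)
```

In this toy case only the two upper patches contain road pixels, so background-only patches are never masked; a model trained to reconstruct or segment the hidden patch would have to rely on the visible part of the road.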
