CMGNet: Context-aware middle-layer guidance network for salient object detection

Salient object detection (SOD) is a critical task in computer vision that involves accurately identifying and segmenting visually significant objects in an image. To address the challenges of gridding issues and feature dilution effects commonly encountered in SOD, we propose a sophisticated context...

وصف كامل

التفاصيل البيبلوغرافية
الحاوية / القاعدة:Journal of King Saud University: Computer and Information Sciences
المؤلفون الرئيسيون: Inam Ullah, Sumaira Hussain, Kashif Shaheed, Wajid Ali, Shahid Ali Khan, Yilong Yin, Yuling Ma
التنسيق: مقال
اللغة:الإنجليزية
منشور في: Springer 2024-01-01
الموضوعات:
الوصول للمادة أونلاين:http://www.sciencedirect.com/science/article/pii/S1319157823003920
الوصف
الملخص:Salient object detection (SOD) is a critical task in computer vision that involves accurately identifying and segmenting visually significant objects in an image. To address the challenges of gridding issues and feature dilution effects commonly encountered in SOD, we propose a sophisticated context-aware middle-layer guidance network (CMGNet). CMGNet incorporates the context-aware central-layer guidance module (CCGM), which utilizes cost-effective large kernels of depth-wise convolutions with embedded parallel channel attentions and squeeze-and-excitation (SeE) attentions mechanisms. It enables the model to effectively perceive objects of varying scales in complex scenarios. Additionally, the incorporation of the adjacent-to-central-layers paradigm enriches the model’s ability to capture more structural and contextual information. To further enhance performance, we introduce the dual-phase central-layer refinement module (DCRM), which effectively removes the minute blurry residuals in complex scenarios and enhances object segmentation. Moreover, we propose a novel hybrid loss function that handles hard pixels at or near boundaries by incorporating a weighting formula. This hybrid loss function combines binary cross-entropy (BCE), intersection over union (IoU), and consistency-enhanced loss (CEL), resulting in smoother and more precise saliency maps. Extensive evaluations on challenging datasets demonstrate the superiority of our approach over 15 state-of-the-art methods in salient object detection.
تدمد:1319-1578