Multiscale and Hierarchical Feature-Aggregation Network for Segmenting Medical Images

We propose an encoder–decoder architecture using wide and deep convolutional layers combined with different aggregation modules for the segmentation of medical images. Initially, we obtain a rich representation of features that span from low to high levels and from small to large scales by stacking...

Full description

Bibliographic Details
Main Authors: Choi, J.Y (Author), Lee, B. (Author), Yamanakkanavar, N. (Author)
Format: Article
Language:English
Published: MDPI 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02951nam a2200397Ia 4500
001 10.3390-s22093440
008 220706s2022 CNT 000 0 und d
020 |a 14248220 (ISSN) 
245 1 0 |a Multiscale and Hierarchical Feature-Aggregation Network for Segmenting Medical Images 
260 0 |b MDPI  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3390/s22093440 
520 3 |a We propose an encoder–decoder architecture using wide and deep convolutional layers combined with different aggregation modules for the segmentation of medical images. Initially, we obtain a rich representation of features that span from low to high levels and from small to large scales by stacking multiple k × k kernels, where each k × k kernel operation is split into k × 1 and 1 × k convolutions. In addition, we introduce two feature-aggregation modules—multiscale feature aggregation (MFA) and hierarchical feature aggregation (HFA)—to better fuse information across end-to-end network layers. The MFA module progressively aggregates features and enriches feature representation, whereas the HFA module merges the features iteratively and hierarchically to learn richer combinations of the feature hierarchy. Furthermore, because residual connections are advantageous for assembling very deep networks, we employ an MFA-based long residual connections to avoid vanishing gradients along the aggregation paths. In addition, a guided block with multilevel convolution provides effective attention to the features that were copied from the encoder to the decoder to recover spatial information. Thus, the proposed method using feature-aggregation modules combined with a guided skip connection improves the segmentation accuracy, achieving a high similarity index for ground-truth segmentation maps. Experimental results indicate that the proposed model achieves a superior segmentation performance to that obtained by conventional methods for skin-lesion segmentation, with an average accuracy score of 0.97 on the ISIC-2018, PH2, and UFBA-UESC datasets. © 2022 by the authors. Licensee MDPI, Basel, Switzerland. 
650 0 4 |a Aggregation network 
650 0 4 |a Convolution 
650 0 4 |a convolutional neural network 
650 0 4 |a Convolutional neural network 
650 0 4 |a Decoding 
650 0 4 |a Encoder-decoder architecture 
650 0 4 |a Feature aggregation 
650 0 4 |a feature fusion 
650 0 4 |a Features fusions 
650 0 4 |a Hierarchical features 
650 0 4 |a Image segmentation 
650 0 4 |a Iterative methods 
650 0 4 |a Low-to-high 
650 0 4 |a Medical image segmentation 
650 0 4 |a Medical imaging 
650 0 4 |a medical-image segmentation 
650 0 4 |a Multi-scale features 
650 0 4 |a Network layers 
650 0 4 |a Segmentation of medical images 
650 0 4 |a Signal encoding 
700 1 0 |a Choi, J.Y.  |e author 
700 1 0 |a Lee, B.  |e author 
700 1 0 |a Yamanakkanavar, N.  |e author 
773 |t Sensors