Study on Data Partition for Delimitation of Masses in Mammography

Mammography is the primary medical imaging method used for routine screening and early detection of breast cancer in women. However, the process of manually inspecting, detecting, and delimiting the tumoral massess in 2D images is a very time-consuming task, subject to human errors due to fatigue. T...

Full description

Bibliographic Details
Main Authors: Luís Viegas, Inês Domingues, Mateus Mendes
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Journal of Imaging
Subjects:
Online Access:https://www.mdpi.com/2313-433X/7/9/174
id doaj-8bb7b71a66e4472a8e19c3efc444c616
record_format Article
spelling doaj-8bb7b71a66e4472a8e19c3efc444c6162021-09-26T00:29:35ZengMDPI AGJournal of Imaging2313-433X2021-09-01717417410.3390/jimaging7090174Study on Data Partition for Delimitation of Masses in MammographyLuís Viegas0Inês Domingues1Mateus Mendes2Polytechnic of Coimbra—ISEC, Rua Pedro Nunes, Quinta da Nora, 3030-199 Coimbra, PortugalMedical Physics, Radiobiology and Radiation Protection Group, IPO Porto Research Centre (CI-IPOP), 4200-072 Porto, PortugalPolytechnic of Coimbra—ISEC, Rua Pedro Nunes, Quinta da Nora, 3030-199 Coimbra, PortugalMammography is the primary medical imaging method used for routine screening and early detection of breast cancer in women. However, the process of manually inspecting, detecting, and delimiting the tumoral massess in 2D images is a very time-consuming task, subject to human errors due to fatigue. Therefore, integrated computer-aided detection systems have been proposed, based on modern computer vision and machine learning methods. In the present work, mammogram images from the publicly available Inbreast dataset are first converted to pseudo-color and then used to train and test a Mask R-CNN deep neural network. The most common approach is to start with a dataset and split the images into train and test set randomly. However, since there are often two or more images of the same case in the dataset, the way the dataset is split may have an impact on the results. Our experiments show that random partition of the data can produce unreliable training, so the dataset must be split using case-wise partition for more stable results. In experimental results, the method achieves an average true positive rate of 0.936 with 0.063 standard deviation using random partition and 0.908 with 0.002 standard deviation using case-wise partition, showing that case-wise partition must be used for more reliable results.https://www.mdpi.com/2313-433X/7/9/174mammographycomputer-aided detectionbreast massmass detectionmass segmentationMask R-CNN
collection DOAJ
language English
format Article
sources DOAJ
author Luís Viegas
Inês Domingues
Mateus Mendes
spellingShingle Luís Viegas
Inês Domingues
Mateus Mendes
Study on Data Partition for Delimitation of Masses in Mammography
Journal of Imaging
mammography
computer-aided detection
breast mass
mass detection
mass segmentation
Mask R-CNN
author_facet Luís Viegas
Inês Domingues
Mateus Mendes
author_sort Luís Viegas
title Study on Data Partition for Delimitation of Masses in Mammography
title_short Study on Data Partition for Delimitation of Masses in Mammography
title_full Study on Data Partition for Delimitation of Masses in Mammography
title_fullStr Study on Data Partition for Delimitation of Masses in Mammography
title_full_unstemmed Study on Data Partition for Delimitation of Masses in Mammography
title_sort study on data partition for delimitation of masses in mammography
publisher MDPI AG
series Journal of Imaging
issn 2313-433X
publishDate 2021-09-01
description Mammography is the primary medical imaging method used for routine screening and early detection of breast cancer in women. However, the process of manually inspecting, detecting, and delimiting the tumoral massess in 2D images is a very time-consuming task, subject to human errors due to fatigue. Therefore, integrated computer-aided detection systems have been proposed, based on modern computer vision and machine learning methods. In the present work, mammogram images from the publicly available Inbreast dataset are first converted to pseudo-color and then used to train and test a Mask R-CNN deep neural network. The most common approach is to start with a dataset and split the images into train and test set randomly. However, since there are often two or more images of the same case in the dataset, the way the dataset is split may have an impact on the results. Our experiments show that random partition of the data can produce unreliable training, so the dataset must be split using case-wise partition for more stable results. In experimental results, the method achieves an average true positive rate of 0.936 with 0.063 standard deviation using random partition and 0.908 with 0.002 standard deviation using case-wise partition, showing that case-wise partition must be used for more reliable results.
topic mammography
computer-aided detection
breast mass
mass detection
mass segmentation
Mask R-CNN
url https://www.mdpi.com/2313-433X/7/9/174
work_keys_str_mv AT luisviegas studyondatapartitionfordelimitationofmassesinmammography
AT inesdomingues studyondatapartitionfordelimitationofmassesinmammography
AT mateusmendes studyondatapartitionfordelimitationofmassesinmammography
_version_ 1717365991132889088