Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies

Abstract Background Artificial Intelligence (AI) is a promising tool for cardiothoracic ratio (CTR) measurement that has been technically validated but not clinically evaluated on a large dataset. We observed and validated AI and manual methods for CTR measurement using a large dataset and investiga...

Full description

Bibliographic Details
Main Authors: Pairash Saiviroonporn, Kanchanaporn Rodbangyang, Trongtum Tongdee, Warasinee Chaisangmongkon, Pakorn Yodprom, Thanogchai Siriapisith, Suwimon Wonglaksanapimon, Phakphoom Thiravit
Format: Article
Language:English
Published: BMC 2021-06-01
Series:BMC Medical Imaging
Subjects:
AI
Online Access:https://doi.org/10.1186/s12880-021-00625-0
id doaj-43771ac3ffc84d769f5417e801d17ea3
record_format Article
spelling doaj-43771ac3ffc84d769f5417e801d17ea32021-06-13T11:51:57ZengBMCBMC Medical Imaging1471-23422021-06-0121111110.1186/s12880-021-00625-0Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studiesPairash Saiviroonporn0Kanchanaporn Rodbangyang1Trongtum Tongdee2Warasinee Chaisangmongkon3Pakorn Yodprom4Thanogchai Siriapisith5Suwimon Wonglaksanapimon6Phakphoom Thiravit7Department of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityInstitute of Field Robotics, King Mongkut’s University of Technology ThonburiDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityDepartment of Radiology, Faculty of Medicine Siriraj Hospital, Mahidol UniversityAbstract Background Artificial Intelligence (AI) is a promising tool for cardiothoracic ratio (CTR) measurement that has been technically validated but not clinically evaluated on a large dataset. We observed and validated AI and manual methods for CTR measurement using a large dataset and investigated the clinical utility of the AI method. Methods Five thousand normal chest x-rays and 2,517 images with cardiomegaly and CTR values, were analyzed using manual, AI-assisted, and AI-only methods. AI-only methods obtained CTR values from a VGG-16 U-Net model. An in-house software was used to aid the manual and AI-assisted measurements and to record operating time. Intra and inter-observer experiments were performed on manual and AI-assisted methods and the averages were used in a method variation study. AI outcomes were graded in the AI-assisted method as excellent (accepted by both users independently), good (required adjustment), and poor (failed outcome). Bland–Altman plot with coefficient of variation (CV), and coefficient of determination (R-squared) were used to evaluate agreement and correlation between measurements. Finally, the performance of a cardiomegaly classification test was evaluated using a CTR cutoff at the standard (0.5), optimum, and maximum sensitivity. Results Manual CTR measurements on cardiomegaly data were comparable to previous radiologist reports (CV of 2.13% vs 2.04%). The observer and method variations from the AI-only method were about three times higher than from the manual method (CV of 5.78% vs 2.13%). AI assistance resulted in 40% excellent, 56% good, and 4% poor grading. AI assistance significantly improved agreement on inter-observer measurement compared to manual methods (CV; bias: 1.72%; − 0.61% vs 2.13%; − 1.62%) and was faster to perform (2.2 ± 2.4 secs vs 10.6 ± 1.5 secs). The R-squared and classification-test were not reliable indicators to verify that the AI-only method could replace manual operation. Conclusions AI alone is not yet suitable to replace manual operations due to its high variation, but it is useful to assist the radiologist because it can reduce observer variation and operation time. Agreement of measurement should be used to compare AI and manual methods, rather than R-square or classification performance tests.https://doi.org/10.1186/s12880-021-00625-0Cardiothoracic ratioDeep learningClinical validationObserver variationAI
collection DOAJ
language English
format Article
sources DOAJ
author Pairash Saiviroonporn
Kanchanaporn Rodbangyang
Trongtum Tongdee
Warasinee Chaisangmongkon
Pakorn Yodprom
Thanogchai Siriapisith
Suwimon Wonglaksanapimon
Phakphoom Thiravit
spellingShingle Pairash Saiviroonporn
Kanchanaporn Rodbangyang
Trongtum Tongdee
Warasinee Chaisangmongkon
Pakorn Yodprom
Thanogchai Siriapisith
Suwimon Wonglaksanapimon
Phakphoom Thiravit
Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
BMC Medical Imaging
Cardiothoracic ratio
Deep learning
Clinical validation
Observer variation
AI
author_facet Pairash Saiviroonporn
Kanchanaporn Rodbangyang
Trongtum Tongdee
Warasinee Chaisangmongkon
Pakorn Yodprom
Thanogchai Siriapisith
Suwimon Wonglaksanapimon
Phakphoom Thiravit
author_sort Pairash Saiviroonporn
title Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
title_short Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
title_full Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
title_fullStr Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
title_full_unstemmed Cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
title_sort cardiothoracic ratio measurement using artificial intelligence: observer and method validation studies
publisher BMC
series BMC Medical Imaging
issn 1471-2342
publishDate 2021-06-01
description Abstract Background Artificial Intelligence (AI) is a promising tool for cardiothoracic ratio (CTR) measurement that has been technically validated but not clinically evaluated on a large dataset. We observed and validated AI and manual methods for CTR measurement using a large dataset and investigated the clinical utility of the AI method. Methods Five thousand normal chest x-rays and 2,517 images with cardiomegaly and CTR values, were analyzed using manual, AI-assisted, and AI-only methods. AI-only methods obtained CTR values from a VGG-16 U-Net model. An in-house software was used to aid the manual and AI-assisted measurements and to record operating time. Intra and inter-observer experiments were performed on manual and AI-assisted methods and the averages were used in a method variation study. AI outcomes were graded in the AI-assisted method as excellent (accepted by both users independently), good (required adjustment), and poor (failed outcome). Bland–Altman plot with coefficient of variation (CV), and coefficient of determination (R-squared) were used to evaluate agreement and correlation between measurements. Finally, the performance of a cardiomegaly classification test was evaluated using a CTR cutoff at the standard (0.5), optimum, and maximum sensitivity. Results Manual CTR measurements on cardiomegaly data were comparable to previous radiologist reports (CV of 2.13% vs 2.04%). The observer and method variations from the AI-only method were about three times higher than from the manual method (CV of 5.78% vs 2.13%). AI assistance resulted in 40% excellent, 56% good, and 4% poor grading. AI assistance significantly improved agreement on inter-observer measurement compared to manual methods (CV; bias: 1.72%; − 0.61% vs 2.13%; − 1.62%) and was faster to perform (2.2 ± 2.4 secs vs 10.6 ± 1.5 secs). The R-squared and classification-test were not reliable indicators to verify that the AI-only method could replace manual operation. Conclusions AI alone is not yet suitable to replace manual operations due to its high variation, but it is useful to assist the radiologist because it can reduce observer variation and operation time. Agreement of measurement should be used to compare AI and manual methods, rather than R-square or classification performance tests.
topic Cardiothoracic ratio
Deep learning
Clinical validation
Observer variation
AI
url https://doi.org/10.1186/s12880-021-00625-0
work_keys_str_mv AT pairashsaiviroonporn cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT kanchanapornrodbangyang cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT trongtumtongdee cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT warasineechaisangmongkon cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT pakornyodprom cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT thanogchaisiriapisith cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT suwimonwonglaksanapimon cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
AT phakphoomthiravit cardiothoracicratiomeasurementusingartificialintelligenceobserverandmethodvalidationstudies
_version_ 1721379338202382336