Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing

Currently, the tasks of intelligent data analysis in medicine are becoming increasingly common. Existing artificial intelligence tools provide high effectiveness in solving these tasks when analyzing sufficiently large datasets. However, when there is very little training data available, current mac...

Full description

Bibliographic Details
Published in:Computation
Main Authors: Ivan Izonin, Roman Tkachenko, Pavlo Yendyk, Iryna Pliss, Yevgeniy Bodyanskiy, Michal Gregus
Format: Article
Language:English
Published: MDPI AG 2024-10-01
Subjects:
Online Access:https://www.mdpi.com/2079-3197/12/10/203
_version_ 1849750335597314048
author Ivan Izonin
Roman Tkachenko
Pavlo Yendyk
Iryna Pliss
Yevgeniy Bodyanskiy
Michal Gregus
author_facet Ivan Izonin
Roman Tkachenko
Pavlo Yendyk
Iryna Pliss
Yevgeniy Bodyanskiy
Michal Gregus
author_sort Ivan Izonin
collection DOAJ
container_title Computation
description Currently, the tasks of intelligent data analysis in medicine are becoming increasingly common. Existing artificial intelligence tools provide high effectiveness in solving these tasks when analyzing sufficiently large datasets. However, when there is very little training data available, current machine learning methods do not ensure adequate classification accuracy or may even produce inadequate results. This paper presents an enhanced input-doubling method for classification tasks in the case of limited data analysis, achieved via expanding the number of independent attributes in the augmented dataset with probabilities of belonging to each class of the task. The authors have developed an algorithmic implementation of the improved method using two Naïve Bayes classifiers. The method was modeled on a small dataset for cardiovascular risk assessment. The authors explored two options for the combined use of Naïve Bayes classifiers at both stages of the method. It was found that using different methods at both stages potentially enhances the accuracy of the classification task. The results of the improved method were compared with a range of existing methods used for solving the task. It was demonstrated that the improved input-doubling method achieved the highest classification accuracy based on various performance indicators.
format Article
id doaj-art-e4e0628ada11463dbf82e60153f6d744
institution Directory of Open Access Journals
issn 2079-3197
language English
publishDate 2024-10-01
publisher MDPI AG
record_format Article
spelling doaj-art-e4e0628ada11463dbf82e60153f6d7442025-08-20T01:39:08ZengMDPI AGComputation2079-31972024-10-01121020310.3390/computation12100203Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data ProcessingIvan Izonin0Roman Tkachenko1Pavlo Yendyk2Iryna Pliss3Yevgeniy Bodyanskiy4Michal Gregus5Department of Artificial Intelligence, Lviv Polytechnic National University, 79013 Lviv, UkraineDepartment of Publishing Information Technologies, Lviv Polytechnic National University, 79013 Lviv, UkraineDepartment of Artificial Intelligence, Lviv Polytechnic National University, 79013 Lviv, UkraineControl Systems Research Laboratory, Kharkiv National University of Radio Electronics, Nauky Ave. 14, 61166 Kharkiv, UkraineControl Systems Research Laboratory, Kharkiv National University of Radio Electronics, Nauky Ave. 14, 61166 Kharkiv, UkraineFaculty of Management, Comenius University Bratislava, Odbojárov 10, 820 05 Bratislava, SlovakiaCurrently, the tasks of intelligent data analysis in medicine are becoming increasingly common. Existing artificial intelligence tools provide high effectiveness in solving these tasks when analyzing sufficiently large datasets. However, when there is very little training data available, current machine learning methods do not ensure adequate classification accuracy or may even produce inadequate results. This paper presents an enhanced input-doubling method for classification tasks in the case of limited data analysis, achieved via expanding the number of independent attributes in the augmented dataset with probabilities of belonging to each class of the task. The authors have developed an algorithmic implementation of the improved method using two Naïve Bayes classifiers. The method was modeled on a small dataset for cardiovascular risk assessment. The authors explored two options for the combined use of Naïve Bayes classifiers at both stages of the method. It was found that using different methods at both stages potentially enhances the accuracy of the classification task. The results of the improved method were compared with a range of existing methods used for solving the task. It was demonstrated that the improved input-doubling method achieved the highest classification accuracy based on various performance indicators.https://www.mdpi.com/2079-3197/12/10/203data mining techniquessmall data approachclassification algorithmsinput-doubling methodensemble methodsdata linearization
spellingShingle Ivan Izonin
Roman Tkachenko
Pavlo Yendyk
Iryna Pliss
Yevgeniy Bodyanskiy
Michal Gregus
Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
data mining techniques
small data approach
classification algorithms
input-doubling method
ensemble methods
data linearization
title Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
title_full Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
title_fullStr Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
title_full_unstemmed Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
title_short Enhanced Input-Doubling Method Leveraging Response Surface Linearization to Improve Classification Accuracy in Small Medical Data Processing
title_sort enhanced input doubling method leveraging response surface linearization to improve classification accuracy in small medical data processing
topic data mining techniques
small data approach
classification algorithms
input-doubling method
ensemble methods
data linearization
url https://www.mdpi.com/2079-3197/12/10/203
work_keys_str_mv AT ivanizonin enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing
AT romantkachenko enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing
AT pavloyendyk enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing
AT irynapliss enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing
AT yevgeniybodyanskiy enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing
AT michalgregus enhancedinputdoublingmethodleveragingresponsesurfacelinearizationtoimproveclassificationaccuracyinsmallmedicaldataprocessing