Proposing Logical Table Constructs for Enhanced Machine Learning Process

Machine learning (ML) has shown enormous potential in various domains with the wide variations of underlying data types. Because of the miscellany in the data sets and the features, ML classifiers often suffer from challenges, such as feature miss-classification, unfit algorithms, low accuracy, over...

Full description

Bibliographic Details
Main Authors: Muhammad Fahim Uddin, Syed Rizvi, Abdul Razaque
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8439940/
id doaj-97990bdf4292466b9afc3d2e34116139
record_format Article
spelling doaj-97990bdf4292466b9afc3d2e341161392021-03-29T21:11:37ZengIEEEIEEE Access2169-35362018-01-016477514776910.1109/ACCESS.2018.28660468439940Proposing Logical Table Constructs for Enhanced Machine Learning ProcessMuhammad Fahim Uddin0https://orcid.org/0000-0002-7800-578XSyed Rizvi1Abdul Razaque2Department of Computer Science, School of Engineering, University of Bridgeport, Bridgeport, CT, USADepartment of Information Science and Technologies, Penn State University, Altoona, PA, USADepartment of Computer Science, New York Institute of Technology, New York, NY, USAMachine learning (ML) has shown enormous potential in various domains with the wide variations of underlying data types. Because of the miscellany in the data sets and the features, ML classifiers often suffer from challenges, such as feature miss-classification, unfit algorithms, low accuracy, overfitting, underfitting, extreme bias, and high predictive errors. Through the lens of related study and latest progress in the field, this paper presents a novel scheme to construct logical table (LT) unit with two internal sub-modules for algorithm blend and feature engineering. The LT unit works in the deepest layer of an enhanced ML engine engineering (eMLEE) process. eMLEE consists of several low-level modules to enhance the ML classifier progression. A unique engineering approach is adopted in eMLEE to blend various algorithms, enhance the feature engineering, construct a weighted performance metric, and augment the validation process. The LT is an in-memory logical component, that governs the progress of eMLEE, regulates the model metrics, improves the parallelism, and keep tracks of each module of eMLEE as the classifier learns. Optimum fitness of the model with parallel “check, validate, insert, delete, and update”mechanism in 3-D logical space via structured schemas in the LT is obtained. The LT unit is developed in Python, C#, and R libraries and tested using miscellaneous data sets. Results are created using GraphPad Prism, SigmaPlot, Plotly, and MS Excel software. To support the built and implementation of the proposed scheme, complete mathematical models along with the algorithms, and necessary illustrations are provided in this paper. To show the practicality of the proposed scheme, several simulation results are presented with a comprehensive analysis of the outcomes for the metrics of the model that the LT regulates with improved outcomes.https://ieeexplore.ieee.org/document/8439940/Big datapredictive modelingdata miningmachine learningalgorithmparallel processing of machine learning metrics reading
collection DOAJ
language English
format Article
sources DOAJ
author Muhammad Fahim Uddin
Syed Rizvi
Abdul Razaque
spellingShingle Muhammad Fahim Uddin
Syed Rizvi
Abdul Razaque
Proposing Logical Table Constructs for Enhanced Machine Learning Process
IEEE Access
Big data
predictive modeling
data mining
machine learning
algorithm
parallel processing of machine learning metrics reading
author_facet Muhammad Fahim Uddin
Syed Rizvi
Abdul Razaque
author_sort Muhammad Fahim Uddin
title Proposing Logical Table Constructs for Enhanced Machine Learning Process
title_short Proposing Logical Table Constructs for Enhanced Machine Learning Process
title_full Proposing Logical Table Constructs for Enhanced Machine Learning Process
title_fullStr Proposing Logical Table Constructs for Enhanced Machine Learning Process
title_full_unstemmed Proposing Logical Table Constructs for Enhanced Machine Learning Process
title_sort proposing logical table constructs for enhanced machine learning process
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Machine learning (ML) has shown enormous potential in various domains with the wide variations of underlying data types. Because of the miscellany in the data sets and the features, ML classifiers often suffer from challenges, such as feature miss-classification, unfit algorithms, low accuracy, overfitting, underfitting, extreme bias, and high predictive errors. Through the lens of related study and latest progress in the field, this paper presents a novel scheme to construct logical table (LT) unit with two internal sub-modules for algorithm blend and feature engineering. The LT unit works in the deepest layer of an enhanced ML engine engineering (eMLEE) process. eMLEE consists of several low-level modules to enhance the ML classifier progression. A unique engineering approach is adopted in eMLEE to blend various algorithms, enhance the feature engineering, construct a weighted performance metric, and augment the validation process. The LT is an in-memory logical component, that governs the progress of eMLEE, regulates the model metrics, improves the parallelism, and keep tracks of each module of eMLEE as the classifier learns. Optimum fitness of the model with parallel “check, validate, insert, delete, and update”mechanism in 3-D logical space via structured schemas in the LT is obtained. The LT unit is developed in Python, C#, and R libraries and tested using miscellaneous data sets. Results are created using GraphPad Prism, SigmaPlot, Plotly, and MS Excel software. To support the built and implementation of the proposed scheme, complete mathematical models along with the algorithms, and necessary illustrations are provided in this paper. To show the practicality of the proposed scheme, several simulation results are presented with a comprehensive analysis of the outcomes for the metrics of the model that the LT regulates with improved outcomes.
topic Big data
predictive modeling
data mining
machine learning
algorithm
parallel processing of machine learning metrics reading
url https://ieeexplore.ieee.org/document/8439940/
work_keys_str_mv AT muhammadfahimuddin proposinglogicaltableconstructsforenhancedmachinelearningprocess
AT syedrizvi proposinglogicaltableconstructsforenhancedmachinelearningprocess
AT abdulrazaque proposinglogicaltableconstructsforenhancedmachinelearningprocess
_version_ 1724193433031016448