An analysis on new hybrid parameter selection model performance over big data set

Parameter selection or attribute selection is one of the crucial tasks in the data analysis process. Incorrect selection of the important attribute might generate imprecise or event for a wrong decision. It is an advantage if the decision-maker could select and apply the best model that helps in ide...

Full description

Bibliographic Details
Main Authors: Fujita, H. (Author), Krejcar, O. (Author), Mohamad, M. (Author), Selamat, A. (Author), Wu, T. (Author)
Format: Article
Language:English
Published: Elsevier B.V., 2020
Subjects:
Online Access:View Fulltext in Publisher
View in Scopus
LEADER 02615nam a2200361Ia 4500
001 10.1016-j.knosys.2019.105441
008 220121s2020 CNT 000 0 und d
020 |a 09507051 (ISSN) 
245 1 0 |a An analysis on new hybrid parameter selection model performance over big data set 
260 0 |b Elsevier B.V.,  |c 2020 
650 0 4 |a Analysis tool 
650 0 4 |a Analysis tools 
650 0 4 |a Attribute selection 
650 0 4 |a Best-first-search algorithm 
650 0 4 |a Big data 
650 0 4 |a Data handling 
650 0 4 |a Decision 
650 0 4 |a Decision making 
650 0 4 |a Hybrid method 
650 0 4 |a Information analysis 
650 0 4 |a Investigate and analyze 
650 0 4 |a Parameter selection 
650 0 4 |a Parameterization model 
650 0 4 |a Set theory 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1016/j.knosys.2019.105441 
856 |z View in Scopus  |u https://www.scopus.com/inward/record.uri?eid=2-s2.0-85078493275&doi=10.1016%2fj.knosys.2019.105441&partnerID=40&md5=70d0c3bb6d4306304b365d6dcacee1a6 
520 3 |a Parameter selection or attribute selection is one of the crucial tasks in the data analysis process. Incorrect selection of the important attribute might generate imprecise or event for a wrong decision. It is an advantage if the decision-maker could select and apply the best model that helps in identifying the best-optimized attribute set — in the decision analysis process. Recently, many data scientists from various application areas are attracted to investigate and analyze the advantages and disadvantages of big data. One of the issues is, analyzing large volumes and variety of data in a big data environment is very challenging to the data scientists when there is a lack of a suitable model or no appropriate model to be implemented and used as a guideline. Hence, this paper proposes an alternative parameterization model that is able to generate the most optimized attribute set without requiring a high cost to learn, to use, and to maintain. The model is based on two integrated models that are combined with correlation-based feature selection, best-first search algorithm, soft set, and rough set theories which were compliments to each other as a parameter selection method. Experimental have shown that the proposed model has significantly shown as an alternative model in a big data analysis process. © 2020 The Authors 
700 1 0 |a Fujita, H.  |e author 
700 1 0 |a Krejcar, O.  |e author 
700 1 0 |a Mohamad, M.  |e author 
700 1 0 |a Selamat, A.  |e author 
700 1 0 |a Wu, T.  |e author 
773 |t Knowledge-Based Systems  |x 09507051 (ISSN)  |g 192