Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit

Credit has now become a trend in society. The problem with credit is the improper history of credit card usage. The resulting impact can lead to bad credit. If customers fail to pay off debts that have been agreed upon with the bank, they can increase their credit risk. This study aims to conduct a...

Full description

Bibliographic Details
Published in:Computer Science
Main Authors: Amrin Amrin, Omar Pahlevi, Harsih Rianto
Format: Article
Language:English
Published: LPPM Universitas Bina Sarana Informatika 2025-01-01
Subjects:
Online Access:https://jurnal.bsi.ac.id/index.php/co-science/article/view/6208
_version_ 1848651434402250752
author Amrin Amrin
Omar Pahlevi
Harsih Rianto
author_facet Amrin Amrin
Omar Pahlevi
Harsih Rianto
author_sort Amrin Amrin
collection DOAJ
container_title Computer Science
description Credit has now become a trend in society. The problem with credit is the improper history of credit card usage. The resulting impact can lead to bad credit. If customers fail to pay off debts that have been agreed upon with the bank, they can increase their credit risk. This study aims to conduct a comparative analysis of three data mining classification methods: the C4.5 algorithm, Chi-Squared Automatic Interaction Detection (CHAID), and Random Forest. The goal is to classify creditworthiness status. The researcher used 481 vehicle credit records with "bad" and "good" reviews. In this study, the independent variables used are dependent status, age, marital status, occupation, income, employment status, company status, last education, length of stay, house condition, and down payment. For creditworthiness assessment, the C4.5 model shows a truth accuracy rate of 91.90% with an area under the curve (AUC) value of 0.915. The CHAID model shows a truth accuracy rate of 63.83% with an AUC value of 0.661, and the Random Forest model shows a truth accuracy rate of 78.60% with an AUC value of 0.907. The evaluation results show that both the Random Forest and C4.5 algorithms have high accuracy rates and AUC values.
format Article
id doaj-e66cf130a97243c9944bb3ae2ecbff7e
institution Directory of Open Access Journals
issn 2808-9065
2774-9711
language English
publishDate 2025-01-01
publisher LPPM Universitas Bina Sarana Informatika
record_format Article
spelling doaj-e66cf130a97243c9944bb3ae2ecbff7e2025-11-03T00:46:57ZengLPPM Universitas Bina Sarana InformatikaComputer Science2808-90652774-97112025-01-0151495710.31294/coscience.v5i1.62086231Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan KreditAmrin Amrin0Omar Pahlevi1Harsih Rianto2Universitas Bina Sarana InformatikaUniversitas Bina Sarana InformatikaUniversitas Bina Sarana InformatikaCredit has now become a trend in society. The problem with credit is the improper history of credit card usage. The resulting impact can lead to bad credit. If customers fail to pay off debts that have been agreed upon with the bank, they can increase their credit risk. This study aims to conduct a comparative analysis of three data mining classification methods: the C4.5 algorithm, Chi-Squared Automatic Interaction Detection (CHAID), and Random Forest. The goal is to classify creditworthiness status. The researcher used 481 vehicle credit records with "bad" and "good" reviews. In this study, the independent variables used are dependent status, age, marital status, occupation, income, employment status, company status, last education, length of stay, house condition, and down payment. For creditworthiness assessment, the C4.5 model shows a truth accuracy rate of 91.90% with an area under the curve (AUC) value of 0.915. The CHAID model shows a truth accuracy rate of 63.83% with an AUC value of 0.661, and the Random Forest model shows a truth accuracy rate of 78.60% with an AUC value of 0.907. The evaluation results show that both the Random Forest and C4.5 algorithms have high accuracy rates and AUC values.https://jurnal.bsi.ac.id/index.php/co-science/article/view/6208classificationalgorithm c4.5chi-squared automatic interaction detection (chaid)random forestconfusion matrix
spellingShingle Amrin Amrin
Omar Pahlevi
Harsih Rianto
Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
classification
algorithm c4.5
chi-squared automatic interaction detection (chaid)
random forest
confusion matrix
title Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
title_full Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
title_fullStr Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
title_full_unstemmed Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
title_short Analisa Komparasi Model Data Mining Algoritma C4.5, CHAID, dan Random Forest Untuk Penilaian Kelayakan Kredit
title_sort analisa komparasi model data mining algoritma c4 5 chaid dan random forest untuk penilaian kelayakan kredit
topic classification
algorithm c4.5
chi-squared automatic interaction detection (chaid)
random forest
confusion matrix
url https://jurnal.bsi.ac.id/index.php/co-science/article/view/6208
work_keys_str_mv AT amrinamrin analisakomparasimodeldataminingalgoritmac45chaiddanrandomforestuntukpenilaiankelayakankredit
AT omarpahlevi analisakomparasimodeldataminingalgoritmac45chaiddanrandomforestuntukpenilaiankelayakankredit
AT harsihrianto analisakomparasimodeldataminingalgoritmac45chaiddanrandomforestuntukpenilaiankelayakankredit