Indeterminate values of target variable in development of credit scoring models
In the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Mendel University Press
2013-01-01
|
Series: | Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis |
Subjects: | |
Online Access: | https://acta.mendelu.cz/61/7/2709/ |
id |
doaj-7967f6a309544fbbad37d8d63f41888e |
---|---|
record_format |
Article |
spelling |
doaj-7967f6a309544fbbad37d8d63f41888e2020-11-24T22:07:40ZengMendel University PressActa Universitatis Agriculturae et Silviculturae Mendelianae Brunensis1211-85162464-83102013-01-016172709271610.11118/actaun201361072709Indeterminate values of target variable in development of credit scoring modelsMartin Řezáč0Lukáš Toma1Department of Mathematics and Statistics, Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech RepublicPolytech Nantes, Université de Nantes 1, quai de Tourville BP 13522, 44035 Nantes Cedex 1, FranceIn the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be predicted as well. Nevertheless, the first step is always to define the target variable.The target variable is generally an ’output’ of the model. It contains the information on the available data that we want to predict in future data. In credit scoring it is commonly called good/bad definition. In this paper we study the effect of use of indeterminate value of target variable in development of credit scoring models. We explain the basic principles of logistic regression modelling and selection of target variable. Next, the focus is given to introduction of some of the widely used statistics for model assessment. The main part of the paper is devoted to development and assessment of 27 credit scoring models on real credit data, which are built up and assessed according various definitions of target variable. We show that there is a valid reason for some target definitions to include the indeterminate value into the modelling process, as it provided us with convincing results.https://acta.mendelu.cz/61/7/2709/credit scoringindeterminate valuetarget variableGiniKSlift |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Martin Řezáč Lukáš Toma |
spellingShingle |
Martin Řezáč Lukáš Toma Indeterminate values of target variable in development of credit scoring models Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis credit scoring indeterminate value target variable Gini KS lift |
author_facet |
Martin Řezáč Lukáš Toma |
author_sort |
Martin Řezáč |
title |
Indeterminate values of target variable in development of credit scoring models |
title_short |
Indeterminate values of target variable in development of credit scoring models |
title_full |
Indeterminate values of target variable in development of credit scoring models |
title_fullStr |
Indeterminate values of target variable in development of credit scoring models |
title_full_unstemmed |
Indeterminate values of target variable in development of credit scoring models |
title_sort |
indeterminate values of target variable in development of credit scoring models |
publisher |
Mendel University Press |
series |
Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis |
issn |
1211-8516 2464-8310 |
publishDate |
2013-01-01 |
description |
In the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be predicted as well. Nevertheless, the first step is always to define the target variable.The target variable is generally an ’output’ of the model. It contains the information on the available data that we want to predict in future data. In credit scoring it is commonly called good/bad definition. In this paper we study the effect of use of indeterminate value of target variable in development of credit scoring models. We explain the basic principles of logistic regression modelling and selection of target variable. Next, the focus is given to introduction of some of the widely used statistics for model assessment. The main part of the paper is devoted to development and assessment of 27 credit scoring models on real credit data, which are built up and assessed according various definitions of target variable. We show that there is a valid reason for some target definitions to include the indeterminate value into the modelling process, as it provided us with convincing results. |
topic |
credit scoring indeterminate value target variable Gini KS lift |
url |
https://acta.mendelu.cz/61/7/2709/ |
work_keys_str_mv |
AT martinrezac indeterminatevaluesoftargetvariableindevelopmentofcreditscoringmodels AT lukastoma indeterminatevaluesoftargetvariableindevelopmentofcreditscoringmodels |
_version_ |
1725819201907589120 |