Indeterminate values of target variable in development of credit scoring models

In the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be...

Full description

Bibliographic Details
Main Authors: Martin Řezáč, Lukáš Toma
Format: Article
Language:English
Published: Mendel University Press 2013-01-01
Series:Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
Subjects:
KS
Online Access:https://acta.mendelu.cz/61/7/2709/
id doaj-7967f6a309544fbbad37d8d63f41888e
record_format Article
spelling doaj-7967f6a309544fbbad37d8d63f41888e2020-11-24T22:07:40ZengMendel University PressActa Universitatis Agriculturae et Silviculturae Mendelianae Brunensis1211-85162464-83102013-01-016172709271610.11118/actaun201361072709Indeterminate values of target variable in development of credit scoring modelsMartin Řezáč0Lukáš Toma1Department of Mathematics and Statistics, Faculty of Science, Masaryk University, Kotlářská 2, 611 37 Brno, Czech RepublicPolytech Nantes, Université de Nantes 1, quai de Tourville BP 13522, 44035 Nantes Cedex 1, FranceIn the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be predicted as well. Nevertheless, the first step is always to define the target variable.The target variable is generally an ’output’ of the model. It contains the information on the available data that we want to predict in future data. In credit scoring it is commonly called good/bad definition. In this paper we study the effect of use of indeterminate value of target variable in development of credit scoring models. We explain the basic principles of logistic regression modelling and selection of target variable. Next, the focus is given to introduction of some of the widely used statistics for model assessment. The main part of the paper is devoted to development and assessment of 27 credit scoring models on real credit data, which are built up and assessed according various definitions of target variable. We show that there is a valid reason for some target definitions to include the indeterminate value into the modelling process, as it provided us with convincing results.https://acta.mendelu.cz/61/7/2709/credit scoringindeterminate valuetarget variableGiniKSlift
collection DOAJ
language English
format Article
sources DOAJ
author Martin Řezáč
Lukáš Toma
spellingShingle Martin Řezáč
Lukáš Toma
Indeterminate values of target variable in development of credit scoring models
Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
credit scoring
indeterminate value
target variable
Gini
KS
lift
author_facet Martin Řezáč
Lukáš Toma
author_sort Martin Řezáč
title Indeterminate values of target variable in development of credit scoring models
title_short Indeterminate values of target variable in development of credit scoring models
title_full Indeterminate values of target variable in development of credit scoring models
title_fullStr Indeterminate values of target variable in development of credit scoring models
title_full_unstemmed Indeterminate values of target variable in development of credit scoring models
title_sort indeterminate values of target variable in development of credit scoring models
publisher Mendel University Press
series Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
issn 1211-8516
2464-8310
publishDate 2013-01-01
description In the beginning of every modelling procedure, the first question to ask is what we are trying to predict by the model. In credit scoring the most frequent case is modelling of probability of default; however other situations, such as fraud, revolving of the credit or success of collections could be predicted as well. Nevertheless, the first step is always to define the target variable.The target variable is generally an ’output’ of the model. It contains the information on the available data that we want to predict in future data. In credit scoring it is commonly called good/bad definition. In this paper we study the effect of use of indeterminate value of target variable in development of credit scoring models. We explain the basic principles of logistic regression modelling and selection of target variable. Next, the focus is given to introduction of some of the widely used statistics for model assessment. The main part of the paper is devoted to development and assessment of 27 credit scoring models on real credit data, which are built up and assessed according various definitions of target variable. We show that there is a valid reason for some target definitions to include the indeterminate value into the modelling process, as it provided us with convincing results.
topic credit scoring
indeterminate value
target variable
Gini
KS
lift
url https://acta.mendelu.cz/61/7/2709/
work_keys_str_mv AT martinrezac indeterminatevaluesoftargetvariableindevelopmentofcreditscoringmodels
AT lukastoma indeterminatevaluesoftargetvariableindevelopmentofcreditscoringmodels
_version_ 1725819201907589120