TY - JOUR
T1 - Classification Restricted Boltzmann Machine for comprehensible credit scoring model
AU - Tomczak, Jakub M.
AU - Zie¸ba, Maciej
PY - 2015/1/1
Y1 - 2015/1/1
N2 - Credit scoring is the assessment of the risk associated with a consumer (an organization or an individual) that apply for the credit. Therefore, the problem of credit scoring can be stated as a discrimination between those applicants whom the lender is confident will repay credit and those applicants who are considered by the lender as insufficiently reliable. In this work we propose a novel method for constructing comprehensible scoring model by applying Classification Restricted Boltzmann Machines (ClassRBM). In the first step we train the ClassRBM as a standalone classifier that has ability to predict credit status but does not contain interpretable structure. In order to obtain comprehensible model, first we evaluate the relevancy of each of binary features using ClassRBM and further we use these values to create the scoring table (scorecard). Additionally, we deal with the imbalanced data issue by proposing a procedure for determining the cutting point using the geometric mean of specificity and sensitivity. We evaluate our approach by comparing its performance with the results gained by other methods using four datasets from the credit scoring domain.
AB - Credit scoring is the assessment of the risk associated with a consumer (an organization or an individual) that apply for the credit. Therefore, the problem of credit scoring can be stated as a discrimination between those applicants whom the lender is confident will repay credit and those applicants who are considered by the lender as insufficiently reliable. In this work we propose a novel method for constructing comprehensible scoring model by applying Classification Restricted Boltzmann Machines (ClassRBM). In the first step we train the ClassRBM as a standalone classifier that has ability to predict credit status but does not contain interpretable structure. In order to obtain comprehensible model, first we evaluate the relevancy of each of binary features using ClassRBM and further we use these values to create the scoring table (scorecard). Additionally, we deal with the imbalanced data issue by proposing a procedure for determining the cutting point using the geometric mean of specificity and sensitivity. We evaluate our approach by comparing its performance with the results gained by other methods using four datasets from the credit scoring domain.
KW - Comprehensible model
KW - Credit scoring
KW - Imbalanced data
KW - Restricted Boltzmann Machine
UR - http://www.scopus.com/inward/record.url?scp=84910615991&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84910615991&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2014.10.016
DO - 10.1016/j.eswa.2014.10.016
M3 - Article
AN - SCOPUS:84910615991
SN - 0957-4174
VL - 42
SP - 1789
EP - 1796
JO - Expert Systems with Applications
JF - Expert Systems with Applications
IS - 4
ER -