999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

The Application of Bagging for Solving Small-Businesses’ Credit Assessment Problems Based on Daily T

2009-04-29 00:00:00DONGYan-wen
中國管理信息化 2009年15期

Abstract: Although many efforts have been made towards assessing credit risk, and various models and methods for credit assessment have been developed, most of them are based on financial data or stock price and some investigation data provided by special inquiry agencies. Because most of minor and small businesses do not disclose their financial information, it is almost impossible to assess their credit using models or methods published so far. Here we have proposed a new approach to assess the customers’ credit only based on daily transaction data such as sales, payments by customers, amount of overdue payment, etc. This paper proposes a credit assessment system using bagging method. It aims to deal with the issue that the number of unsound customers is much less than that of healthy ones and improve the ability for identifying unsound customers. The performance and effectiveness of the proposed system is confirmed by applying it to the real problems.

Key words: Ensembles Bagging; Customer Evaluation; Credit Risk; Credit Scoring

doi:10.3969/j.issn.1673-0194.2009.15.035

CLC number: TP224;F275Article character:AArticle ID:1673-0194(2009)15-0115-05

1 Introduction

In today’s increasingly competitive business environment, risk management has become an important topic. Credit risk is one of management risks being faced frequently and most simply defined as the potential that counterparty will fail to meet its obligations in accordance with agreed terms. Because there are many types of counterparties from individuals to sovereign governments and many different types of obligations from auto loans to derivatives transactions credit risk takes many forms. There have been significant prior researches on credit analysis or credit evaluation. The models and methodologies published so far for credit risk assessment generally fall into two categories: default models and credit scoring models. Default models assess the likelihood of default by an obligor. Credit scoring models are used to assess the credit quality of counterparty.

The application of statistical techniques to credit analysis started in the 1960’s with the development of computers. Beaver[1] is one of the first researchers to introduced discriminant analysis (DA) to study bankruptcy prediction. From the 1980’s, the DA method was replaced by other statistical techniques such as logit analysis, probit analysis, multidimensional scaling and so on[2][3]. Artificial Intelligence (AI) techniques, particularly rule-based expert systems, case-based reasoning (CBR) systems and machine learning techniques such as neural networks have been used to credit rating or bankruptcy analysis. Recently, researchers have proposed the hybrid data mining approach in the design of an effective credit scoring model[4-6].

Credit scoring models vary regarding the type and quantity of the data needed for decision making, most of prior studies are limited to be applied mainly in financial community companies such as commercial banks where the customers are usually required to submit their financial data and/or others. However, many of companies of non-financial community cannot require financial data and/or others from their customers. It is difficult in particular to obtain the customers’ data in the situation where the customers are small businesses without disclosing financial information. Therefore, most of models and methods for credit assessment, which are suitable and effective to commercial banks or large business, cannot always be applied to small business.

To deal with the customers’ credit assessment problem in a small company, we have proposed some new approaches to assess the customers’ credit only based on daily transaction data such as sales, payments by customers, amount of overdue payment, etc[7] [8]. Because the data can be extracted almost automatically from the database of the management information system, these approaches are suitable to be applied to many of organizations where the customers do not disclose their financial data and have an advantage of a low cost to collect data over other ones published so far.

However, some new issues have to be addressed. One of them is that daily transaction data contains usually a large amount of transaction records and so it is necessary to reduce the data size to improve the learning efficiency. Another issue is how to improve the ability for identifying the unsound customers. Because the number of unsound customers is much less than that of healthy ones, it is more important to indentify unsound customers more accurately than improving the overall accuracy. In order to deal with these issues, this paper proposes a credit assessment system using bagging method. The architecture of the system is described by giving the procedure for generating multiple versions of a customer’s credit assessment through multivariate discriminant analysis (MDA) and aggregating these to get a final assessment through a plurality vote. The performance and effectiveness of the proposed system is confirmed by applying it to the real problems of the company.

Table 1 Credit scores

ScoreCustomers

1A healthy customer for which all orders are accepted.

2A customer for which orders are accepted but limited to a given amount.

3A customer for which orders are accepted only in cash sale.

4An unsound customer for which all of orders are rejected.

2 Credit Assessment Problem in a Small Company

This paper considers the credit assessment problem in a small company that the main business is selling school uniforms and accessories at wholesale. There are 20 employees in the company, and the annual sale is about 600 million Japanese yen. Orders come from about 800 customers that are classified into three types: retailer, school and others. The customers’ credit has been assessed through a four-grade credit score as shown in Table 1.

Because most of the customers are minor small businesses without disclosure of financial information, it is almost impossible to obtain their financial data. It is also difficult to frequently ask an agency for evaluating customers’ credit due to limited budget. For these reasons, it is obviously preferable to develop a system that be able to assess the customers’ credit only based on daily transaction data.

Figure 1 Scheme of credit assessment system using bagging

3 Features Data Extraction

Considering the available data that can be obtained from the company, we collected eight basic features and represent customer Ci (i=1,2, ,n) by the following data structure:

Ci: (xi1, xi2, xi3, , xi8)(1)

where n is the number of customers, xij (j=1,2, ,8) are the features of customer Ci defined as the following:

xi1, xi2: 0-1 variables representing the type of customer Ci as shown in Table 2.

Table 2 0-1 Variables xi1 and xi2

Type of customers xi1xi2

retailer00

school01

other10

 xi3: average amount of overdue payment in the year considered.

 xi4: maximum overdue days for all of overdue payment in the year considered.

 xi5: number of times that overdue payment occurs in the year considered.

 xi6: total sales in the year considered.

 xi7: rate of the average amount of overdue payment to the total sales, i.e. xi7 =xi3 / xi6.

 xi8: number of transaction months that any order from the customer is fulfilled in the year considered.

4 Credit Assessment System Using Bagging

4.1 System scheme

Bagging, which stands for bootstrap aggregating, is one of the earliest and perhaps the simplest ensemble based learning algorithms, with a surprisingly good performance [9]. Diversity of classifiers in bagging is obtained by using bootstrapped replicas of the training data. That is, different training data subsets are randomly drawn with replacement from the entire training dataset. Each training data subset is used to train a different classifier of the same type. Individual classifiers are then combined by taking a simple majority vote of their decisions.

Here the credit assessment system using bagging is proposed and showed Figure 1.

4.2 Procedure for credit assessment

When assessing customers’ credit, the number of healthy customers is usually much larger than that of unsound ones. If the entire dataset of all customers is used as the training dataset, the features of healthy customers are often over-learned but the unsound ones are frequently under learned. Practically, it is more important to identify unsound customers than doing healthy ones in order to control credit risk as low as possible. Considering this aspect, a revision is made in the sampling method of the standard bagging algorithm and bagging procedure for credit assessment is proposed as follows.

[Step 1] Data preparation

The entire dataset is S={ (Ci, csi) | i=1,2, ,n }, where Ci is the features data of customers Ci defined in equation (1) and csi (csi∈{1,2,3,4}) is its credit score. Divide S into two sub dataset S1 and S2 as:

S1={ (Ci, csi) | csi =1, i=1,2, ,n } (2)

S2={ (Ci, csi) | csi >1, i=1,2, ,n } (3)

S1 means the subset of healthy customers and S2 represents the subset of unsound customers. The sizes of dataset S1 and S2 are n1 and n2 respectively.

[Step 2] Sampling with replacement (bootstrap)

Let N be the number of bootstrapped replicas, and build the replica Bk (k=1,2,…,N) by drawing randomly p customers and q customers-with replacement-from the dataset S1 and the dataset S2 respectively.

[Step 3] Weak learning

Using replica Bk (k=1,2, ,N) as the training set that Ci be the input variables and csi be the class label, the Classifier Hk can be constructed by multivariate discriminant analysis, which consists of four discriminant functions. The ensemble E can be obtained as E={ Hk | k=1,2, ,N }.

[Step 4] Majority voting

Given a target customer NC, its features data can be collected and denoted as:

NC: (z1, z2, …, zm)(4)

where z1, z2,…, zm have the same definitions as xi1, xi2,…,xim. Then its credit score cf can be decided by the following steps:

[Step 4.1] Let Gj (j=1,2,3,4) be the class label that is corresponding to the credit score of j. Applying the Classifier Hk to the target customer NC, the class label of NC can be given as cfk (cfk∈{G1,G2,G3,G4}).

[Step 4.2] Let νkj=1,if cfk=Gj0, ptherwise and νj=∑Nk=1νkj, the credit score cf is chosen by taking a simple majority vote as:

cf=Argmax {vj, j=1,2,3,4}(5)

5 Experiment and Discussion

To investigate the performance and effectiveness of the proposed system, we apply the proposed system to the real credit assessing problems of the company. The features data of the customers in 2001 financial year and their credit scores given by financial managers of the company are collected as shown in Table 3.

Table 3 Features data of 498 customers

FeaturesMeanStandard deviationNumber of customers

xi3196 171762 532Credit scoreNumber

xi456.3773.301474

xi52.923.0622

xi61 201 3903 706 05732

xi70.220.37420

xi85.564.14Total498

Let p=24, q=24 and N=5, 10, 24, we apply the proposed system to classify the customers of Table 3, and further the classification results are compared with that given by the financial managers of the company. The comparison results are shown in Table 4. The credit scores for the customers with score of 2, 3 and 4 provided by the system are 100%, 50% and 90% respectively in agreement with the judgments of the financial managers of the company, and the hit rate is not sensitive to N (the number of classifier).

The classification result based on all of customers’ data is also obtained as shown in Table 5. Comparing the hit rates of Table 4 and Table 5, it is obvious that the proposed system can classify unsound customers more effectively than ordinary discriminant analysis. But the system gave lower hit rate for classifying the healthy customers.

6 Concluding Remarks

This paper dealt with customers’ credit scoring problems in a small company and intended to assess the customer’s credit based only on daily transaction data such as sales, payments by customers, amount of overdue payment, etc. A credit assessment system using bagging method was proposed and the performance and effectiveness is confirmed by applying it to the real problems of the company. The experiment results showed that the system can classify unsound customers more effectively than ordinary discriminant analysis.

Breiman[10] reported that linear discriminant analysis is of stable classifiers and has low variance, but can have high bias and therefore bagging cannot work well. It can be expected that the performance of the system can be further improved by introducing appropriate unstable classifiers.

Acknowledgement

This work was partly supported by grant-in-aid for scientific research (C) from the japan society for the promotion of science under grant no. 19530324.

References

[1] W Beaver. Financial Ratios as Predictor of Failure[J].Journal of Accounting Research (Supplement), 1966, 4: 71-111.

[2] L H Ederington. Classification Models and Bond Ratings[J]. The Financial Review, 1985, 20(4):237-262.

[3] M C Mar, C G Apellaniz, C S Cinca.A Multivariate Study of Spanish Bond Ratings[J]. Omega, 1996, 24(4): 451-462, 1996.

[4] D West.Neural Network Credit Scoring Models[J]. Computers and Operations Research, 2000,27(11/12): 1131-1152.

[5] B Baesens, et al.Benchmarking State-of-the-art Classification Algorithms for Credit Scoring[J]. Journal of the Operational Research Society, 2003, 54(6): 627-635.

[6] N C Hsieh. Hybrid Mining Approach in the Design of Credit Scoring Models[J]. Expert Systems with Applications, 2005, 28(4): 655-665.

[7] Y DONG. A Case Based Reasoning System for Evaluating Customer Credit[J]. Journal of Japan Industrial Management Association, 2006, 57(2): 144-152 (In Japanese language).

[8] Y DONG. Development of a Customer Credit Evaluation System via Case-based Reasoning Approach[J]. Asia-Pacific Journal of Industrial Management, 2008, 1(1): 1-7.

[9] L Breiman. Bagging predictors[J]. Machine Learning, 1996, 24(2): 123-140.

[10] L Breiman. Bias, Variance, and Arcing Classifiers[R]. Technical Report 460, Statistics Department, University of California, 1996: 1-25.


登錄APP查看全文

主站蜘蛛池模板: 久久久久青草大香线综合精品| 拍国产真实乱人偷精品| 国产成人精品综合| 拍国产真实乱人偷精品| 亚洲最大情网站在线观看| 日韩A∨精品日韩精品无码| aaa国产一级毛片| 朝桐光一区二区| 亚洲综合经典在线一区二区| 日本黄色不卡视频| 67194在线午夜亚洲| 亚洲第一福利视频导航| 国产成人三级在线观看视频| 国产精品视频第一专区| 国产亚洲男人的天堂在线观看| 一本大道香蕉久中文在线播放| 欧美在线一级片| 高清色本在线www| 久久久波多野结衣av一区二区| 久久成人国产精品免费软件| 成人毛片免费在线观看| 九月婷婷亚洲综合在线| 在线免费a视频| 成人国产精品一级毛片天堂| 国产麻豆精品在线观看| 亚洲色中色| 精品国产成人a在线观看| 国产69囗曝护士吞精在线视频| 性欧美久久| 国产成人成人一区二区| 热久久这里是精品6免费观看| 久久久久无码国产精品不卡 | 一本一本大道香蕉久在线播放| 亚洲欧美日韩精品专区| 亚洲自偷自拍另类小说| 国产尹人香蕉综合在线电影 | 91福利一区二区三区| 欧美亚洲网| 国产波多野结衣中文在线播放| 不卡视频国产| 五月综合色婷婷| 中文成人在线| 国产在线麻豆波多野结衣| 一级不卡毛片| 午夜色综合| 韩国v欧美v亚洲v日本v| 免费高清a毛片| www欧美在线观看| 精品伊人久久久久7777人| 国产理论最新国产精品视频| 色网站在线免费观看| 99热线精品大全在线观看| 精品无码一区二区在线观看| 激情五月婷婷综合网| 亚洲天堂网在线播放| 黄网站欧美内射| 国产杨幂丝袜av在线播放| 国产人成网线在线播放va| 性喷潮久久久久久久久| 性欧美在线| 免费观看三级毛片| 精品一区二区久久久久网站| 成人综合在线观看| 亚洲精品成人片在线观看| 国产香蕉一区二区在线网站| 欧美精品色视频| 精品国产女同疯狂摩擦2| 天堂成人av| 日本一区中文字幕最新在线| 国产91高跟丝袜| 亚洲欧美日韩久久精品| 欧美国产日本高清不卡| 国产精品免费露脸视频| 99久久精品国产综合婷婷| 午夜福利免费视频| 国模极品一区二区三区| 日本a∨在线观看| 国产人在线成免费视频| 久久国产亚洲欧美日韩精品| 欧美在线伊人| 免费视频在线2021入口| 22sihu国产精品视频影视资讯|