详细信息
A hybrid system with filter approach and multiple population genetic algorithm for feature selection in credit scoring ( SCI-EXPANDED收录 CPCI-S收录)
文献类型:会议论文
英文题名:A hybrid system with filter approach and multiple population genetic algorithm for feature selection in credit scoring
作者:Wang, Di[1,2];Zhang, Zuoquan[1];Bai, Rongquan[1];Mao, Yanan[1]
第一作者:Wang, Di
通讯作者:Zhang, ZQ[1]
机构:[1]Beijing Jiaotong Univ, Sch Sci, Beijing 100044, Peoples R China;[2]Beijing Union Univ, Dept Basic Courses, Beijing 100101, Peoples R China
第一机构:Beijing Jiaotong Univ, Sch Sci, Beijing 100044, Peoples R China
通讯机构:[1]corresponding author), Beijing Jiaotong Univ, Sch Sci, Beijing 100044, Peoples R China.
会议论文集:International Conference on Information and Computational Science (ICICS)
会议日期:AUG 02-06, 2016
会议地点:Dalian Univ Technol, Dalian, PAKISTAN
主办单位:Dalian Univ Technol
语种:英文
外文关键词:Credit scoring; Feature selection; Hybrid approach; HMPGA
摘要:With the financial crisis happened in 2007, massive credit risks are exposed to the banking sectors. So credit scoring has attracted more and more attention. Bank owns a lot of customer data. By using those data, credit scoring model can judge the applicants' credit risk accurately. But those data are often high dimensional, and have some irrelevant features. Those irrelevant features will affect classifiers accuracy. Therefore, feature selection is an important topic. This paper proposes a two-phase hybrid approach based on filter approach and multiple population genetic algorithm-HMPGA. In phase 1, it introduces the idea of wrapper approach into three filter approaches to acquire some important prior information for initial populations setting of MPGA. In phase 2, it takes advantage of MPGA's characteristics of global optimization and quick convergence to find optimal feature subset. This paper uses two real credit scoring datasets of UCI databases to compare HMPGA, MPGA and GA. It verifies that the accuracies of feature subsets acquired from HMPGA, MPGA and GA are superior to three filter approaches. Meanwhile, nonparametric Wilcoxon signed rank test is held to confirm that HMPGA is better than MPGA and GA. HMPGA not only can be applied to feature selection of credit scoring, but also can be applied to more fields of data mining. (C) 2017 Elsevier B.V. All rights reserved.
参考文献:
正在载入数据...