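Details
Document type: Journal article
Chinese title: 滑坡数据连续属性值处理的研究
English title: Research in Processing Continuous Property Data of Landslide
Authors: Qi Chengming (亓呈明) [1]; Cui Shoumei (崔守梅) [2]
First author: Qi Chengming (亓呈明)
Affiliations: [1] College of Automation, Beijing Union University; [2] Zibo Normal College, Shandong Province
First affiliation: College of Urban Rail Transit and Logistics, Beijing Union University
Year: 2006
Issue: 08X
Pages: 10-11
Chinese journal name: 微计算机信息
Foreign-language journal name: Control & Automation
Indexed in: Peking University Core Journals (2004 edition)
Language: Chinese
Chinese keywords: continuous attribute values; clustering; landslide
Foreign-language keywords: continuous property, cluster, Landslide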
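Abstract: Data preprocessing is key to improving the accuracy and performance of the mining process. Based on an analysis of decision tree algorithms and of the characteristics of attribute values in landslide data, this paper uses clustering to partition continuous attribute values into intervals and proposes a discretization method for continuous attribute values in landslide data. Experiments show that the decision tree built with the new method has a higher classification accuracy and fewer redundant rules than the one built with the original algorithm.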
Data preprocessing is essential to improving the accuracy of data mining. By analyzing the decision tree algorithm and the properties of landslide data, we develop a new method that discretizes continuous properties using clustering. We compare the performance of the method with that of the original algorithm on two properties of the data sets. The results provide evidence that (a) the new method is competitive with the original algorithm with respect to predictive accuracy, and (b) the rule sets discovered by the new method are simpler (smaller) than the rule sets discovered by the original algorithm.
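The abstract does not say which clustering algorithm or how many intervals the authors use, so the following is only a minimal sketch of the general idea of clustering-based discretization, not the paper's method. It assumes one-dimensional k-means, takes the midpoints between neighbouring centroids as interval boundaries, and uses made-up sample values; the function names `kmeans_1d`, `cut_points`, and `discretize` are hypothetical.

```python
from bisect import bisect_right


def kmeans_1d(values, k, iterations=100):
    """Cluster 1-D values into k groups and return the sorted centroids.

    Assumption: plain k-means stands in for the unspecified clustering step.
    """
    data = sorted(values)
    n = len(data)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of values")
    # Deterministic start: centroids spread evenly over the sorted data.
    centroids = [data[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in data:
            nearest = min(range(k), key=lambda j: abs(v - centroids[j]))
            groups[nearest].append(v)
        # An empty cluster keeps its previous centroid.
        updated = [sum(g) / len(g) if g else c
                   for g, c in zip(groups, centroids)]
        if updated == centroids:
            break
        centroids = updated
    return sorted(centroids)


def cut_points(centroids):
    """Interval boundaries: midpoints between neighbouring centroids."""
    return [(a + b) / 2 for a, b in zip(centroids, centroids[1:])]


def discretize(value, cuts):
    """Map a continuous value to the index of the interval it falls in."""
    return bisect_right(cuts, value)


# Hypothetical continuous attribute values (not data from the paper).
sample = [10, 12, 11, 30, 32, 31, 55, 57, 56]
cuts = cut_points(kmeans_1d(sample, k=3))
labels = [discretize(v, cuts) for v in sample]
```

The resulting interval indices can then be passed to a decision tree learner as a categorical attribute in place of the raw continuous values. The number of clusters `k` is a free parameter in this sketch.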