###
计算机系统应用英文版:2021,30(1):277-281
本文二维码信息
码上扫一扫!
土壤属性数据pH缺失的插补方法
(1.北京林业大学 信息学院, 北京 100083;2.国家林业草原林业智能信息处理工程技术研究中心, 北京 100083)
Imputation Method to Predict Missing pH Data of Soil Attribute
(1.School of Information Science and Technology, Beijing Forestry University, Beijing 100083, China;2.Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, Beijing 100083, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 924次   下载 2119
Received:May 23, 2020    Revised:June 19, 2020
中文摘要: 土壤分析研究中属性数据缺失的现象时常发生, 为了提高研究结果的可靠性, 有必要对土壤属性数据的缺失值插补方法进行研究. 从数据挖掘的角度利用多种缺失值处理方法来对缺失值进行插补, 以中国主要农田生态系统土壤养分数据库的pH属性为研究对象, 并且从真实值和插补值的拟合优度和插补误差两个方面评估各个方法在不同缺失率的数据集上的表现. 结果表明, 对比其他方法, 如多元回归、SVM、神经网络, 采用最优参数的KNN和随机森林插补方法对土壤属性数据pH进行插补是有效可行的. KNN和随机森林在不同缺失率的数据集上插补缺失数据pH的MAERMSER2的均值分别为0.132和0.131, 0.174和0.178, 0.775和0.765.
Abstract:The problem of the absence of attribute data often occurs in soil analysis and research. To improve the reliability of the research results, it is necessary to study the imputation methods for soil attribute missing data. In this study, a variety of imputation methods have been evaluated to interpolate the soil attribute missing data from the perspective of data mining. Using soil attribute pH as an interpolation object, the Soil Nutrient Database of China’s Major Ecosystems is used as the source of physical and chemical soil attribute data. We evaluate the performance of each method on the dataset of different missing rates in terms of model fitting and imputation error. The result shows that it is feasible to impute soil attribute pH missing data using the optimal parameter K-Nearest Neighbor (KNN) and random forest than other methods, such as multivariable regression, support vector machine, and neural network. The mean value of MAERMSE and R2 of the imputed missing data pH of KNN and random forest on the dataset with different missing rates are 0.132 and 0.131, 0. 174 and 0.178, 0.775 and 0.765, respectively.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(61602042)
引用文本:
张逸飞,曹佳.土壤属性数据pH缺失的插补方法.计算机系统应用,2021,30(1):277-281
ZHANG Yi-Fei,CAO Jia.Imputation Method to Predict Missing pH Data of Soil Attribute.COMPUTER SYSTEMS APPLICATIONS,2021,30(1):277-281