本文已被:浏览 2151次 下载 7641次
Received:February 23, 2017 Revised:March 09, 2017
Received:February 23, 2017 Revised:March 09, 2017
中文摘要: 在当今大数据时代下,数据质量的保证是大数据价值得以发挥的前提,数据质量的评估是其中一个重要的研究课题.本文基于规则库的数据质量评估方法,提出了数据质量评估整体模型,包括规则、规则库、数据质量评估指标、评估模板、评估报告.设计了规则评估模板,组合规则库中的规则,根据数据质量评估指标的重要性设置规则的权重,采用简单比率法和加权平均法相结合的评估方法,计算评估结果并确定数据质量的等级,利用了数据可视化技术来展现数据质量的评估结果.本文既考虑了单个规则的执行合格率,又考虑了各规则在数据质量评估模板中的比重,公正地准确地评估数据质量,并且简洁、直观地呈现评估结果.
Abstract:In today's era of big data, data quality is the premise of the significance of big data. The evaluation of data quality is one of the most important research topics. In this paper, the data quality assessment method based on rule base is put forward, and the overall model of data quality assessment is presented, which includes rules, rule base, data quality evaluation index, evaluation model and evaluation report. This paper designs the rule evaluation template, combines rules in the rule base, sets rule weight according to the importance of data quality evaluation index, adopts the evaluation method that combines the simple ratio method and the weighted average method, calculates the evaluation result, determines the grade of the data quality, and shows the evaluation result of data quality with the data visualization technology. In order to fairly and accurately assess the data quality, and concisely and intuitively present the evaluation results, the paper does not only consider the execution rate of a single rule, but also considers the proportion of each rule in the data quality evaluation template.
文章编号: 中图分类号: 文献标志码:
基金项目:上海市科委重点项目(SKY2015004)
引用文本:
刘芳,李敏,任洪敏,周兆明.基于规则库的数据质量评估方法.计算机系统应用,2017,26(11):165-169
LIU Fang,LI Min,REN Hong-Min,ZHOU Zhao-Ming.Data Quality Evaluation Method Based on Rule Base.COMPUTER SYSTEMS APPLICATIONS,2017,26(11):165-169
刘芳,李敏,任洪敏,周兆明.基于规则库的数据质量评估方法.计算机系统应用,2017,26(11):165-169
LIU Fang,LI Min,REN Hong-Min,ZHOU Zhao-Ming.Data Quality Evaluation Method Based on Rule Base.COMPUTER SYSTEMS APPLICATIONS,2017,26(11):165-169