Matching Algorithm Between Data Item and Data Element During Data Integration
Authors: Wen Bilong, Fu Yue
    Abstract:

    In recent years, with the establishment of data element standards, data elements have come to play an important role in data integration at many enterprises. Data elements can standardize the data items of databases, reports, and documents, and can help map between data sources. This paper analyzes the composition of a data element and proposes an algorithm for matching data items to data elements. The algorithm is based on Levenshtein distance and incorporates ideas from the longest common subsequence, weighting, and backward focus. It computes the similarity between a data item and the data elements in a data element dictionary, and uses the principle of permutations and combinations to speed up matching. Experiments using the standard data items of the China Petroleum and Chemical data element dictionary as test data confirmed the correctness of the matching algorithm.
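
    The abstract names the building blocks but gives no formula, so the following is only a minimal sketch of how a Levenshtein-based similarity might be blended with a longest-common-subsequence score. The function names, the blending weight `alpha`, and the sample dictionary entries are all assumptions made for illustration, not the authors' definitions. The paper's weighting scheme, its "backward focus" idea, and its permutation-and-combination speed-up are not reproduced here because the abstract does not specify them.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic edit distance (insert, delete, substitute), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, 1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(item: str, element: str, alpha: float = 0.5) -> float:
    """Blend of edit-distance and LCS similarity, each normalized to [0, 1].

    alpha is a hypothetical weight, not a value taken from the paper.
    """
    if not item and not element:
        return 1.0
    longest = max(len(item), len(element))
    edit_sim = 1 - levenshtein(item, element) / longest
    lcs_sim = lcs_length(item, element) / longest
    return alpha * edit_sim + (1 - alpha) * lcs_sim


def best_match(item: str, dictionary: list[str]) -> str:
    """Return the dictionary entry most similar to the data item."""
    return max(dictionary, key=lambda element: similarity(item, element))


# Hypothetical data element names, for illustration only.
elements = ["well_name", "well_depth", "oil_field"]
print(best_match("well_nm", elements))
```

    Scanning the whole dictionary for every data item, as `best_match` does, is the naive baseline; the speed optimization mentioned in the abstract would replace that linear scan.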

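Citation

Wen Bilong, Fu Yue. Matching algorithm between data item and data element during data integration. Computer Systems & Applications, 2012, 21(3): 240-243, 231.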

History
  • Received: July 8, 2011
  • Revised: July 30, 2011