Automatic Bibliography Integration System for Domain Experts
Authors: Liao Xiaofeng, Wang Yongji, Zhou Jinhui, Guan Bei
    Abstract:

    We designed and implemented BibCollector, a system that automatically collects bibliographic information about domain experts from multiple databases. The system targets experts and scholars in computer science and information technology. It covers the major full-text databases in China and abroad (SpringerLink, IEEE Xplore, ACM Digital Library, Elsevier ScienceDirect, CNKI, and Wanfang Data), the commonly used citation databases (SCI, EI, ISTP, and CSCD), and the Derwent patent database. An expert's name and affiliation/address serve as the identifier for querying these databases accurately, and dedicated algorithms exclude unrelated records and identify duplicates, so the resulting bibliography has relatively high accuracy. Compared with its overseas and domestic counterparts, the system draws on richer record sources and produces more accurate results.
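The abstract does not spell out how relevance filtering and duplicate removal are carried out, so the following is only a minimal sketch of the general idea under stated assumptions, not BibCollector's actual algorithm. The `Record` type, the function names, exact matching of normalized author names, substring matching on the affiliation, and deduplication by normalized title are all hypothetical choices made for illustration.

```python
import re
import unicodedata
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical record shape; real database exports carry many more fields."""
    title: str
    authors: tuple      # author names as given by the source database
    affiliation: str    # free-text affiliation/address field
    source: str         # e.g. "IEEE Xplore", "CNKI"


def normalize(text):
    """Lowercase, strip accents, and collapse punctuation/whitespace to single spaces."""
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    return re.sub(r"[\W_]+", " ", text.lower()).strip()


def is_relevant(record, name_variants, affiliation_keywords):
    """Assumed rule: keep a record only if an author name AND the affiliation match."""
    names = {normalize(n) for n in name_variants}
    has_name = any(normalize(a) in names for a in record.authors)
    affiliation = normalize(record.affiliation)
    has_affiliation = any(normalize(k) in affiliation for k in affiliation_keywords)
    return has_name and has_affiliation


def deduplicate(records):
    """Assumed rule: records with the same normalized title are duplicates; keep the first."""
    seen = {}
    for record in records:
        seen.setdefault(normalize(record.title), record)
    return list(seen.values())


def collect(records, name_variants, affiliation_keywords):
    """Filter out unrelated records, then merge duplicates found across databases."""
    relevant = [r for r in records if is_relevant(r, name_variants, affiliation_keywords)]
    return deduplicate(relevant)
```

In this sketch, requiring both the name and the affiliation to match is what separates one expert from namesakes at other institutions, at the cost of missing papers written under an earlier affiliation. A production system would likely need fuzzier title matching and a list of known affiliation variants per expert.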

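Cite this article

Liao Xiaofeng, Wang Yongji, Zhou Jinhui, Guan Bei. Automatic Bibliography Integration System for Domain Experts. Computer Systems & Applications, 2012, 21(6): 115-120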

History
  • Received: 2011-10-01
  • Last revised: 2011-11-04