###
DOI:
计算机系统应用英文版:2014,23(2):183-188
本文二维码信息
码上扫一扫!
Hdspace分布式机构知识库系统的小文件存储
(河海大学 南京, 211100)
Storage of Small Files in Hdspace Distributing Institutional Repository System
(Business school of Hohai University, Nanjing 211100, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 1429次   下载 2586
Received:July 17, 2013    Revised:October 14, 2013
中文摘要: 机构知识库作为一种新型的学术交流模式和开放获取活动的绿色通道已逐渐成为国内外图书情报界关注的新焦点,随着机构库的发展其数据规模也在不断扩大,传统的存储模式已经不能满足日益增长的存储需求。在对机构库内容存储特点的研究基础上建立基于HDFS与Dspace的分布式机构库Hdspace。首先提出一种小文件合并生成新的存储文件,并对文件提出基于学科分类的两级索引,结合索引预缓存机制提高小文件的读取响应,为海量小文件存储及后续的信息高效利用提供了一种解决方案,通过模拟测试显示本模式能够大大提高机构知识库小文件的存储、读取以及检索效率。
中文关键词: 机构知识库  HDFS  海量小文件  Dspace
Abstract:The development of Institutional Repository requires a massive resource accumulation, the demand for storage keeps increasing especially for the small files. This article proposes a distributing storage model Hdspace which is based on Dapace and HDFS to resolve the problem of the storage of massive small files of Institutional Repository. First by a means of merging small document files to get new storage files, then by establishing two indexes based on subjects and index pre-caching to improve the file-reading response, finally put forward a method for the storage of massive small files.
文章编号:     中图分类号:    文献标志码:
基金项目:
引用文本:
卞艺杰,陈超,李亚冰,陆小亮.Hdspace分布式机构知识库系统的小文件存储.计算机系统应用,2014,23(2):183-188
BIAN Yi-Jie,CHEN Chao,LI Ya-Bing,LU Xiao-Liang.Storage of Small Files in Hdspace Distributing Institutional Repository System.COMPUTER SYSTEMS APPLICATIONS,2014,23(2):183-188