Received:October 08, 2017 Revised:November 01, 2017
Abstract: Geological hazard investigation, exploration, and prevention work produces large volumes of multi-source heterogeneous data. The text data among them is usually retrieved by file name or stored whole in large-object fields. This traditional storage approach does not support fast retrieval and extraction of the useful information inside the text, which is a major problem for current geological hazard data storage and retrieval. Based on unstructured (NoSQL) database technology, Chinese word segmentation, and keyword extraction, this paper implements fast retrieval and statistics of arbitrary useful information in geological hazard text data, which can provide strong support for the deep mining and fusion of hazard data.
Keywords: geologic hazard; NoSQL; Chinese word segmentation; paragraph segmentation; information retrieval
Funding: National Natural Science Foundation of China (41572336)
Cite this article:
YAO Meng-Hui, LIU Jun-Qi, FENG Rui-Xue, CHEN Gen-Shen, ZHAO Jian-Xiong. Geological Hazard Information Storage Technology and Retrieval Method. COMPUTER SYSTEMS APPLICATIONS, 2018, 27(6): 209-213