Abstract:In the process of investigation, exploration, and prevention about geologic hazard, a large number of heterogeneous data including text data is obtained. The method to storage the text data in file name search or large field is traditional, cannot meet rapidly retrieve and extract the useful information in the text data. It is an important problem faced by the geological hazard data storing and retrieving. In this study, based on the NoSQL, Chinese word segmentation, and Chinese keyword extraction technology, fast retrieval and statistics of any useful information are realized in geological hazard text data. It can provide strong support for the deep mining and fusion of hazard data.