Word Segment Based on Suffix Array

AIPUB归智期刊联盟

WeChat

Mobile website

2025-8-6- 23

Home > Archive>Volume 19, Issue 8, 2010 >229-230

PDF HTML XML Export Cite reminder

Word Segment Based on Suffix Array
DOI:
                        
                    
CSTR:
                        
                    
Author:
                        REN Xue-LiREN Xue-Li

Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
DAI Yu-BiaoDAI Yu-Biao

Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Chinese word segmentation technology is the basis of machine translation, classification, search engines, as well as information retrieval. But the Internet emerging new words have seriously affected the performance of word segmentation. To improve the recognition rate of new words, suffix array is used in this paper, and the number of length of common prefix is calculated. The candidates on their words are filtered out by the threshold. Experimental results show that the new word recognition method has advantages.

Key words:suffix array; word segment; LCP

Get Citation

任雪利,代余彪.基于后缀数组的分词技术①.计算机系统应用,2010,19(8):229-230

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:December 04,2009
Revised:January 18,2010
Adopted:
Online:
Published:

Article QR Code

You are the first1094888Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063