Named Entity Recognition Technology for Brief Case
CSTR:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    A brief case is a brief description of a case record made by a public security organ to improve the quality of information input in the Collaborative Case Handling System and ensure efficient information retrieval and joint investigation. A large amount of case information related to the victim and the perpetrator is between various entities. Therefore, in-depth excavation of brief case texts is an effective means to grasp the beginning and end of a case and to analyze the case. The dense distribution, inter-nesting, and abbreviation of entities in a brief case text bring great challenges to the accurate capture of the case entities. In response to the particularity and complexity of brief case texts, this study improves the method of character vector generation and proposes a Roberta-CNN-BiLSTM-CRF (RC-BiLSTM-CRF) network architecture. Compared with the mainstream Bert-BiLSTM-CRF architecture, this architecture can extract the character vector features, thereby solving the problem of a lengthy character vector brought by model pre-training. The model parameter number is reduced for a higher overall parameter convergence rate. In the comparative experiment, five mainstream architectures are selected and compared on the brief case dataset provided by the public security organs of Hunan Province. The method proposed in this study is proved to be the best in terms of accuracy, recall rate, and F1 value, and its F1 value reaches 88.02%.

    Reference
    Related
    Cited by
Get Citation

陈柱辉,刘新,张明键,张达为.简要案情的命名实体识别技术.计算机系统应用,2022,31(1):47-54

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:March 24,2021
  • Revised:April 21,2021
  • Adopted:
  • Online: December 17,2021
  • Published:
Article QR Code
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063