A Chinese Named Entity Recognition Model for the Medical Domain Based on CTD-BLSTM

doi:10.15888/j.cnki.csa.007609

2020, Vol. 29, No. 8, pp. 173-178

Authors: ZHU Xi-Yong (祝锡永), WU Yang (吴炀), LIU Chong (刘崇)
Affiliation: School of Economics and Management, Zhejiang Sci-Tech University, Hangzhou 310018, China
Funding: National Natural Science Foundation of China (71501172); Natural Science Foundation of Zhejiang Province (LY15G010010)

English title: Chinese Named Entity Recognition in Medical Field Using CTD-BLSTM Model

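Abstract (translated from the Chinese):

To retain more information during model training, the bidirectional long short-term memory (Bi-LSTM) neural model is extended with pre-trained word vectors and fine-tuned word vectors, and combined with co-training to cope with the shortage of annotated medical text. The result is the improved model CTD-BLSTM (Co-Training Double word embedding conditioned Bi-LSTM) for Chinese named entity recognition in the medical domain. Experiments show that, compared with the original BLSTM and BLSTM-CRF, CTD-BLSTM achieves higher accuracy and recall when corpus data is lacking, and can better support the construction of medical knowledge graphs and the development of knowledge question answering systems.

Keywords: bidirectional long short-term memory network; co-training; Chinese named entity recognition; question answering system; medical domain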
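The abstract names co-training as the way to cope with scarce annotations but does not describe the procedure. The following is a minimal sketch of the general co-training idea only, under loud assumptions: the two "views" are two toy numeric features, the per-view model is a nearest-centroid classifier rather than a Bi-LSTM tagger, and the confidence rule (`min_margin`) and the labels `O` and `B-DISEASE` are hypothetical choices made for illustration.

```python
def fit_centroids(examples, view):
    """Toy per-view model: map each label to the mean of one feature."""
    sums, counts = {}, {}
    for features, label in examples:
        sums[label] = sums.get(label, 0.0) + features[view]
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict(centroids, features, view):
    """Return (label, margin): margin is the gap between the two nearest centroids."""
    dists = sorted((abs(features[view] - c), label) for label, c in centroids.items())
    margin = dists[1][0] - dists[0][0] if len(dists) > 1 else 0.0
    return dists[0][1], margin


def co_train(labeled, unlabeled, rounds=5, min_margin=0.5):
    """Each view's model pseudo-labels confident examples for the shared pool."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(rounds):
        moved = False
        for view in (0, 1):
            model = fit_centroids(labeled, view)
            keep = []
            for features in pool:
                label, margin = predict(model, features, view)
                if margin >= min_margin:
                    # Confident prediction: promote to the labeled set.
                    labeled.append((features, label))
                    moved = True
                else:
                    keep.append(features)
            pool = keep
        if not moved:
            break  # neither view could label anything new
    return labeled, pool


# Hypothetical toy data: two labeled examples, three unlabeled ones.
seed_examples = [((0.0, 0.0), "O"), ((1.0, 1.0), "B-DISEASE")]
unlabeled_examples = [(0.1, 0.5), (0.5, 0.9), (0.5, 0.5)]
final_labeled, remaining = co_train(seed_examples, unlabeled_examples)
```

In this toy run, an example that one view cannot decide may still be labeled by the other view, which is the property that makes co-training attractive when labeled data is scarce. Examples that neither view is confident about stay in the unlabeled pool.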

Abstract:

To retain more feature information during training, this study extends the bidirectional long short-term memory network (Bi-LSTM) with both pre-trained and fine-tuned word vectors, and combines it with the semi-supervised co-training method to cope with the scarcity of annotated text in the medical field. The resulting improved model, Co-Training Double word embedding conditioned Bi-LSTM (CTD-BLSTM), is proposed for Chinese named entity recognition. Experiments show that, compared with the original BLSTM and BLSTM-CRF, the CTD-BLSTM model achieves higher accuracy and recall when corpora are scarce, so it can better support the construction of medical knowledge graphs and the development of knowledge question answering systems.

Key words: Bi-LSTM; co-training; Chinese named entity recognition; question answering system; medical field
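The abstract says the Bi-LSTM is extended with both a pre-trained and a fine-tuned word vector, without stating how the two are combined. As a rough illustration only, the sketch below assumes the two vectors are concatenated per token, that the fine-tuned table starts as a copy of the pre-trained one, and that only the fine-tuned half receives gradient updates. The class `DoubleEmbedding`, the helper `make_table`, and the random toy vectors are all hypothetical and not taken from the paper.

```python
import random


def make_table(vocab, dim, seed):
    """Build a toy embedding table: one random vector per token."""
    rng = random.Random(seed)
    return {tok: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for tok in vocab}


class DoubleEmbedding:
    """Concatenate a frozen pre-trained vector with a trainable copy of it."""

    def __init__(self, pretrained):
        self.frozen = pretrained  # assumed to stay fixed during training
        # Assumption: the fine-tuned table is initialised from the pre-trained one.
        self.tuned = {tok: list(vec) for tok, vec in pretrained.items()}

    def lookup(self, token):
        """Return the [frozen ; tuned] vector that would feed the Bi-LSTM."""
        return self.frozen[token] + self.tuned[token]

    def sgd_step(self, token, grad, lr=0.1):
        """Apply a gradient to the fine-tuned half only; the frozen half is skipped."""
        dim = len(self.frozen[token])
        tuned_grad = grad[dim:]
        self.tuned[token] = [w - lr * g for w, g in zip(self.tuned[token], tuned_grad)]


# Toy usage with two Chinese characters as tokens.
embedding = DoubleEmbedding(make_table(["头", "痛"], dim=4, seed=0))
vector = embedding.lookup("头")           # 8 values: 4 frozen + 4 tuned
embedding.sgd_step("头", [1.0] * 8)       # only the last 4 values change
```

The intent of keeping a frozen copy is that general-purpose information in the pre-trained vectors is not overwritten while the second copy adapts to the task, which matches the abstract's stated goal of retaining more information during training.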

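Cite this article

ZHU Xi-Yong, WU Yang, LIU Chong. Chinese Named Entity Recognition in Medical Field Using CTD-BLSTM Model. Computer Systems & Applications (计算机系统应用), 2020, 29(8): 173-178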

History

Received: 2020-01-22
Last revised: 2020-02-27
Published online: 2020-07-31
Publication date: 2020-08-15
