Decision Tree Algorithms for Lung Cancer Diagnosis Based on Electronic Medical Record
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [14]
  • |
  • Related
  • |
  • Cited by
  • | |
  • Comments
    Abstract:

    With the continuous improvement of people's living standards, the number of cancer diseases is increasing. Among them, lung cancer is a major disease that seriously endangers human health in the 21st century. This paper presents a decision tree method for lung cancer diagnosis based on electronic medical records. Firstly, the characteristics of lung cancer electronic medical records and the instability and over-fitting of the model tree in the decision tree are analyzed. The optimal decision tree model constructed by principal component analysis combined with C5.0 algorithm is used. Firstly, two methods of feature dimension reduction with principal component eigenvalue greater than 1 and principal component cumulative contribution rate greater than 85% are established. Then, the decision tree model and pruning operation are established by C5.0 algorithm. Finally, the data preprocessing process and model are given. The experimental results show that the improved algorithm has better accuracy and good scalability, which proves that the improved algorithm is of great significance for the clinical trial of lung cancer.

    Reference
    [1] World Health Organization. Cancer:fact sheet. http://www.who.int/mediacentre/factsheets/fs297/en/,[2017-05-01].
    [2] Wood DE, Kazerooni EA, Baum SL, et al. Lung cancer screening, version 3. 2018, NCCN clinical practice guidelines in oncology. Journal of the National Comprehensive Cancer Network, 2018, 16(4):412-441.
    [3] Li WM, Zhao S, Liu LX. The methods and clinical significance of early diagnosis of lung cancer. Journal of Sichuan University (Medical Science Edition), 2017, 48(3):331-335
    [4] Wang XG, Chen SH. A face recognition method based on PCA and GEP. Advances in Information Sciences and Service Sciences, 2013, 5(1):291-297.[doi:10.4156/aiss
    [5] Paylakhi SZ, Ozgoli S, Paylakhi SH. A novel gene selection method using GA/SVM and fisher criteria in Alzheimer's disease. 201523rd Iranian Conference on Electrical Engineering. Tehran, Iran. 2015. 956-959.
    [6] 莫珍丽.基于遗传算法优化小波神经网络的传染病发病率预测模型研究[硕士学位论文].重庆:重庆医科大学, 2015.
    [7] 王卓.基于粗糙集和C4.5决策树的临床病例数据分类研究.软件导刊, 2014, 13(5):61-64
    [8] Noor NM, Rosid R, Azmi MH, et al. Comparing watershed and FCM segmentation in detecting reticular pattern for interstitial lung disease. 2012 IEEE-EMBS Conference on Biomedical Engineering and Sciences. Langkawi, Malaysia. 2013. 944-949.
    [9] 孙海峰,孙秀玲,齐恩铁,等.基于混合粒子群优化SVM算法的红斑鳞状皮肤病诊断.计算机应用与软件, 2015, 32(6):192-197, 211.[doi:10.3969/j.issn.1000-386x.2015.06.048
    [10] 赵蔷.主成分分析方法综述.软件工程, 2016, 19(6):1-3.[doi:10.3969/j.issn.1008-0775.2016.06.001
    [11] Panhalkar A, Doye D. An outlook in some aspects of hybrid decision tree classification approach:A survey. Satapathy S, Bhateja V, Joshi A. Proceedings of the International Conference on Data Engineering and Communication Technology. Singapore:Springer, 2017.
    [12] Zhou XL, Yan DS. Model tree pruning. International Journal of Machine Learning and Cybernetics, 2019,(1):1-14.[doi:10.1007/s13042-019-00930-9.
    [13] 庄军,郭平,周杨,等.电子病历数据预处理技术.计算机科学, 2007, 34(3):141-144.[doi:10.3969/j.issn.1002-137X.2007.03.037
    [14] 朱甜甜.基于医疗大数据的肿瘤疾病模式分析与研究[硕士学位论文].青岛:青岛科技大学, 2018.
    Related
    Cited by
Get Citation

冯云霞,张润.基于电子病历的肺癌诊断决策树算法.计算机系统应用,2019,28(10):257-263

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:March 20,2019
  • Revised:April 17,2019
  • Online: October 15,2019
  • Published: October 15,2019
Article QR Code
You are the first992102Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063