Research on Optimization Method of Railway Image Scene Classification Based on Deep Learning Method
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [16]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    The field of railway detection and monitoring generates massive image data, image scene classification is of great value for subsequent analysis and management. In this study, a visual scene classification model that combines Deep Convolutional Neural Networks (DCNN) and Grad Class Activation Mapping (Grad-CAM) is proposed, DCNN extract feature of railway scene classification image dataset by transfer learning method, Grad-CAM improves the interpretability of the classification model by calculating the weighted thermogram and activation scores of the categories. In the experiment, the effects of different DCNN structures on the performance of railway image scene classification tasks are compared, and visual interpretation of scene classification model is realized. At the same time, based on visualization method, an optimization process is proposed to improve model classification ability by reducing internal deviation of dataset, which verifies the effectiveness of the deep learning technology for image scene classification task.

    Reference
    [1] 张萼辉. 铁路货车装载状态图像智能识别系统. 中国铁路, 2017, (9):113-116
    [2] Harris C, Stephens M. A combined corner and edge detector. Proceedings of the 4th Alvey Vision Conference. Manchester, UK. 1988. 147-151.
    [3] Wang S, Li W, Wang Y, et al. An improved difference of gaussian filter in face recognition. Journal of Multimedia, 2012, 7(6):429-433
    [4] Dalal N, Triggs B. Histograms of oriented gradients for human detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, CA, USA. 2005. 886-893.
    [5] Lowe DG. Object recognition from local scale-invariant features. Proceedings of the Seventh IEEE International Conference on Computer Vision. Kerkyra, Greece. 1999. 1150-1157.
    [6] Ojala T, Pietikainen M, Maenpaa T. Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7):971-987.[doi:10.1109/TPAMI.2002.1017623
    [7] Chen PH, Lin CJ, Schölkopf B. A tutorial on ν-support vector machines. Applied Stochastic Models in Business and Industry, 2005, 21(2):111-136.[doi:10.1002/asmb.v21:2
    [8] LeCun Y, Bengio Y, Hinton G. Deep learning. Nature, 2015, 521(7553):436-444.[doi:10.1038/nature14539
    [9] Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. arXiv:1311.2901, 2014.
    [10] Donahue J, Jia Y, Vinyals O, et al. DeCAF:A deep convolutional activation feature for generic visual recognition. International Conference on Machine Learning. Beijing, China. 2014. 647-655
    [11] Smirnov EA, Timoshenko DM, Andrianov SN. Comparison of regularization methods for imagenet classification with deep convolutional neural networks. Aasri Procedia, 2014, 6:89-94.[doi:10.1016/j.aasri.2014.05.013
    [12] Zhou BL, Khosla A, Lapedriza A, et al. Learning deep features for discriminative localization. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA. 2016. 2921-2929.
    [13] Selvaraju RR, Cogswell M, Das A, et al. Grad-CAM:Visual explanations from deep networks via gradient-based localization. arXiv:1610.02391, 2017.
    [14] Deng J, Dong W, Socher R, et al. Imagenet:A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL, USA. 2009. 248-255.
    [15] Zhou BL, Lapedriza A, Khosla A, et al. Places:A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(6):1452-1464.[doi:10.1109/TPAMI.2017.2723009
    [16] Herranz L, Jiang SQ, Li XY. Scene Recognition with CNNs:objects, scales and dataset bias. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA. 2016. 571-579.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

赵冰,李平,代明睿,马小宁.基于深度学习的铁路图像场景分类优化研究.计算机系统应用,2019,28(6):228-234

Copy
Share
Article Metrics
  • Abstract:1980
  • PDF: 3534
  • HTML: 1980
  • Cited by: 0
History
  • Received:December 10,2018
  • Revised:December 29,2018
  • Online: May 28,2019
  • Published: June 15,2019
Article QR Code
You are the first990789Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063