Scene Recognition Algorithm Using Advanced CNN Features

doi:10.15888/j.cnki.csa.006684

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 8

Home > Archive>Volume 27, Issue 12, 2018 >25-32. DOI:10.15888/j.cnki.csa.006684

PDF HTML XML Export Cite reminder

Scene Recognition Algorithm Using Advanced CNN Features
DOI:
                        10.15888/j.cnki.csa.006684
                    
CSTR:
                        [cstr]
                    
Author:
                        BO Kang-HuBO Kang-Hu
School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
LEE Fei-FeiLEE Fei-Fei
School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
CHEN QiuCHEN Qiu
School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference [16]

Related [20]

Cited by

Materials

Comments

Abstract:

With the development of artificial intelligence, scene recognition has attracted more and more researchers' attention, which is one of the important directions of computer vision research. The traditional manual features cannot sufficiently describe the characteristics of the scene images, which leading to unsatisfied performance. On the contrary, the features extracted from Convolutional Neural Networks (CNN) contain rich semantics and structural information of the scene images. As one of the most common architectures, AlexNet network model is chosen in this study. By improving the following 4 aspects of the network:depth, width,multi-scale extraction, and multilayer fusion, the proposed approach achieves high accuracies of 92.0% and 94.5% on two publicly available datasets respectively, showing the superiority compared with other methods.

Key words:scene recognition;computer vision;Convolutional Neural Networks (CNN);AlexNet

Reference

[1] Koskela M, Laaksonen J. Convolutional network features for scene recognition. Proceedings of the 22nd ACM Interna-tional Conference on Multimedia. Orlando, FL, USA. 2014. 1169-1172.

[2] Li T, Mei T, Kweon IS, et al. Contextual bag-of-words for visual categorization. IEEE Transactions on Circuits and Systems for Video Technology, 2011, 21(4):381-392.[doi:10.1109/TCSVT.2010.2041828

[3] Lazebnik S, Schmid C, Ponce J. Beyond bags of features:Spatial pyramid matching for recognizing natural scene categories. Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, NY, USA. 2006. 2169-2178.

[4] Menze BH, Van Leemput K, Lashkari D, et al. A generative probabilistic model and discriminative extensions for brain lesion segmentation-with application to tumor and stroke. IEEE Transactions on Medical Imaging, 2016, 35(4):933-946.[doi:10.1109/TMI.2015.2502596

[5] Yu J, Rui Y, Tao DC. Click prediction for web image reranking using multimodal sparse coding. IEEE Transac-tions on Image Processing, 2014, 23(5):2019-2032.[doi:10.1109/TIP.2014.2311377

[6] He KM, Sun J. Convolutional neural networks at constrained time cost. Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston, MA, USA. 2015. 5353-5360.

[7] Zeiler MD, Fergus R. Visualizing and understanding convolu-tional networks. Proceedings of the 13th European Conference on Computer Vision. Zurich, Switzerland. 2014. 818-833.

[8] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556, 2014.

[9] Jia YQ, Shelhamer E, Donahue J, et al. Caffe:Convolutional architecture for fast feature embedding. arXiv:1408.5093, 2014.

[10] Vedaldi A, Fulkerson B. VLFeat:An open and portable library of computer vision algorithms. Proceedings of the 18th ACM International Conference on Multimedia. Firenze, Italy. 2010. 1469-1472.

[11] Jiang YN, Yuan JS, Yu G. Randomized spatial partition for scene recognition. Proceedings of the 12th European Conference on Computer Vision. Florence, Italy. 2012. 730-743.

[12] Zhou BL, Lapedriza A, Xiao JX, et al. Learning deep features for scene recognition using places database. Procee-dings of the 27th International Conference on Neural Information Processing Systems. Montreal, Canada. 2014. 487-495.

[13] Wu JX, Rehg JM. Beyond the Euclidean distance:Creating effective visual codebooks using the histogram intersection kernel. Proceedings of 2009 IEEE 12th International Conf-erence on Computer Vision. Kyoto, Japan. 2009. 630-637.

[14] Banerji S, Sinha A, Liu CJ. A new bag of words LBP (BoWL) descriptor for scene image classification. Proceedings of the 15th International Conference on Computer Analysis of Images and Patterns. York, UK. 2013. 490-497.

[15] Wang JJ, Yang JC, Yu K, et al. Locality-constrained linear coding for image classification. Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco, CA, USA. 2010. 3360-3367.

[16] Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. Proceedings of the 5th Annual Workshop on Computational Learning Theory. Pittsburgh, PA, USA. 1992. 144-152.

Get Citation

薄康虎,李菲菲,陈虬.基于改进CNN特征的场景识别.计算机系统应用,2018,27(12):25-32

Copy

Article Metrics

Abstract:2657
PDF: 2936
HTML: 3842
Cited by: 0

History

Received:May 12,2018
Revised:June 04,2018
Adopted:
Online: December 05,2018
Published:

Article QR Code

You are the first992247Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063