基于深度学习的图像检索系统

doi:10.15888/j.cnki.csa.005692

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月5日 4:32 星期六

首页 > 过刊浏览>2017年第26卷第3期 >8-19. DOI:10.15888/j.cnki.csa.005692

PDF HTML阅读 XML下载导出引用引用提醒

基于深度学习的图像检索系统
DOI:
                        10.15888/j.cnki.csa.005692
                    
CSTR:
                        
                    
作者:
                        胡二雷胡二雷
复旦大学 计算机科学技术学院, 上海 201203
在期刊界中查找
在百度中查找
在本站中查找
冯瑞冯瑞
上海市智能信息处理重点实验室 上海视频技术与系统工程研究中心, 上海 201203
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家科技支撑计划（2013BAH09F01）；上海市科委科技创新行动计划（14511106900）

Image Retrieval System Based on Deep Learning

Author:

HU Er-Lei
HU Er-Lei
School of Computer Science, Fudan University, Shanghai 201203, China
在期刊界中查找
在百度中查找
在本站中查找
FENG Rui
FENG Rui
Shanghai Key Laboratory of Intelligent Information Processing, Shanghai Engineering Research Center for Video Technology and System, Shanghai 201203, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献 [12]

相似文献 [20]

引证文献

资源附件

文章评论

摘要:

基于内容的图像检索系统关键的技术是有效图像特征的获取和相似度匹配策略.在过去，基于内容的图像检索系统主要使用低级的可视化特征，无法得到满意的检索结果，所以尽管在基于内容的图像检索上花费了很大的努力，但是基于内容的图像检索依旧是计算机视觉领域中的一个挑战.在基于内容的图像检索系统中，存在的最大的问题是“语义鸿沟”，即机器从低级的可视化特征得到的相似性和人从高级的语义特征得到的相似性之间的不同.传统的基于内容的图像检索系统，只是在低级的可视化特征上学习图像的特征，无法有效的解决“语义鸿沟”.近些年，深度学习技术的快速发展给我们提供了希望.深度学习源于人工神经网络的研究，深度学习通过组合低级的特征形成更加抽象的高层表示属性类别或者特征，以发现数据的分布规律，这是其他算法无法实现的.受深度学习在计算机视觉、语音识别、自然语言处理、图像与视频分析、多媒体等诸多领域取得巨大成功的启发，本文将深度学习技术用于基于内容的图像检索，以解决基于内容的图像检索系统中的“语义鸿沟”问题.

关键词:基于内容的图像检索;深度学习;特征提取;匹配

Abstract:

Learning effective feature representations and similarity measures are crucial to the retrieval performance of a content-based image retrieval system. In the past, the system works on the low-level visual features of input query image, which does not give satisfactory retrieval results, so, despite extensive research efforts for decades, it remains one of the most challenging problem in computer vision field. The main problem is the well-known "semantic gap", which exists between low-level image pixels captured by machines and high-level semantic concepts perceived by human. In the past, the content-based image retrieval system only works on the low-level visual features, which cannot solve "semantic gap" issue. Recently, the fast development of deep learning brings hope for the issue. Deep learning roots from the research of artificial neural network. In order to form more abstract high-level, deep learning combines low-level features, finds the regularities of distribution, which is different from other algorithm. Inspired by recent successes of deep learning techniques for computer vision, speech recognition, natural language process, image and video analysis, multimedia, in this paper, we apply deep learning to solve the "semantic gap" issue in content-based image retrieval.

Key words:content-based image retrieval;deep learning;feature extracting;match

参考文献

1 Wan J, Wang DY, Hoi SCH, Wu PC, Zhu JK, Zhang YD, Li JT. Deep learning for content-based image retrieval:A comprehensive study. Proc. of the 22nd ACM International Conference on Multimedia. ACM. 2014. 157-166.

2 Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Computation, 2006, 18(7):1527-1554.

3 Rumelhart DE, Hinton GE, Williams RJ. Learning internal representations by error propagation. Nature, 1986.

4 Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems. 2012. 1097-1105.

5 Deng J, Dong W, Socher R, Li LJ, Li K, Li FF. ImageNet:A large-scale hierarchical image database. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009). IEEE. 2009. 248-255.

6 Nair V, Hinton GE. Rectified linear units improve restricted biltzmann machines. Proc. 27th International Conference on Machine Learning (ICML-10). 2010. 807-814.

7 Donahue J, Jia YQ, Vinyals O, Hoffman J, Zhang N, Darrell ET. DeCAF:A deep convolutional activation feature for generic visual recognition. ICML. 2014. 647-655.

8 Breiman L. Random forests. Machine Learning, 2001, 45(1):5-32.

9 Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580. 2012.

10 Szegedy C, Liu W, Jia YQ, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A. Going deeper with convolutions. Proc. of the IEEE Conference on Computer Vision and Pattern Recognition. 2015. 1-9.

11 Donahue J, Jia Y, Vinyals O, et al. DeCAF:A deep convolutional activation feature for generic visual recognition. Computer Science, 2013, 50(1):815-830.

12 Nair V, Hinton GE. Rectified linear units improve restricted boltzmann machines. Proc. 27th International Conference on Machine Learning (ICML-10). 2010. 807-814.

引用本文

胡二雷,冯瑞.基于深度学习的图像检索系统.计算机系统应用,2017,26(3):8-19

复制

文章指标

点击次数:3991
下载次数: 4878
HTML阅读次数: 0
引用次数: 0

历史

收稿日期:2016-07-10
最后修改日期:2016-09-20
录用日期:
在线发布日期: 2017-03-11
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码