###

计算机系统应用英文版:2020,29(5):252-256

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于多粒度视频信息和注意力机制的视频场景识别

袁韶祖, 王雷全, 吴春雷

(中国石油大学(华东) 计算机科学与技术学院, 青岛 266580)

Video Scene Recognition with Multi-Granularity Video Features and Attention Mechanism

YUAN Shao-Zu, WANG Lei-Quan, WU Chun-Lei

(College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 1151次下载 2334次
Received:October 16, 2019 Revised:November 15, 2019

中文摘要: 视频场景识别是机器学习和计算机视觉一个重要的研究领域.但是当前对于视频场景识别的探索工作还远远不够，而且目前提出的模型大都使用视频级的特征信息，忽略了多粒度的视频特征关联.本文提出了一种基于多粒度的视频特征的注意力机制的模型架构，可以动态高效的利用各维度视频信息之间存在的丰富的语义关联，提高识别准确度.本文在中国多媒体大会（CCF ChinaMM 2019）最新推出的VideoNet数据集上进行了实验，实验结果表明基于多粒度的视频特征的注意力机制的模型与传统方法相比具有明显的优越性.

中文关键词: 视频场景识别注意力机制多粒度视频信息卷积神经网络检测网络

Abstract:Video scene recognition has attracted much attention in the field of machine learning and computer vision. It is not only an important practical application, but also a challenge for image understanding in the field of computer vision. Nevertheless, current exploration of video scene recognition has not been unable to meet the needs of production environment. And most proposed models only use video-level feature information, while ignore association of multi-granularity video feature. In this study, we propose an architecture of attention mechanism with multi-granularity video features, which can make use of the rich semantic association among the various dimensions of video information dynamically and efficiently, and improve the performance of the model. The experiments are conducted on the latest VideoNet dataset released by CCF China MM 2019. The result shows that the proposed model based on attention mechanism model with multi-granularity video features outperforms the previous methods.

keywords: video scene recognition attention mechanism multi-granularity video features CNN detection network

文章编号： 中图分类号： 文献标志码：

基金项目:山东省重点研发计划（2019GGX101015）；中央高校自主创新科研计划（17CX02041A，18CX02136A）

Author Name	Affiliation	E-mail
YUAN Shao-Zu	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
WANG Lei-Quan	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
WU Chun-Lei	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China	wuchunlei06@163.com

Author Name	Affiliation	E-mail
YUAN Shao-Zu	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
WANG Lei-Quan	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
WU Chun-Lei	College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China	wuchunlei06@163.com

引用文本：
袁韶祖,王雷全,吴春雷.基于多粒度视频信息和注意力机制的视频场景识别.计算机系统应用,2020,29(5):252-256
YUAN Shao-Zu,WANG Lei-Quan,WU Chun-Lei.Video Scene Recognition with Multi-Granularity Video Features and Attention Mechanism.COMPUTER SYSTEMS APPLICATIONS,2020,29(5):252-256