Computer Systems & Applications (English version): 2019, 28(7): 234-239
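Human Action Recognition Method Based on Two-Stream Convolutional Neural Networks
(College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266000)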
Human Action Recognition Algorithm Based on Two-Stream Convolutional Networks
(Information Science and Technology Academy, Qingdao University of Science and Technology, Qingdao 266061, China)
Received: January 21, 2019    Revised: February 21, 2019
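Abstract (translated from Chinese): Temporal action detection aims to find, in a long untrimmed video, the start and end times and the categories of the action segments it contains. For this task, an action detection model based on a two-stream convolutional neural network is proposed. First, the two-stream convolutional neural network extracts a feature sequence from the video, and TAG (Temporal Actionness Grouping) then generates action proposals. To build high-quality proposals, each proposal is passed to a boundary regression network that corrects its boundaries so that they lie closer to the ground truth. The proposal is then extended into a three-segment feature design that includes context information, and finally a multilayer perceptron recognizes the action. Experimental results show that the algorithm achieves a fairly good recognition rate on the THUMOS 2014 and ActivityNet v1.3 datasets.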
Abstract: Given a long, untrimmed video that contains multiple action instances and complex background content, temporal action detection must not only recognize the category of each action instance but also localize its start and end times. To this end, a temporal action detection network based on two-stream convolutional networks is proposed. First, a two-stream convolutional network extracts the feature sequence of the video, and TAG (Temporal Actionness Grouping) then generates proposals. To construct high-quality proposals, each proposal is fed to a boundary regression network that corrects its boundaries and brings them closer to the ground truth. The proposal is then extended to a three-segment feature design with context information, and finally a multilayer perceptron classifies the action. The experimental results show that the proposed algorithm achieves good mAP on the THUMOS 2014 and ActivityNet v1.3 datasets.
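The three-segment feature design mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how much context is added or how each segment is pooled, so the `context_ratio` parameter, the average pooling, and all function names below are assumptions made for illustration, not the authors' implementation. Snippet features are plain lists of floats, as a stand-in for the two-stream feature sequence.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[int, int]  # half-open snippet range [start, end)


def extend_proposal(start: int, end: int, num_snippets: int,
                    context_ratio: float = 0.5) -> Tuple[Segment, Segment, Segment]:
    """Split a proposal into (starting, course, ending) snippet ranges.

    context_ratio is a hypothetical hyperparameter: the context added on
    each side, as a fraction of the proposal length (at least one snippet).
    """
    if not 0 <= start < end <= num_snippets:
        raise ValueError("proposal must satisfy 0 <= start < end <= num_snippets")
    context = max(1, int((end - start) * context_ratio))
    starting = (max(0, start - context), start)        # context before the action
    ending = (end, min(num_snippets, end + context))   # context after the action
    return starting, (start, end), ending


def average_pool(features: Sequence[Sequence[float]], segment: Segment) -> List[float]:
    """Mean of the snippet features in a segment; zeros if the segment is empty."""
    lo, hi = segment
    dim = len(features[0])
    if hi <= lo:
        return [0.0] * dim
    return [sum(features[t][d] for t in range(lo, hi)) / (hi - lo)
            for d in range(dim)]


def three_stage_feature(features: Sequence[Sequence[float]], start: int, end: int,
                        context_ratio: float = 0.5) -> List[float]:
    """Concatenate the pooled starting, course, and ending features."""
    pooled: List[float] = []
    for segment in extend_proposal(start, end, len(features), context_ratio):
        pooled.extend(average_pool(features, segment))
    return pooled


# Toy feature sequence: 10 snippets, 1-dimensional features 0.0 .. 9.0.
toy_features = [[float(t)] for t in range(10)]
print(three_stage_feature(toy_features, 4, 8))  # [2.5, 5.5, 8.5]
```

In a pipeline like the one described, the concatenated vector would be the input to the multilayer perceptron. A proposal that touches the video boundary gets a zero vector for the missing context, which is one possible convention among several.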
Funding: National Natural Science Foundation of China (61472196, 61672305)
Citation:
LIU Yun (刘云), ZHANG Kun (张堃), WANG Chuan-Xu (王传旭). Human Action Recognition Algorithm Based on Two-Stream Convolutional Networks. Computer Systems & Applications, 2019, 28(7): 234-239