Spatio-Temporal Action Localization Algorithm Based on 3D-SVD

Authors: 王紫烟, 张立华, 翟鹏, 杜洋涛

Funding: Shanghai Science and Technology Committee Project (19511132000)
    Abstract:

    With the popularity of video surveillance, action analysis technology based on artificial intelligence is playing an increasingly important role in the field of intelligent surveillance. Most existing algorithms depend on an optical flow network or a 3D network to obtain the temporal information of actions. However, optical flow networks and general 3D networks require a large amount of computation, and their efficiency is low when classification and localization are carried out simultaneously. To solve this problem, this study builds a two-stream framework capable of spatial localization and classification and follows the idea of SVD to decompose the 3D convolution kernels in the 3D network branch, thus reducing the number of 3D network parameters. In addition, a dynamic programming algorithm is employed to efficiently search for the optimal action tubes, and the mixup algorithm is used to expand the datasets during training, thereby enhancing the training results. Finally, experimental verification is carried out on UCF101-24 and J-HMDB-21, two widely used datasets for action localization. Compared with the baseline algorithm, the Frame-mAP on the two datasets is improved by 7.1% and 4.8%, respectively, and the Video-mAP of J-HMDB-21 under different IoU thresholds is enhanced by 5.2% and 4.8%. Experimental results show that the proposed algorithm substantially improves action localization ability and achieves better results than other algorithms.
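The abstract does not give the exact factorization scheme, but the SVD idea for shrinking 3D convolution kernels (cf. refs [9, 10]) can be sketched as follows: flatten each t×k×k kernel into a (t, k·k) matrix and keep a truncated SVD, so a small temporal filter and a spatial filter stand in for the full 3D kernel. The rank-1 choice and all names below are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def svd_factorize_kernel(kernel3d, rank=1):
    # Flatten the t x k x k kernel into a (t, k*k) matrix and keep the
    # leading SVD terms: a (t, rank) temporal filter bank and rank
    # k x k spatial filters. This is a sketch of the idea, not the
    # paper's exact decomposition.
    t, kh, kw = kernel3d.shape
    mat = kernel3d.reshape(t, kh * kw)
    U, S, Vt = np.linalg.svd(mat, full_matrices=False)
    temporal = U[:, :rank] * S[:rank]           # (t, rank)
    spatial = Vt[:rank].reshape(rank, kh, kw)   # (rank, k, k)
    return temporal, spatial

rng = np.random.default_rng(0)
k = rng.standard_normal((3, 3, 3))              # one 3x3x3 kernel

temporal, spatial = svd_factorize_kernel(k)
full_params = k.size                            # 3*3*3 = 27
factored_params = temporal.size + spatial.size  # 3 + 9 = 12
approx = (temporal @ spatial.reshape(1, -1)).reshape(k.shape)
print(full_params, factored_params)             # → 27 12
```

The parameter saving grows with kernel size; at rank equal to min(t, k·k) the factorization reproduces the original kernel exactly.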
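The dynamic-programming tube search can likewise be illustrated with a common Viterbi-style formulation (cf. refs [4, 27]): choose one detection per frame so that the sum of detection scores, plus an IoU-overlap bonus between boxes in consecutive frames, is maximized. The scoring function and the `lam` weight below are assumptions for illustration, not necessarily the paper's exact objective.

```python
import numpy as np

def iou(a, b):
    # Intersection-over-union of two (x1, y1, x2, y2) boxes.
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def link_tube(boxes, scores, lam=1.0):
    # Viterbi-style linking: dp[t][j] is the best cumulative score of any
    # tube ending in box j of frame t; back[t-1][j] remembers which box
    # in frame t-1 achieved it.
    T = len(boxes)
    dp = [np.asarray(scores[0], dtype=float)]
    back = []
    for t in range(1, T):
        cur = np.zeros(len(boxes[t]))
        ptr = np.zeros(len(boxes[t]), dtype=int)
        for j in range(len(boxes[t])):
            trans = [dp[t - 1][i] + lam * iou(boxes[t - 1][i], boxes[t][j])
                     for i in range(len(boxes[t - 1]))]
            ptr[j] = int(np.argmax(trans))
            cur[j] = scores[t][j] + trans[ptr[j]]
        dp.append(cur)
        back.append(ptr)
    # Backtrack the highest-scoring path.
    path = [int(np.argmax(dp[-1]))]
    for t in range(T - 2, -1, -1):
        path.append(int(back[t][path[-1]]))
    return path[::-1]

# Toy example: two frames, two candidate boxes per frame.
boxes = [
    [(0, 0, 10, 10), (50, 50, 60, 60)],   # frame 0
    [(1, 1, 11, 11), (50, 50, 60, 60)],   # frame 1
]
scores = [[0.9, 0.5], [0.4, 0.6]]
tube = link_tube(boxes, scores)
```

Running the DP over all frames costs O(T·n²) for n candidates per frame, which is what makes an exhaustive tube search tractable.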
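The mixup augmentation mentioned above takes convex combinations of sample pairs and their labels. A minimal sketch, assuming the standard mixup formulation with a Beta-distributed mixing weight (the `alpha` value and clip shapes are illustrative, not the paper's settings):

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.2, rng=None):
    # Standard mixup: blend two samples and their one-hot labels with a
    # weight drawn from Beta(alpha, alpha).
    rng = rng if rng is not None else np.random.default_rng()
    lam = rng.beta(alpha, alpha)          # mixing weight in [0, 1]
    x = lam * x1 + (1.0 - lam) * x2       # blended input clip
    y = lam * y1 + (1.0 - lam) * y2       # blended soft label
    return x, y, lam

# Two dummy video clips (frames, H, W, channels) with one-hot labels.
clip_a, clip_b = np.zeros((8, 4, 4, 3)), np.ones((8, 4, 4, 3))
label_a, label_b = np.array([1.0, 0.0]), np.array([0.0, 1.0])
mixed_x, mixed_y, lam = mixup(clip_a, label_a, clip_b, label_b,
                              rng=np.random.default_rng(0))
```

Because the labels are mixed with the same weight as the inputs, the resulting soft label still sums to 1 and the classifier is trained on the blended clip directly.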

    References
    [1] Weinzaepfel P, Harchaoui Z, Schmid C. Learning to track for spatio-temporal action localization. Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago: IEEE, 2015. 3164–3172.
    [2] Peng XJ, Schmid C. Multi-region two-stream R-CNN for action detection. Proceedings of the 14th European Conference on Computer Vision. Cham: Springer, 2016. 744–759.
    [3] Yang ZH, Gao JY, Nevatia R. Spatio-temporal action detection with cascade proposal and location anticipation. arXiv: 1708.00042, 2017.
    [4] Alwando EHP, Chen YT, Fang WH. CNN-based multiple path search for action tube detection in videos. IEEE Transactions on Circuits and Systems for Video Technology, 2020, 30(1): 104–116. [doi: 10.1109/TCSVT.2018.2887283]
    [5] Hou R, Chen C, Shah M. Tube Convolutional Neural Network (T-CNN) for action detection in videos. Proceedings of 2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017. 5823–5832.
    [6] Kalogeiton V, Weinzaepfel P, Ferrari V, et al. Action tubelet detector for spatio-temporal action localization. Proceedings of 2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017. 4415–4423.
    [7] Li D, Qiu ZF, Dai Q, et al. Recurrent tubelet proposal and recognition networks for action detection. Proceedings of the 15th European Conference on Computer Vision. Cham: Springer, 2018. 306–322.
    [8] He JW, Deng ZW, Ibrahim MS, et al. Generic tubelet proposals for action localization. Proceedings of 2018 IEEE Winter Conference on Applications of Computer Vision. Lake Tahoe: IEEE, 2018. 343–351.
    [9] Qiu ZF, Yao T, Mei T. Learning spatio-temporal representation with pseudo-3D residual networks. Proceedings of 2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017. 5534–5542.
    [10] Tran D, Wang H, Torresani L, et al. A closer look at spatiotemporal convolutions for action recognition. Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018. 6450–6459.
    [11] Chawla NV, Bowyer KW, Hall LO, et al. SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 2002, 16: 321–357. [doi: 10.1613/jair.953]
    [12] Inoue H. Data augmentation by pairing samples for images classification. arXiv: 1801.02929, 2018.
    [13] Goodfellow IJ, Pouget-Abadie J, Mirza M, et al. Generative adversarial networks. Advances in Neural Information Processing Systems, 2014, 3: 2672–2680.
    [14] Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus: IEEE, 2014. 580–587.
    [15] Ren SQ, He KM, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137–1149. [doi: 10.1109/TPAMI.2016.2577031]
    [16] Girshick R. Fast R-CNN. Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago: IEEE, 2015. 1440–1448.
    [17] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection. Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas: IEEE, 2016. 779–788.
    [18] Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector. Proceedings of the 14th European Conference on Computer Vision. Cham: Springer, 2016. 21–37.
    [19] Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos. Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal: NIPS, 2014. 568–576.
    [20] Hara K, Kataoka H, Satoh Y. Can spatiotemporal 3D CNNs retrace the history of 2D CNNs and imagenet? Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City: IEEE, 2018. 6546–6555.
    [21] Köpüklü O, Wei XY, Rigoll G. You only watch once: A unified CNN architecture for real-time spatiotemporal action localization. arXiv: 1911.06644, 2019.
    [22] Pramono RRA, Chen YT, Fang WH. Hierarchical self-attention network for action localization in videos. Proceedings of 2019 IEEE/CVF International Conference on Computer Vision. Seoul: IEEE, 2019. 61–70.
    [23] Redmon J, Farhadi A. YOLOv3: An incremental improvement. arXiv: 1804.02767, 2018.
    [24] Li C, Zhong QY, Xie D, et al. Collaborative spatiotemporal feature learning for video action recognition. Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019. 7864–7873.
    [25] Touvron H, Vedaldi A, Douze M, et al. Fixing the train-test resolution discrepancy. Proceedings of the 33rd Conference on Neural Information Processing Systems. Vancouver: NeurIPS, 2019. 8250–8260.
    [26] Yang XT, Yang XD, Liu MY, et al. STEP: Spatio-TEmporal Progressive learning for video action detection. Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach: IEEE, 2019. 264–272.
    [27] Singh G, Saha S, Sapienza M, et al. Online real-time multiple spatiotemporal action localisation and prediction. Proceedings of 2017 IEEE International Conference on Computer Vision. Venice: IEEE, 2017. 3657–3666.
    [28] Saha S, Singh G, Sapienza M, et al. Deep learning for detecting multiple space-time action tubes in videos. arXiv: 1608.01529, 2016.
    [29] Singh G, Saha S, Cuzzolin F. Predicting action tubes. Proceedings of European Conference on Computer Vision. Cham: Springer, 2018. 106–123.
Cite this article:

王紫烟, 张立华, 翟鹏, 杜洋涛. Spatio-temporal action localization algorithm based on 3D-SVD. 计算机系统应用, 2021, 30(10): 109–117

History
  • Received: 2021-01-06
  • Revised: 2021-02-07
  • Published online: 2021-10-08