###

计算机系统应用英文版:2023,32(1):41-49

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于深度学习的行为识别多模态融合方法综述

詹健浩¹, 吴鸿伟², 周成祖², 陈晓筹³, 李晓潮¹

(1.厦门大学电子科学与技术学院, 厦门 361005;2.厦门市美亚柏科信息股份有限公司, 厦门 361016;3.厦门大学信息与网络中心, 厦门 361005)

Survey on Multi-modality Fusion Methods for Action Recognition Based on Deep Learning

ZHAN Jian-Hao¹, WU Hong-Wei², ZHOU Cheng-Zu², CHEN Xiao-Chou³, LI Xiao-Chao¹

(1.Department of Microelectronics and Integrated Circuit, Xiamen University, Xiamen 361005, China;2.Xiamen Meiya Pico Information Co. Ltd., Xiamen 361016, China;3.Information and Network Center, Xiamen University, Xiamen 361005, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 1432次下载 7075次
Received:March 08, 2022 Revised:April 12, 2022

中文摘要: 行为识别是通过对视频数据进行处理分析从而让计算机理解人的动作和行为. 不同模态数据在外观、姿态、几何、光照和视角等主要特征上各有优势, 通过多模态融合将这些特征进行融合可以获得比单一模态数据更好的识别效果. 本文对现有行为识别多模态融合方法进行介绍, 对比了它们之间的特点以及获得的性能提升, 包括预测分数融合、注意力机制、知识蒸馏等晚期融合方法, 以及特征图融合、卷积、融合结构搜索、注意力机制等早期融合方法. 通过这些分析和比较归纳出未来多模态融合的研究方向.

中文关键词: 行为识别深度学习多模态融合晚期融合早期融合

Abstract:Action recognition aims to make computers understand human actions by the processing and analysis of video data. As different modality data have different strengths in the main features such as appearance, gesture, geometric shapes, illumination, and viewpoints, action recognition based on the multi-modality fusion of these features can achieve better performance than the recognition based on single modality data. In this study, a comprehensive survey of multi-modality fusion methods for action recognition is given, and their characteristics and performance improvements are compared. These methods are divided into the late fusion methods and the early fusion methods, where the former includes prediction score fusion, attention mechanisms, and knowledge distillation, and the latter includes feature map fusion, convolution, fusion architecture search, and attention mechanisms. Upon the above analysis and comparison, the future research directions are discussed.

keywords: action recognition deep learning multi-modality fusion late fusion early fusion

文章编号： 中图分类号： 文献标志码：

基金项目:福建省高校产学研联合创新项目(2022H6004); 集成电路设计与测试分析福建省高校重点实验室基金; 厦门大学马来西亚研究基金(XMUMRF/2019-C4/IECE/0008)

引用文本：
詹健浩,吴鸿伟,周成祖,陈晓筹,李晓潮.基于深度学习的行为识别多模态融合方法综述.计算机系统应用,2023,32(1):41-49
ZHAN Jian-Hao,WU Hong-Wei,ZHOU Cheng-Zu,CHEN Xiao-Chou,LI Xiao-Chao.Survey on Multi-modality Fusion Methods for Action Recognition Based on Deep Learning.COMPUTER SYSTEMS APPLICATIONS,2023,32(1):41-49

Author Name	Affiliation	E-mail
ZHAN Jian-Hao	Department of Microelectronics and Integrated Circuit, Xiamen University, Xiamen 361005, China
WU Hong-Wei	Xiamen Meiya Pico Information Co. Ltd., Xiamen 361016, China
ZHOU Cheng-Zu	Xiamen Meiya Pico Information Co. Ltd., Xiamen 361016, China
CHEN Xiao-Chou	Information and Network Center, Xiamen University, Xiamen 361005, China
LI Xiao-Chao	Department of Microelectronics and Integrated Circuit, Xiamen University, Xiamen 361005, China	leexcjeffrey@xmu.edu.cn