基于全局-个体特征融合的群体行为识别

doi:10.15888/j.cnki.csa.009698

微信公众号

网站二维码

首页 > 过刊浏览>2024年第33卷第12期 >43-54. DOI:10.15888/j.cnki.csa.009698

PDF HTML阅读 XML下载导出引用引用提醒

基于全局-个体特征融合的群体行为识别
DOI:
                        10.15888/j.cnki.csa.009698
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(41975183, 41875184)

Group Activity Recognition Based on Global-individual Feature Fusion

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

群体行为识别是计算机视觉领域中备受关注的研究方向之一, 旨在通过多个个体动作与互动关系确定整体的行为. 然而, 由于确定个体互动关系、联系紧密程度以及活动关键人物三者的困难, 现有方法常关注于人物的个体特征, 忽略了与活动场景上下文的相互联系. 针对该问题, 提出一个基于全局-个体特征融合的群体行为识别推理模型GIFFNet (global-individual feature fusion network). 通过构建全局-个体特征融合(GIFF)模块, GIFFNet在聚焦关键信息的基础上, 有效整合了场景上下文与个体人物特征, 获取了更具表征能力的融合特征, 以弥补预测群体行为时场景信息缺失的问题. 随后, GIFFNet利用融合特征计算场景中人物之间的交互关系图, 并使用图卷积网络(GCN)进行训练和群体行为类别预测. 此外, 为解决数据集样本失衡的问题, GIFFNet采用动态分配权重的策略优化损失函数. 实验结果表明, GIFFNet在Volleyball、Collective Activity数据集上的多类分类准确度分别为93.8%、96.1%, 类平均精确度分别为93.9%、95.8%, 优于其他现有的深度学习方法. GIFFNet通过特征融合为行为分类提供了表征能力更加强大的特征, 有效地提升了行为识别的精确度.

Abstract:

Group activity recognition (GAR) is one of the highly researched areas in the field of computer vision, aiming to detect the overall behavior performed by multiple individual actions and interactions. However, due to difficulties in determining individual interaction relationships, the tightness of connections, and the key actor, current methods often focus on individual character features, yet neglecting connections with scene context. To address that issue, a novel reasoning model for GAR, GIFFNet, is proposed based on global-individual feature fusion (GIFF). To compensate for the lack of scene information in predicting group activity, GIFFNet, on the basis of focusing on key information, effectively integrates scene context and individual character features by constructing the GIFF module, obtaining more representative fusion features. Subsequently, GIFFNet utilizes fusion features to calculate the interaction relationship graph between characters in the scene and uses graph convolutional network (GCN) for training and predicting group behavior categories. In addition, to address the issue of imbalanced samples in the dataset, GIFFNet adopts a strategy of dynamically assigning weights to optimize the loss function. Experimental results demonstrate that GIFFNet achieves a multi-class classification accuracy (MCA) of 93.8% and 96.1% on Volleyball and Collective Activity datasets, and the mean per class accuracy (MPCA) is 93.9% and 95.8%, respectively, outperforming other existing deep learning methods. GIFFNet provides features with a more powerful characterization ability for activity classification through feature fusion, which effectively improves GAR accuracy.

参考文献

相似文献

引证文献

引用本文

程勇,程遥,王军,杨玲,许小龙,高园元,张开华.基于全局-个体特征融合的群体行为识别.计算机系统应用,2024,33(12):43-54

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-05-24
最后修改日期:2024-06-17
录用日期:
在线发布日期: 2024-10-31
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码