Static Gesture Recognition Based on Residual Double Attention Module and Cross-level Feature Fusion

doi:10.15888/j.cnki.csa.008770

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-25- 15

Home > Archive>Volume 31, Issue 11, 2022 >111-119. DOI:10.15888/j.cnki.csa.008770

PDF HTML XML Export Cite reminder

Static Gesture Recognition Based on Residual Double Attention Module and Cross-level Feature Fusion
DOI:
                        10.15888/j.cnki.csa.008770
                    
CSTR:
                        [cstr]
                    
Author:
                        WU Jia-LuWU Jia-Lu
School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
TIAN Qiu-HongTIAN Qiu-Hong
School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
YUE Jin-HongYUE Jin-Hong
School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

To solve the problems of missing feature extraction by convolutional neural network and insufficient multi-feature extraction of a gesture, this study proposes a static gesture recognition method based on a residual double attention module and a cross-level feature fusion module. The designed residual double attention module can enhance the low-level features extracted by a ResNet50 network, effectively learn the key information, update the weight, and improve the attention to high-level features. Then, the cross-level feature fusion module fuses the high-level and low-level features in different stages to enrich the semantic and location information between different levels in the high-level feature map. Finally, the Softmax classifier of the fully connected layer is used to classify and recognize the gesture image. The experiment is carried out on the American sign language (ASL) dataset. The average recognition accuracy is 99.68%, which is 2.52% higher than that of the basic ResNet50 network. The results show that the proposed method can fully extract and reuse gesture features and effectively improve the recognition accuracy of gesture images.

Key words:gesture image recognition;ResNet;residual double attention module;cross-level feature fusion module;deep learning

Get Citation

吴佳璐,田秋红,岳金鸿.基于残差双注意力与跨级特征融合模块的静态手势识别.计算机系统应用,2022,31(11):111-119

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:February 12,2022
Revised:March 14,2022
Adopted:
Online: July 07,2022
Published:

Article QR Code

You are the first992292Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063