Aerial Scene Classification by Fusion of Dual-branch Attention and FasterNet

doi:10.15888/j.cnki.csa.009512

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-11- 5

Home > Archive>Volume 33, Issue 5, 2024 >15-27. DOI:10.15888/j.cnki.csa.009512

PDF HTML XML Export Cite reminder

Aerial Scene Classification by Fusion of Dual-branch Attention and FasterNet
DOI:
                        10.15888/j.cnki.csa.009512
                    
CSTR:
                        [cstr]
                    
Author:
                        YANG Ben-ChenYANG Ben-Chen
Software College, Liaoning University of Engineering and Technology, Huludao 125105, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
QU Ye-TianQU Ye-Tian
Software College, Liaoning University of Engineering and Technology, Huludao 125105, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
JIN Hai-BoJIN Hai-Bo
Software College, Liaoning University of Engineering and Technology, Huludao 125105, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The scenes in high-resolution aerial images are of many highly similar categories. The classic classification method based on deep learning offers low operational efficiency because of the redundant floating-point operations generated in the feature extraction process. FasterNet improves the operational efficiency through partial convolution but reduces the feature extraction ability and hence the classification accuracy of the model. To address the above problems, this study proposes a hybrid structure classification method integrating FasterNet and the attention mechanism. Specifically, the “cross-shaped convolution module” is used to partially extract scene features and thereby improve the operational efficiency of the model. Then, a dual-branch attention mechanism that integrates coordinate attention and channel attention is used to enable the model to better extract features. Finally, a residual connection is made between the “cross-shaped convolution module” and the dual-branch attention module so that more task-related features can be obtained from network training, thereby reducing operational costs and improving operational efficiency in addition to improving classification accuracy. The experimental results show that compared with the existing classification models based on deep learning, the proposed method has a short inference time and high accuracy. Its number of parameters is 19M, and its average inference time for one image is 7.1 ms. The classification accuracy of the proposed method on the public datasets NWPU-RESISC45, EuroSAT, VArcGIS (10%), and VArcGIS (20%) is 96.12%, 98.64%, 95.42%, and 97.87%, respectively, which is 2.06%, 0.77%, 1.34%, and 0.65% higher than that of the FasterNet model, respectively.

Key words:remote sensing scene;image classification;attention mechanism;residual connection;FasterNet

Get Citation

杨本臣,曲业田,金海波.双分支注意力与FasterNet相融合的航拍场景分类.计算机系统应用,2024,33(5):15-27

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:November 30,2023
Revised:December 29,2023
Adopted:
Online: April 07,2024
Published:

Article QR Code

You are the first991010Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063