Image Captioning Based on Dual Refined Attention

doi:10.15888/j.cnki.csa.007396

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-11- 3

Home > Archive>Volume 29, Issue 5, 2020 >245-251. DOI:10.15888/j.cnki.csa.007396

PDF HTML XML Export Cite reminder

Image Captioning Based on Dual Refined Attention
DOI:
                        10.15888/j.cnki.csa.007396
                    
CSTR:
                        [cstr]
                    
Author:
                        CONG Lu-WenCONG Lu-Wen
College of Computer Science and Technology, China University of Petroleum, Qingdao 266580, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Image captioning is an important task, which connects computer vision and natural language processing, two major artificial intelligence fields. In recent years, encoder-decoder frameworks integrated with attention mechanism have made significant process in captioning. However, many attention-based methods only use spatial attention mechanism. In this study, we propose a novel dual refined attention model for image captioning. In the proposed model, we use not only spatial attention but also channel-wise attention and then use a refine module to refine the image features. By using the refine module, the proposed model can filter the redundant and irrelevant features in the attended image features. We validate the proposed model on MSCOCO dataset via various evaluation metrics, and the results show the effectiveness of the proposed model.

Key words:image captioning;spatial attention;channel-wise attention;Long Short Term Memory (LSTM);computer vision

Get Citation

丛璐文.基于双路细化注意力机制的图像描述模型.计算机系统应用,2020,29(5):245-251

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:October 07,2019
Revised:November 07,2019
Adopted:
Online: May 07,2020
Published: May 15,2020

Article QR Code

You are the first990999Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063