Generation of Chinese Image Description by Multimodal Neural Network
Abstract:

Automatic image captioning is an active research topic that connects natural language processing and computer vision: its task is to understand the semantic information of an image and express it in natural human language. Because the overall quality of Chinese image captioning is still not high, this study uses FastText to generate word vectors and a convolutional neural network to extract global image features, encodes sentence-image pairs 〈S, I〉, and merges them into a feature matrix containing both the Chinese description and the image information. The decoder uses an LSTM model to decode the feature matrix and obtains the decoding result by computing cosine similarity. Comparative experiments show that the proposed model outperforms other models on the BiLingual Evaluation Understudy (BLEU) metric, and the Chinese descriptions it generates accurately summarize the semantic information of the images. A minimal sketch of this pipeline is given below.
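The following PyTorch sketch illustrates the pipeline described in the abstract under stated assumptions: the small stand-in CNN, the layer sizes, and the randomly initialised "FastText" matrix are placeholders for the real components (a pretrained CNN and pretrained FastText vectors), not the authors' exact configuration. Training details such as the loss function and teacher forcing are omitted.

```python
# Minimal sketch of the multimodal captioning pipeline, assuming
# illustrative sizes and stand-in components (not the paper's exact model).
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB_SIZE, EMBED_DIM, HIDDEN_DIM = 5000, 300, 512  # assumed sizes

class CaptionModel(nn.Module):
    def __init__(self, fasttext_vectors):
        super().__init__()
        # CNN encoder: extracts a global image feature vector.
        # (In practice a pretrained network would be used; this tiny
        # conv stack only keeps the example self-contained.)
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, HIDDEN_DIM),
        )
        # Word embeddings initialised from (pretrained) FastText vectors.
        self.embed = nn.Embedding.from_pretrained(fasttext_vectors, freeze=False)
        # LSTM decoder over the merged <sentence, image> feature matrix.
        self.lstm = nn.LSTM(EMBED_DIM + HIDDEN_DIM, HIDDEN_DIM, batch_first=True)
        self.proj = nn.Linear(HIDDEN_DIM, EMBED_DIM)

    def forward(self, images, captions):
        img_feat = self.cnn(images)                       # (B, HIDDEN_DIM)
        word_vecs = self.embed(captions)                  # (B, T, EMBED_DIM)
        # Merge: attach the global image feature to every word vector,
        # giving a feature matrix that carries both modalities.
        img_rep = img_feat.unsqueeze(1).expand(-1, word_vecs.size(1), -1)
        merged = torch.cat([word_vecs, img_rep], dim=-1)  # (B, T, E+H)
        hidden, _ = self.lstm(merged)
        return self.proj(hidden)                          # predicted word vectors

def decode_step(pred_vec, embedding_matrix):
    """Pick the vocabulary word whose embedding is most similar
    (by cosine similarity) to the decoder's predicted vector."""
    sims = F.cosine_similarity(pred_vec.unsqueeze(0), embedding_matrix, dim=-1)
    return sims.argmax().item()

# Toy usage with random data, just to show the shapes.
fasttext = torch.randn(VOCAB_SIZE, EMBED_DIM)   # stand-in for real FastText vectors
model = CaptionModel(fasttext)
images = torch.randn(2, 3, 224, 224)
captions = torch.randint(0, VOCAB_SIZE, (2, 10))
pred = model(images, captions)                  # (2, 10, EMBED_DIM)
word_id = decode_step(pred[0, -1], fasttext)
```

The cosine-similarity step corresponds to the decoding rule in the abstract: instead of a softmax over word indices, each predicted vector is matched against the FastText embedding table and the nearest word is emitted.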

Get Citation

陈兴 (Chen Xing). 基于多模态神经网络生成图像中文描述 [Generation of Chinese Image Description by Multimodal Neural Network]. 计算机系统应用 (Computer Systems & Applications), 2020, 29(9): 191-197.

History
  • Received: January 04, 2020
  • Revised: January 22, 2020
  • Online: September 07, 2020
  • Published: September 15, 2020