Abstract: To address the difficulty of mapping between the text and image modalities in high-dimensional space, this study proposes a generative adversarial network (GAN) for text-to-image generation that is built on a stacked architecture and takes global sentence vectors as input. The network incorporates a dual attention mechanism to better fuse features along the spatial and channel dimensions. In addition, a fidelity loss is added to the discriminator as a further constraint. The proposed method is evaluated on the Caltech-UCSD Birds (CUB) dataset, with Inception Score and SSIM as the evaluation metrics. The results show that the generated images exhibit more realistic texture detail, and their visual appearance is closer to that of real images.
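To make the dual attention idea concrete, the following PyTorch sketch shows one common way to combine a channel-attention branch with a spatial-attention branch on a generator feature map. This is only an illustrative, CBAM-style design under assumed hyperparameters (reduction ratio, kernel size); the paper's actual module and its placement in the stacked generator may differ.

```python
# Illustrative sketch (not the paper's exact module): dual attention that refines
# a feature map along the channel dimension and then the spatial dimension.
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """Squeeze-and-excitation style channel gating (assumed design)."""
    def __init__(self, channels: int, reduction: int = 8):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),                      # (B, C, H, W) -> (B, C, 1, 1)
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),                                 # per-channel weights in [0, 1]
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * self.mlp(x)                            # reweight each channel


class SpatialAttention(nn.Module):
    """Spatial gating from pooled channel statistics (assumed design)."""
    def __init__(self, kernel_size: int = 7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        avg_map = x.mean(dim=1, keepdim=True)             # (B, 1, H, W)
        max_map = x.amax(dim=1, keepdim=True)             # (B, 1, H, W)
        attn = self.sigmoid(self.conv(torch.cat([avg_map, max_map], dim=1)))
        return x * attn                                   # reweight each spatial location


class DualAttention(nn.Module):
    """Channel attention followed by spatial attention."""
    def __init__(self, channels: int):
        super().__init__()
        self.channel_attn = ChannelAttention(channels)
        self.spatial_attn = SpatialAttention()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.spatial_attn(self.channel_attn(x))


if __name__ == "__main__":
    feats = torch.randn(4, 64, 32, 32)                    # e.g. intermediate generator features
    print(DualAttention(64)(feats).shape)                 # torch.Size([4, 64, 32, 32])
```

The attention-refined feature map keeps the same shape as its input, so a block like this can be dropped between stages of a stacked generator without changing the surrounding architecture.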