###

计算机系统应用英文版:2023,32(1):348-357

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

基于生成对抗网络的文本生成图像算法

段亚茹, 赵嘉雨, 何立明

(长安大学信息工程学院, 西安 710018)

Text-to-image Algorithm Based on Generative Adversarial Network

DUAN Ya-Ru, ZHAO Jia-Yu, HE Li-Ming

(School of Information Engineering, Chang’an University, Xi’an 710018, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 792次下载 1650次
Received:May 11, 2022 Revised:June 15, 2022

中文摘要: 文本生成图像算法对生成图像的质量和文本匹配度有很高的要求. 为了提高生成图像的清晰度, 在现有算法的基础上改进生成对抗网络模型. 加入动态记忆网络、细节校正模块(DCM)、文本图像仿射组合模块(ACM)来提高生成图片的质量. 其中动态记忆网络可以细化模糊图像并选择重要的文本信息存储, 以提高下一阶段生成图像的质量. DCM纠正细节, 完成合成图像中缺失部分. ACM编码原始图像特征, 重建与文本描述无关的部分. 改进后的模型实现了两个目标, 一是根据给定文本生成高质量的图片, 同时保留与文本无关的内容. 二是使生成图像不再较大程度依赖于初始图像的生成质量. 通过在CUB-200-2011鸟类数据集进行研究实验, 结果表明相较之前的算法模型, FID (Frechet inception)有了显著的改善, 结果由16.09变为10.40. 证明了算法的可行性和先进性.

中文关键词: 生成对抗网络细节校正动态记忆网络文本描述生成图像文本图像仿射组合模块图像生成深度学习

Abstract:Text-to-image algorithm requires high image quality and text matching. In order to improve the clarity of generated images, a generative adversarial network model is improved based on existing algorithms. Dynamic memory network, detail correction module (DCM), and text image affine combination module (ACM) are added to improve the quality of generated images. Specifically, the dynamic memory network can refine fuzzy images and select important text information storage to improve the quality of images generated in the next stage. DCM corrects details and repairs missing parts of composite images. ACM encodes original image features and reconstructs parts irrelevant to the text description. The improved model achieves two goals. On the one hand, high-quality images are generated according to given texts, with contents that are irrelevant to the texts preserved. Second, generated images do not greatly rely on the quality of initial images. Through experiments on the CUB-200-2011 bird data set, the results show that compared with previous algorithm models, the Frechet inception (FID) has been significantly improved, and the result has changed from 16.09 to 10.40, which proves that the algorithm is feasible and advanced.

keywords: generative adversarial network (GAN) detail correction dynamic memory network text to image text image affine combination module image generation deep learning

文章编号： 中图分类号： 文献标志码：

基金项目:

Author Name	Affiliation	E-mail
DUAN Ya-Ru	School of Information Engineering, Chang’an University, Xi’an 710018, China	dyrjiayou2022@163.com
ZHAO Jia-Yu	School of Information Engineering, Chang’an University, Xi’an 710018, China
HE Li-Ming	School of Information Engineering, Chang’an University, Xi’an 710018, China

Author Name	Affiliation	E-mail
DUAN Ya-Ru	School of Information Engineering, Chang’an University, Xi’an 710018, China	dyrjiayou2022@163.com
ZHAO Jia-Yu	School of Information Engineering, Chang’an University, Xi’an 710018, China
HE Li-Ming	School of Information Engineering, Chang’an University, Xi’an 710018, China

引用文本：
段亚茹,赵嘉雨,何立明.基于生成对抗网络的文本生成图像算法.计算机系统应用,2023,32(1):348-357
DUAN Ya-Ru,ZHAO Jia-Yu,HE Li-Ming.Text-to-image Algorithm Based on Generative Adversarial Network.COMPUTER SYSTEMS APPLICATIONS,2023,32(1):348-357