计算机系统应用 (Computer Systems & Applications), 2021, Vol. 30, Issue 1: 122-128

Image Inpainting Based on New Encoder and Similarity Constraint
LIN Zhu, WANG Min
College of Computer and Information, Hohai University, Nanjing 211100, China
Abstract: Existing image inpainting methods suffer from visible repair traces, semantic discontinuity, and blurry results. To address these problems, this study proposes an image inpainting method based on a new encoder and a context-aware loss. A generative adversarial network serves as the basic architecture. To learn image features fully and obtain clearer results, SE-ResNet is introduced to extract effective image features. In addition, a context-aware loss is proposed and used jointly to train the generative network; it constrains the similarity of local features so that the repaired image is closer to the original and looks more realistic and natural. Experiments on multiple public datasets show that the proposed method repairs damaged images more effectively.
Key words: generative adversarial network; image inpainting; residual network; contextual loss

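(1) Residual blocks based on SE-ResNet are added to the generative network and to the global-context and local-context discriminative networks to extract features more effectively.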

(2) A context-aware loss network is added to help constrain the similarity of local high-frequency features when repairing images.

1 Related Work

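Yu et al. [10] proposed an end-to-end image inpainting model that uses a stacked generative network to keep color and texture coherent with the surroundings, and introduced an attention module that extracts features resembling the region to be repaired from distant regions.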

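Liu et al. [11] proposed updating the mask during convolution and normalizing the convolution kernel weights with the updated mask values, so that the kernel focuses on valid pixel values.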
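The mask-aware convolution described above can be illustrated with a minimal one-dimensional sketch. It assumes the commonly described partial-convolution rule: only valid pixels contribute to the sum, the result is rescaled by the ratio of window size to valid-pixel count, and a position becomes valid once its window contains at least one valid pixel. The function name `partial_conv1d` and the 1D, single-channel setting are illustrative assumptions, not the implementation of [11].

```python
def partial_conv1d(x, mask, weights, bias=0.0):
    """One 1D partial-convolution step (valid padding, stride 1).

    x       : list of floats (values under holes are ignored)
    mask    : list of 0/1 ints, 1 = valid pixel, 0 = hole
    weights : list of floats, the convolution kernel
    Returns (output, updated_mask).
    """
    k = len(weights)
    out, new_mask = [], []
    for start in range(len(x) - k + 1):
        win_x = x[start:start + k]
        win_m = mask[start:start + k]
        valid = sum(win_m)
        if valid == 0:
            # No valid pixel under the window: output 0, position stays a hole.
            out.append(0.0)
            new_mask.append(0)
        else:
            # Sum over valid pixels only, then rescale by window size / valid count.
            acc = sum(w * xv * m for w, xv, m in zip(weights, win_x, win_m))
            out.append(acc * (k / valid) + bias)
            new_mask.append(1)
    return out, new_mask
```

With an averaging kernel, each output is the mean of the valid pixels in its window, and the hole in the updated mask shrinks with every layer.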

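Yu et al. [12] introduced gated convolution, which learns a dynamic selection mechanism over feature channels to improve color consistency, and proposed an efficient discriminator, SN-PatchGAN, to assist in repairing images with randomly located missing regions.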
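A rough sketch of the gating idea: one convolution produces features, a second convolution over the same input produces a gate, and the sigmoid of the gate scales the activated features. The code below is single-channel and one-dimensional, so it only shows gating per position; in a multi-channel layer there would be one gate value per channel and position. The names `conv1d` and `gated_conv1d` and the choice of ReLU as the feature activation are assumptions made for illustration.

```python
import math

def conv1d(x, w):
    """Plain 1D convolution (valid padding, stride 1, no kernel flip)."""
    k = len(w)
    return [sum(wi * xi for wi, xi in zip(w, x[s:s + k]))
            for s in range(len(x) - k + 1)]

def gated_conv1d(x, w_feat, w_gate):
    """Gated convolution: activation(feature) scaled by sigmoid(gate)."""
    feat = conv1d(x, w_feat)   # feature branch
    gate = conv1d(x, w_gate)   # gating branch, learned separately
    out = []
    for f, g in zip(feat, gate):
        soft_gate = 1.0 / (1.0 + math.exp(-g))  # sigmoid, value in (0, 1)
        out.append(max(f, 0.0) * soft_gate)     # ReLU(feature) * gate
    return out
```

A gate near 0 suppresses a feature and a gate near 1 passes it through, which is how the layer can learn to down-weight responses coming from missing pixels.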

2 Network Architecture

2.1 SE-ResNet

2.2 Generative Network

 ${L_{\rm {adv}}} = - {E_{x \sim {p_r}(x)}}D(G(M \odot x))$ (1)
 Fig. 1 Structure of the generative network

 ${L_{\rm {res}}} = - {E_{x \sim {p_r}(x)}}[{\left\| {M \odot (x - G(M \odot x))} \right\|_2}]$ (2)

 ${L_{ {CX}}} = - \log [CX(\Phi (x),\Phi (G(M \odot x)))]$ (3)

 ${L_{\rm {res}}} + {\lambda _1}{L_{\rm {adv}}} + {\lambda _2}{L_{{CX}}}$ (4)
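
As a minimal sketch of how the terms in Eqs. (1), (2) and (4) fit together, the Python below computes the masked norm that appears inside the expectation of Eq. (2), the adversarial term of Eq. (1) from a batch of discriminator scores, and the weighted sum of Eq. (4). Images are treated as flattened lists of floats. The function names are hypothetical, and no values of $\lambda_1$ and $\lambda_2$ are implied; they are left as arguments.

```python
import math

def masked_l2(x, g_out, mask):
    """|| M * (x - G(M * x)) ||_2 for one flattened image (norm inside Eq. 2)."""
    return math.sqrt(sum((m * (a - b)) ** 2 for a, b, m in zip(x, g_out, mask)))

def adversarial_loss(d_scores):
    """Eq. (1): negative mean discriminator score over generated images."""
    return -sum(d_scores) / len(d_scores)

def generator_objective(l_res, l_adv, l_cx, lam1, lam2):
    """Eq. (4): L_res + lambda_1 * L_adv + lambda_2 * L_CX."""
    return l_res + lam1 * l_adv + lam2 * l_cx
```

The two weights set how strongly the adversarial and contextual terms pull against the reconstruction term.
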
2.3 Discriminative Network

 Fig. 2 Structure of the discriminative network

 ${L_{\rm {dis}}} = - {E_{x \sim {p_r}}}[\log (D(x)) + \log (1 - D(G(M \odot x)))]$ (5)
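
Eq. (5) is a binary cross-entropy over real and generated images. The sketch below evaluates it for a batch, assuming the discriminator outputs are probabilities in [0, 1] that are already available as lists. The name `discriminator_loss` and the small constant `eps`, added only to keep the logarithm finite, are assumptions of this example.

```python
import math

def discriminator_loss(d_real, d_fake, eps=1e-12):
    """Eq. (5): -mean(log D(x) + log(1 - D(G(.)))) over a batch.

    d_real : discriminator outputs on real images
    d_fake : discriminator outputs on the generator's repaired images
    eps    : numerical guard against log(0) (not part of Eq. 5)
    """
    terms = [math.log(r + eps) + math.log(1.0 - f + eps)
             for r, f in zip(d_real, d_fake)]
    return -sum(terms) / len(terms)
```

The loss approaches 0 when real images score near 1 and repaired images near 0, and equals 2 ln 2 when the discriminator outputs 0.5 everywhere.
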
2.4 Context-Aware Loss

 ${L_{ {CX}}} = - \log [CX(\Phi (x),\Phi (G(M \odot x)))]$ (6)

 $CX(x,y) = CX(X,Y) = \frac{1}{N}\sum\limits_j {\mathop {\max }\limits_i } C{X_{ij}}$ (7)

 $C{X_{ij}} = {w_{ij}}\Bigg/\sum\limits_k {{w_{ik}}}$ (8)

 ${w_{ij}} = \exp \left( {\dfrac{{1 - {d_{\rm {similar}}}}}{h}} \right)$ (9)

 ${d_{\rm {similar}}} = \frac{{{d_{ij}}}}{{{{\min }_k}{d_{ik}} + \varepsilon }}$ (10)

Eq. (10) normalizes $d_{ij}$, where $d_{ij}$ is the cosine distance between $x_i$ and $y_j$. It is computed as:

 ${d_{ij}} = \left( {1 - \frac{{({x_i} - {\mu _y}) \cdot ({y_j} - {\mu _y})}}{{{{\left\| {{x_i} - {\mu _y}} \right\|}_2}{{\left\| {{y_j} - {\mu _y}} \right\|}_2}}}} \right)$ (11)

 ${\mu _y} = \frac{1}{N}\sum\limits_j {{y_j}}$ (12)
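Eqs. (6) to (12) can be traced end to end with a small pure-Python sketch. It assumes the feature vectors $x_i$ and $y_j$ have already been extracted by $\Phi$ and are passed in as lists of equal-length float lists. The function names, the default values of `h` and `eps`, and the tiny constant added to the cosine denominator are assumptions of this example, not settings from the paper.

```python
import math

def _dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def contextual_similarity(X, Y, h=0.5, eps=1e-5):
    """CX(X, Y) of Eq. (7) for feature sets X = {x_i} and Y = {y_j}."""
    n, dim = len(Y), len(Y[0])
    mu_y = [sum(y[c] for y in Y) / n for c in range(dim)]       # Eq. (12)
    Xc = [[v - m for v, m in zip(x, mu_y)] for x in X]
    Yc = [[v - m for v, m in zip(y, mu_y)] for y in Y]
    cx = []
    for xi in Xc:
        # Eq. (11): cosine distance; 1e-12 guards against zero-length vectors.
        row = [1.0 - _dot(xi, yj) /
               (math.sqrt(_dot(xi, xi)) * math.sqrt(_dot(yj, yj)) + 1e-12)
               for yj in Yc]
        d_min = min(row)
        # Eq. (10) then Eq. (9): normalize the distance, map it to a weight.
        w = [math.exp((1.0 - d / (d_min + eps)) / h) for d in row]
        total = sum(w)
        cx.append([wij / total for wij in w])                   # Eq. (8)
    # Eq. (7): for each y_j take the best-matching x_i, then average over j.
    return sum(max(cx_row[j] for cx_row in cx) for j in range(n)) / n

def contextual_loss(X, Y, h=0.5, eps=1e-5):
    """Eq. (6): L_CX = -log CX(X, Y)."""
    return -math.log(contextual_similarity(X, Y, h, eps))
```

When every $y_j$ has a clearly closest $x_i$ the similarity approaches 1 and the loss approaches 0. When all $x_i$ collapse onto a single feature, only one $y_j$ is matched and the similarity drops toward $1/N$.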

3 Experiments

3.1 Datasets

3.2 Training Procedure

3.3 Effect of SE-ResNet

 Fig. 3 Comparison of results with and without SE-ResNet residual blocks

3.4 Effect of the Context-Aware Loss

 Fig. 4 Comparison of results with and without the context-aware loss

3.5 Comparison with Existing Methods

 Fig. 5 Comparison with the method of Ref. [3] on inpainting of center-missing regions

 Fig. 6 Comparison with the method of Ref. [3] on inpainting of randomly missing regions

 Fig. 7 Comparison of edge detection maps

4 Conclusion

