融合特征增强的频谱知识蒸馏
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:


Spectral Knowledge Distillation with Integrated Feature Enhancement
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    现有生成对抗网络压缩方法通常更侧重于网络架构和空间域的优化, 忽视了频谱域优化对蒸馏效果和模型性能的影响. 这种局限性导致轻量化模型在生成图像的高频细节时, 与教师模型之间存在一定的差异; 同时, 在图像转换任务中, 现有的特征提取方法通常会导致图像细节丢失. 针对这些问题, 提出了一种融合特征增强的频谱知识蒸馏(FESD-CycleGAN)方案. 在FESD-CycleGAN中, 首先通过对特征图的部分特征通道进行偏移, 扩大感受野, 增强特征多样性, 从而提升了生成图像的细节和整体表现. 其次基于对频谱域进行知识蒸馏能够使生成器捕捉图像的高频细节这一特性, 提出在对特征图进行特征增强的基础上, 将空间域与频谱域的知识蒸馏相结合, 从而增强了模型对生成图像细节的把控能力. 实验结果表明, 在horse2zebra、summer2winter和edges2shoes数据集上, FESD-CycleGAN与基线模型DCD相比, FID值分别降低了2.19、0.68和0.76, 达到54.98、73.41和27.45, 有效提升轻量化模型的生成性能.

    Abstract:

    Existing generative adversarial network (GAN) compression methods focus more on optimizing network architecture and the spatial domain, while neglecting the impact of spectral-domain optimization on distillation effectiveness and model performance. This limitation results in discrepancies between lightweight models and teacher models in generating high-frequency image details. In addition, conventional feature extraction methods in image translation often cause detail loss. To address these issues, this study proposes a spectral knowledge distillation scheme with integrated feature enhancement (FESD-CycleGAN). In FESD-CycleGAN, by shifting certain feature channels in the feature map, the receptive field is expanded and feature diversity is enhanced, thus improving both the details and the overall quality of generated images. Moreover, since spectral-domain knowledge distillation enables the generator to capture high-frequency details of images, knowledge distillation in both the spatial and spectral domains is integrated on top of feature enhancement in the feature map. This approach enhances the model’s ability to preserve fine details in generated images. Experimental results show that on the horse2zebra, summer2winter, and edges2shoes datasets, FESD-CycleGAN reduces the FID by 2.19, 0.68, and 0.76 compared to the baseline DCD model, achieving scores of 54.98, 73.41, and 27.45, respectively. The generative performance of lightweight models is effectively improved by FESD-CycleGAN.

    参考文献
    相似文献
    引证文献
引用本文

李睿,龚啸威.融合特征增强的频谱知识蒸馏.计算机系统应用,2026,35(1):76-87

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-05-28
  • 最后修改日期:2025-06-20
  • 录用日期:
  • 在线发布日期: 2025-10-29
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62661041 传真: Email:csa@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号