###

计算机系统应用英文版:2020,29(9):16-25

View/Add Comment 过刊浏览高级检索 HTML

←前一篇 | 后一篇→

码上扫一扫！

下载全文

卷积神经网络压缩与加速技术研究进展

尹文枫¹, 梁玲燕¹, 彭慧民¹, 曹其春¹, 赵健¹, 董刚¹, 赵雅倩¹, 赵坤²

(1.浪潮电子信息产业股份有限公司, 济南 250101;2.广东浪潮大数据研究有限公司, 广州 510632)

Research Progress on Convolutional Neural Network Compression and Acceleration Technology

YIN Wen-Feng¹, LIANG Ling-Yan¹, PENG Hui-Min¹, CAO Qi-Chun¹, ZHAO Jian¹, DONG Gang¹, ZHAO Ya-Qian¹, ZHAO Kun²

(1.Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China;2.Guangdong Inspur Big Data Research Co. Ltd., Guangzhou 510632, China)

摘要

图/表

参考文献

相似文献

本文已被：浏览 1275次下载 2729次
Received:February 26, 2020 Revised:March 17, 2020

中文摘要: 神经网络压缩技术的出现缓解了深度神经网络模型在资源受限设备中的应用难题，如移动端或嵌入式设备.但神经网络压缩技术在压缩处理的自动化、稀疏度与硬件部署之间的矛盾、避免压缩后模型重训练等方面存在困难.本文在回顾经典神经网络模型和现有神经网络压缩工具的基础上，总结参数剪枝、参数量化、低秩分解和知识蒸馏四类压缩方法的代表性压缩算法的优缺点，概述压缩方法的评测指标和常用数据集，并分析各种压缩方法在不同任务和硬件资源约束中的性能表现，展望神经网络压缩技术具有前景的研究方向.

中文关键词: 神经网络压缩参数剪枝参数量化低秩分解知识蒸馏

Abstract:The development of neural network compression relieves the difficulty of deep neural networks running on resource-restricted devices, such as mobile or embedded devices. However, neural network compression encounters challenges in automation of compression, conflict of the sparsity and hardware deployment, avoidance of retraining compressed networks and other issues. This paper firstly reviews classic neural network models and current compression toolkits. Secondly, this paper summarizes advantages and weaknesses of representative compression methods of parameter pruning, quantization, low-rank factorization and distillation. This paper lists evaluating indicators and common datasets for the performance evaluation and then analyzes compression performance in different tasks and resource constraints. Finally, promising development trends are stated in this paper as references for promoting the neural network compression technique.

keywords: neural network compression parameter pruning parameter quantizatipn low-rank factorization knowledge distillation

文章编号： 中图分类号： 文献标志码：

基金项目:

引用文本：
尹文枫,梁玲燕,彭慧民,曹其春,赵健,董刚,赵雅倩,赵坤.卷积神经网络压缩与加速技术研究进展.计算机系统应用,2020,29(9):16-25
YIN Wen-Feng,LIANG Ling-Yan,PENG Hui-Min,CAO Qi-Chun,ZHAO Jian,DONG Gang,ZHAO Ya-Qian,ZHAO Kun.Research Progress on Convolutional Neural Network Compression and Acceleration Technology.COMPUTER SYSTEMS APPLICATIONS,2020,29(9):16-25

Author Name	Affiliation	E-mail
YIN Wen-Feng	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China	yinwenfeng@inspur.com
LIANG Ling-Yan	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
PENG Hui-Min	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
CAO Qi-Chun	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
ZHAO Jian	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
DONG Gang	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
ZHAO Ya-Qian	Inspur Electronic Information Industry Co. Ltd., Jinan 250101, China
ZHAO Kun	Guangdong Inspur Big Data Research Co. Ltd., Guangzhou 510632, China