Abstract: The development of neural network compression eases the deployment of deep neural networks on resource-constrained devices, such as mobile and embedded platforms. However, neural network compression still faces challenges, including automating the compression process, reconciling sparsity with hardware deployment, and avoiding the retraining of compressed networks. This paper first reviews classic neural network models and current compression toolkits. It then summarizes the advantages and weaknesses of representative compression methods: parameter pruning, quantization, low-rank factorization, and knowledge distillation. The paper lists evaluation metrics and common datasets for performance evaluation, and analyzes compression performance under different tasks and resource constraints. Finally, promising development trends are discussed as references for advancing neural network compression techniques.