Abstract: Model quantization is widely used for fast inference and deployment of deep neural network models. Post-training quantization has attracted considerable attention from researchers because it requires no retraining and incurs only a small performance loss. However, most existing post-training quantization methods rely on theoretical assumptions or assign a fixed bit-width to every network layer during quantization, which causes significant performance degradation in the quantized network, especially in low-bit scenarios. To improve the accuracy of post-training quantized models, this study proposes a novel post-training mixed-precision quantization method (MSQ). MSQ estimates the task-related accuracy of each layer by inserting a task predictor module, which combines a pyramid pooling module with weight imprinting, after every layer of the network. From these estimates, it assesses the importance of each layer and assigns a quantization bit-width to each layer accordingly. Experiments show that the proposed MSQ algorithm outperforms several existing mixed-precision quantization methods on popular network architectures, and the quantized models achieve better performance and lower latency when deployed on edge hardware devices.
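To make the bit-width allocation idea concrete, the sketch below illustrates one way per-layer accuracy estimates could be turned into a mixed-precision assignment. It is a minimal illustration under stated assumptions: the accuracy values, the importance proxy (accuracy gain over the previous layer), and the rank-based mapping to bit-widths are all hypothetical placeholders, not the exact criterion used by MSQ.

```python
# Illustrative sketch of importance-driven bit-width allocation.
# The importance proxy and the rank-to-bit-width mapping below are
# hypothetical simplifications, not the paper's actual MSQ criterion.

def assign_bitwidths(layer_accuracies, bit_choices=(8, 6, 4)):
    """Map per-layer accuracy estimates to quantization bit-widths.

    Layers whose (hypothetical) task predictor reports a larger
    accuracy gain are treated as more important and kept at
    higher precision.
    """
    # Importance: how much each layer improves task accuracy relative
    # to the previous layer (a simple proxy for layer sensitivity).
    importance = []
    prev = 0.0
    for acc in layer_accuracies:
        importance.append(max(acc - prev, 0.0))
        prev = acc

    # Rank layers by importance and split the ranking evenly across the
    # available bit choices: most important group -> 8 bits, next -> 6,
    # least important -> 4.
    order = sorted(range(len(importance)),
                   key=lambda i: importance[i], reverse=True)
    bits = [0] * len(importance)
    group = max(1, len(order) // len(bit_choices))
    for rank, idx in enumerate(order):
        bits[idx] = bit_choices[min(rank // group, len(bit_choices) - 1)]
    return bits


if __name__ == "__main__":
    # Hypothetical per-layer accuracies from inserted task predictors.
    accs = [0.31, 0.45, 0.52, 0.55, 0.68, 0.71, 0.72]
    print(assign_bitwidths(accs))  # -> [8, 8, 6, 4, 6, 4, 4]
```

In this toy allocation, early layers with large accuracy gains keep 8-bit precision while layers contributing little are pushed to 4 bits; the actual MSQ method derives its importance scores from the pyramid-pooling/weight-imprinting task predictors described above.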