Optimization of CNN Computing Task Partition Based on Many-Core BWDSP

doi:10.15888/j.cnki.csa.007055


Volume 28, Issue 9, 2019, pp. 88-94

Author:
WANG Gai, ZHENG Qi-Long, DENG Wen-Qi, YANG Jiang-Ping, LU Mao-Hui
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China (all authors)

                    

Abstract:

Convolutional neural networks (CNNs), a class of deep learning algorithms, have been applied in many fields. Because network models are large and structurally complex and involve large amounts of data, their demand on computational resources must be reduced. Tasks with large amounts of data are usually partitioned and computed with a data-parallel strategy. However, applying data parallelism without taking the characteristics of the computing tasks into account results in a high volume of data transmission. It is therefore essential to design a reasonable data partitioning strategy that reduces the amount of data transmitted, based on an analysis of the network structure and the computing characteristics of CNNs. This paper first introduces the optimization of computing tasks in deep learning accelerators. It then presents the architecture of a deep learning accelerator based on the many-core BWDSP and designs a computing task partitioning strategy. Finally, it compares and analyzes experimental results on VGGNet-16. The results show that the proposed optimization algorithm significantly improves data transmission performance and reduces the amount of data transmitted.

Key words: many-core BWDSP; data parallel; Convolutional Neural Network (CNN); computing task partition

Get Citation

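WANG Gai, ZHENG Qi-Long, DENG Wen-Qi, YANG Jiang-Ping, LU Mao-Hui. Optimization of CNN Computing Task Partition Based on Many-Core BWDSP. Computer Systems & Applications (计算机系统应用), 2019, 28(9): 88-94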


History

Received: February 28, 2019
Revised: March 14, 2019
Online: September 9, 2019
Published: September 15, 2019
