RVV Optimization for PyTorch

doi:10.15888/j.cnki.csa.009827


Volume 34, Issue 4, 2025, pp. 266-275


                    
CSTR: 32024.14.csa.009827
Authors:
WANG Fan (School of Computer Science & School of Cyber Science and Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China; University of Chinese Academy of Sciences, Nanjing, Nanjing 211135, China)
ZHANG Fei (Institute of Software, Chinese Academy of Sciences, Beijing 100190, China)
SONG Fu-Yuan (School of Computer Science & School of Cyber Science and Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China)
YU Jia-Geng (Institute of Software, Chinese Academy of Sciences, Beijing 100190, China)

                    
Abstract:

The RISC-V software ecosystem is developing rapidly. The international open-source community is actively contributing to adaptation and optimization for RISC-V, driving its software ecosystem forward. PyTorch, an open-source Python machine learning library, has significant advantages in performance, its open-source ecosystem, and research use. It provides strong support for platforms such as x86, ARM, PowerPC, and CUDA. However, software porting to RISC-V currently focuses on the RISC-V standard instruction set and does not yet fully exploit the RISC-V extended instruction sets, which leaves a significant gap between the RISC-V software ecosystem and mature ecosystems such as ARM and x86. In particular, because PyTorch lacks support for the RISC-V V extension (RVV), inference performance on RISC-V platforms lags considerably behind that of ARM platforms with similar specifications. To address this issue, this study proposes an efficient development scheme for PyTorch on RVV 1.0 and optimizes the depthwise convolution operators in PyTorch using the RVV extended instruction set. A comparative analysis on the K230 development board shows that the RVV-optimized depthwise convolution operators improve performance by approximately 1.35 to 3.8 times compared with scalar implementations.

Keywords: RISC-V; PyTorch; RVV extended instruction set; depthwise convolution
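
To make the operator named in the abstract concrete, the following is a minimal pure-Python sketch of a depthwise convolution (stride 1, no padding) in two forms: a scalar reference, and a strip-mined variant whose loop structure resembles how a vector kernel might process an output row. It is an illustration under stated assumptions, not the implementation described in the paper. The function names and the `vlmax` parameter are hypothetical; `vlmax` merely stands in for the hardware vector length that an RVV kernel would obtain from a `vsetvl` instruction.

```python
def depthwise_conv2d_scalar(x, w):
    """Reference depthwise convolution (stride 1, no padding).

    x: input as nested lists [C][H][W]; w: one filter per channel, [C][KH][KW].
    Each channel is convolved only with its own filter.
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])
    KH, KW = len(w[0]), len(w[0][0])
    OH, OW = H - KH + 1, W - KW + 1
    out = [[[0.0] * OW for _ in range(OH)] for _ in range(C)]
    for c in range(C):
        for oh in range(OH):
            for ow in range(OW):
                acc = 0.0
                for kh in range(KH):
                    for kw in range(KW):
                        acc += x[c][oh + kh][ow + kw] * w[c][kh][kw]
                out[c][oh][ow] = acc
    return out


def depthwise_conv2d_stripmined(x, w, vlmax=4):
    """Same computation, with each output row processed in strips.

    vlmax is a hypothetical stand-in for the hardware vector length.
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])
    KH, KW = len(w[0]), len(w[0][0])
    OH, OW = H - KH + 1, W - KW + 1
    out = [[[0.0] * OW for _ in range(OH)] for _ in range(C)]
    for c in range(C):
        for oh in range(OH):
            ow = 0
            while ow < OW:
                vl = min(vlmax, OW - ow)   # strip length; also covers the tail
                acc = [0.0] * vl           # plays the role of a vector accumulator
                for kh in range(KH):
                    row = x[c][oh + kh]
                    for kw in range(KW):
                        wt = w[c][kh][kw]  # scalar weight applied to the whole strip
                        for i in range(vl):  # models one vector multiply-accumulate
                            acc[i] += row[ow + kw + i] * wt
                out[c][oh][ow:ow + vl] = acc
                ow += vl
    return out
```

Both functions accumulate in the same order, so they return identical results for any `vlmax` of at least 1. The inner `for i in range(vl)` loop is the part that a vectorized kernel would replace with a single vector operation.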

Citation:

WANG Fan, ZHANG Fei, SONG Fu-Yuan, YU Jia-Geng. RVV Optimization for PyTorch. Computer Systems & Applications, 2025, 34(4): 266-275

History

Received: October 14, 2024
Revised: October 21, 2024
Online: February 18, 2025
