Optimization of Memory Access Based on Loongson2F
DOI:
CSTR:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    In most cases, compared to computing time, memory access time takes a much larger proportion of program running time. Therefore, memory access approach can affect the program performance significantly. Testing results show that the performance of ATLAS transplanted on KD-50-I, which is based on Loongson 2F,reaches only 30% of its theoretical peak. In this paper, by exploiting Loop Unrolling technique to decrease memory access frequency, enhancing time and space locality to reduce cache misses and nonblocking cache mechanism to form memory access pipeline, the performance of optimized ATLAS can be improved to 50% higher.

    Reference
    Related
    Cited by
Get Citation

苏波,李凯,徐志广,何颂颂.龙芯2F上的访存优化.计算机系统应用,2010,19(1):171-175

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:March 04,2009
  • Revised:
  • Adopted:
  • Online:
  • Published:
Article QR Code
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063