AI Scheduling Engine Platform Based on Kubernetes

doi:10.15888/j.cnki.csa.009182


Volume 32, Issue 8, 2023, pp. 86-94


Authors:
LIU Xiang, Guangzhou Institution of Technology, Xidian University, Guangzhou 510555, China
HU Rui-Min, Hangzhou Institution of Technology, Xidian University, Hangzhou 311231, China
WANG Hai-Bin, Xiamen Meiya Baike Information Co. Ltd., Xiamen 361008, China

                    


Abstract:

This paper presents the design and implementation of an AI scheduling engine platform based on Kubernetes. Current AI scheduling systems suffer from complex service configuration, unbalanced utilization of computing resources across the nodes of a cluster, and high operation and maintenance costs. To address these problems, the study proposes a Kubernetes-based solution for container scheduling and service management. The modules of the platform are designed around the requirements of the AI scheduling engine, covering both function implementation and platform architecture. Because Kubernetes cannot perceive GPU resources, a Device Plugin is introduced to collect GPU information on each node in the cluster and report it to the scheduler. In addition, the priority algorithms in the Kubernetes scheduling strategy consider only the resource utilization rate and balance degree of the node itself, and disregard how different types of applications differ in their demand for node resources. A priority algorithm based on the Pearson correlation coefficient (PCC) is therefore put forward: the node for a Pod is chosen by calculating how well the container's resource demand complements the node's resource utilization rate, which keeps the resources of each node balanced after scheduling.
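The abstract does not give the scoring formula, so the following Python sketch is only one plausible reading of "complementary degree", not the authors' algorithm. It assumes that a Pod's demand and a node's utilization are both expressed as fractions of node capacity over the same resource dimensions (CPU, memory, GPU), and that a strongly negative Pearson correlation between the two vectors means the Pod mostly needs what the node has free. The function names, the mapping of the correlation to a score in [0, 1], and the feasibility check are all hypothetical.

```python
from math import sqrt
from typing import Dict, Optional, Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length vectors.

    Returns 0.0 when either vector is constant (correlation undefined);
    that fallback is a choice made for this sketch.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length vectors with at least 2 entries")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def complementarity_score(demand: Sequence[float],
                          utilization: Sequence[float]) -> float:
    """Map the correlation r in [-1, 1] to a score in [0, 1].

    Assumed mapping: r = -1 (demand peaks where the node is idle) scores 1.0,
    r = +1 (demand peaks where the node is already busy) scores 0.0.
    """
    return (1.0 - pearson(demand, utilization)) / 2.0


def pick_node(demand: Sequence[float],
              nodes: Dict[str, Sequence[float]]) -> Optional[str]:
    """Return the feasible node with the highest complementarity score."""
    # Filter step: drop nodes where any resource would exceed capacity.
    feasible = {
        name: util for name, util in nodes.items()
        if all(u + d <= 1.0 for u, d in zip(util, demand))
    }
    if not feasible:
        return None
    # Priority step: prefer the node whose load profile best complements the Pod.
    return max(feasible,
               key=lambda name: complementarity_score(demand, feasible[name]))


# Illustrative data only: (CPU, memory, GPU) as fractions of node capacity.
cpu_heavy_pod = [0.5, 0.1, 0.1]
cluster = {
    "node-a": [0.4, 0.1, 0.1],  # CPU is already the busiest resource
    "node-b": [0.1, 0.6, 0.5],  # CPU mostly idle, memory and GPU busy
}
chosen = pick_node(cpu_heavy_pod, cluster)
```

With these illustrative numbers the CPU-heavy Pod is steered to node-b, whose CPU is the least-used resource, which is the balancing effect the abstract describes. A real scheduler would also have to handle heterogeneous node capacities and combine this score with the other priority functions.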

Key words: Kubernetes; container; schedule; Pearson correlation coefficient (PCC)

Citation:

LIU Xiang, HU Rui-Min, WANG Hai-Bin. AI Scheduling Engine Platform Based on Kubernetes. Computer Systems & Applications, 2023, 32(8): 86-94


History

Received: January 9, 2023
Revised: February 9, 2023
Online: May 22, 2023
