系统仿真学报 ›› 2022, Vol. 34 ›› Issue (2): 258-268.doi: 10.16182/j.issn1004731x.joss.21-0337

• 仿真建模理论与方法 • 上一篇    下一篇

基于深度强化学习的云作业调度及仿真研究

李启锐1(), 彭心怡2()   

  1. 1.广东石油化工学院 计算机学院,广东 茂名 525000
    2.华南师范大学 数学科学学院,广东 广州 510631
  • 收稿日期:2021-04-20 修回日期:2021-07-01 出版日期:2022-02-18 发布日期:2022-02-23
  • 通讯作者: 彭心怡 E-mail:liqirui@gdupt.edu.cn;1742043887@qq.com
  • 作者简介:李启锐(1982-),男,硕士,副教授,研究方向云计算资源调度。E-mail:liqirui@gdupt.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(61772145);广东省自然科学基金资助项目(2020A1515010727);广东省科技专项资金资助项目(mmkj2020008)

Job Scheduling and Simulation in Cloud Based on Deep Reinforcement Learning

Qirui Li1(), Xinyi Peng2()   

  1. 1.College of Computer Science, Guangdong University of Petrochemical Technology, Maoming 525000, China
    2.School of Mathematical Sciences, South China Normal University, Guangzhou 510631, China
  • Received:2021-04-20 Revised:2021-07-01 Online:2022-02-18 Published:2022-02-23
  • Contact: Xinyi Peng E-mail:liqirui@gdupt.edu.cn;1742043887@qq.com

摘要:

针对复杂瞬变的多用户多队列多数据中心云计算环境中作业调度困难的问题,提出一种基于深度强化学习的作业调度方法。建立了云作业调度系统模型及其数学模型,并建立了由传输时间、等待时间和执行时间三部分构成的优化目标。基于深度强化学习设计了作业调度算法,给出了算法的状态空间、动作空间和奖赏函数。设计与开发了云作业仿真调度器,完成作业的仿真调度。仿真结果表明,相比随机调度、轮转调度、首次适应、最佳适应等基准算法,提出的算法能够有效降低作业的整体完工时间。

关键词: 云计算, 作业调度, 深度强化学习, 完工时间, 多用户, 多队列, 多数据中心

Abstract:

To solve the difficulty in job scheduling in the complex and transient multi-user, multi-queue, and multi-data-center cloud computing environment, this paper proposed a job scheduling method based on deep reinforcement learning. A system model of cloud job scheduling and its mathematical model were built, and an optimization goal consisting of transmission time, waiting time, and execution time was obtained. A job scheduling algorithm based on deep reinforcement learning was designed, and its state space, action space, and reward function were given. A simulated cloud job scheduler was designed and developed, and simulated scheduling experiments were conducted on it. The results show that compared with benchmark algorithms such as random scheduling, round-robin scheduling, firstfit, and optimal fit, the proposed algorithm could effectively reduce the overall makespan of the jobs.

Key words: cloud computing, job scheduling, deep reinforcement learning, makespan, multi-user, multi-queue, multi-data-center

中图分类号: