Journal of System Simulation, 2016, Vol. 28, Issue (3): 592-599.

• Simulation Application Engineering •

Data Management in Data Processing Frameworks for Green Data Centers

Zhang Xiao1, Gao Yuan2, Wang Xiaoliang1, Ge Yiyong2, Yang Haixiang1, Wan Shupeng2

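  1. State Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing University, Nanjing 210023;
    2. NARI Technology Co., Ltd., Nanjing 210023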
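  • Received: 2014-07-15 Revised: 2014-10-24 Published: 2020-07-02
  • About the author: Zhang Xiao (1991-), male, from Shandong, master's student; research interest: data centers.
  • Funding:
    National Natural Science Foundation of China (61370028, 91218302, 61321491); Natural Science Foundation of Jiangsu Province (BK2011191); Jiangsu Provincial Science and Technology Support Program (BE2013116); Fundamental Research Funds for the Central Universities (20620140514); Science and Technology Project of State Grid Corporation of China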

Data Management of Data Processing Framework in Green Data Center

Zhang Xiao1, Gao Yuan2, Wang Xiaoliang1, Ge Yiyong2, Yang Haixiang1, Wan Shupeng2   

  1. State Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China
    2. NARI Technology Co., Ltd., Nanjing 210023, China
  • Received: 2014-07-15 Revised: 2014-10-24 Published: 2020-07-02

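Abstract: Using green energy has become an effective way to address the energy consumption problem of data centers. To mitigate the impact of the large fluctuations in green energy supply, deferrable jobs are usually placed in a waiting queue and the corresponding idle servers are put to sleep, which lowers system energy consumption; the jobs are then executed when renewable energy is available. When a new job runs, sleeping servers have to be reactivated to guarantee data availability. The mismatch between data placement and job execution time causes sleeping servers to be reactivated frequently, which wastes energy. A data scheduling strategy for green data centers is proposed: following the scheduling order of the jobs in the waiting queue of the data processing framework, the data blocks that will be read in the near future are copied in advance to active servers, which reduces the number of times sleeping servers are reactivated and thus lowers overall energy consumption. Simulation results show that the algorithm reduces the number of repeated reactivations of sleeping servers by 43% on average.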

Keywords: green data center, data processing framework, energy consumption, renewable energy, data management

Abstract: Using renewable energy in data centers is an environmentally friendly way to address their high energy consumption. Because the renewable energy supply is variable, delaying jobs that have no strict deadline is a widely used strategy to maximize the use of renewable energy. Meanwhile, putting idle servers to sleep further reduces energy consumption. If the data required by the jobs to be processed are unavailable, some sleeping servers need to be reactivated to guarantee that the data are available. Such operations may waste energy because of frequent reactivations. An effective data management algorithm is proposed, which copies the data required by the jobs in the waiting queue to active servers in advance. By doing so, the number of times sleeping servers are reactivated can be reduced. Simulation results show that this number can be reduced by 43% on average.

Key words: green data center, data processing framework, energy, renewable energy, data management
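
The idea summarized in the abstract, copying blocks needed by queued jobs onto active servers before those jobs run, can be illustrated with a minimal sketch. This is not the authors' algorithm: the names `Job`, `plan_prefetch`, `apply_plan`, and `count_wakeups`, the `lookahead` window, the least-loaded target choice, and the toy wake-up model (one sleeping replica holder is woken per missing block and sleeps again after the job) are all assumptions made for illustration. The sketch also assumes copies are made while the source servers are still awake.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    name: str
    blocks: frozenset  # IDs of the data blocks the job reads


def plan_prefetch(queue, placement, active, lookahead):
    """Plan (block, target) copies for blocks that the next `lookahead`
    queued jobs read and that have no replica on an active server."""
    # Hypothetical load metric: number of blocks already held per active server.
    load = {s: sum(s in holders for holders in placement.values())
            for s in sorted(active)}
    plan, planned = [], set()
    for job in queue[:lookahead]:
        for block in sorted(job.blocks):
            if block in planned or placement[block] & active:
                continue  # already planned, or already readable without a wake-up
            target = min(load, key=lambda s: (load[s], s))  # least-loaded server
            plan.append((block, target))
            planned.add(block)
            load[target] += 1
    return plan


def apply_plan(placement, plan):
    """Return a new placement with the planned replicas added."""
    updated = {block: set(holders) for block, holders in placement.items()}
    for block, target in plan:
        updated[block].add(target)
    return updated


def count_wakeups(queue, placement, active):
    """Toy model: each job wakes one sleeping holder per block that no active
    (or already woken) server holds; woken servers sleep again after the job."""
    wakeups = 0
    for job in queue:
        woken = set()
        for block in sorted(job.blocks):
            holders = placement[block]
            if holders & active or holders & woken:
                continue
            woken.add(min(holders))
            wakeups += 1
    return wakeups


# Toy input (invented for illustration, not data from the paper).
placement = {"b1": {"s3"}, "b2": {"s1"}, "b3": {"s3", "s4"}, "b4": {"s4"}}
active = {"s1", "s2"}
queue = [Job("j1", frozenset({"b1", "b2"})),
         Job("j2", frozenset({"b1", "b3"})),
         Job("j3", frozenset({"b4"}))]

plan = plan_prefetch(queue, placement, active, lookahead=2)
print(plan)
print(count_wakeups(queue, placement, active))
print(count_wakeups(queue, apply_plan(placement, plan), active))
```

On this toy input the wake-up count drops from 3 to 1; the figure only reflects the invented data and says nothing about the 43% reported in the abstract. A larger `lookahead` trades extra replica storage on active servers for fewer wake-ups.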

CLC number: