基于深度强化学习的燃料电池混合动力汽车能量管理策略研究
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:

深圳市海外高层次人才创新创业计划项目(KQJSCX20180330170047681);深圳无人驾驶感知决策与执行技术工程实验室计划项目 (Y7D004);深圳电动汽车动力平台与安全技术重点实验室计划项目


Research on Energy Management Strategy of Fuel Cell Hybrid Vehicles Based on Deep Reinforcement Learning
Author:
Affiliation:

Fund Project:

Shenzhen Science and Technology Innovation Commission (KQJSCX20180330170047681), Shenzhen Engineering Laboratory for Autonomous Driving Technology (Y7D004), and Shenzhen Key Laboratory of Electric Vehicle Powertrain Platform and Safety Technology

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    为提高燃料电池混合动力汽车的燃油经济性和燃料电池寿命,该文提出一种基于深度强化学 习(Deep Reinforcement Learning,DRL)的能量管理策略。该策略首先在 DRL 奖励信号中加入寿命因子,通过降低燃料电池功率波动,起到延长燃料电池寿命的效果;其次,通过限制 DRL 的动作空间的方法,使燃料电池系统工作在高效率区间,从而提高整车效率。在 UDDS、WLTC、Japan1015 三个标准工况下进行了离线训练,并在 NEDC 工况下实时应用以验证所提出策略的工况适应性。仿真结果显示,在离线训练中,所提出的策略可以快速收敛,表明其具有较好的稳定性。在燃油经济性方面,与基于动态规划的策略相比,在 3 个训练工况下的差异仅为 5.58%、3.03% 和 4.65%,接近最优燃油经济性;相比基于强化学习的策略,分别提升了 4.46%、7.26% 和 5.35%。与无寿命因子的 DRL 策略相比, 所提出的策略在 3 个训练工况下将燃料电池平均功率波动降低了 10.27%、47.95% 和 10.85%,这有利于提升燃料电池寿命。在未知工况的实时应用中,所提出策略的燃油经济性比基于强化学习的策略提升了 3.39%,这表明其工况适应性。

    Abstract:

    In order to improve the fuel economy and fuel cell lifetime of fuel cell hybrid vehicles, this research proposes an energy management strategy based on deep reinforcement learning (DRL). The strategy first adds a lifetime factor to reward signal of DRL, the lifetime of fuel cell is extended by limiting the power fluctuation. Then, the fuel cell system works in a high efficiency range by limiting the action space of DRL, improving the efficiency of the entire vehicle. After offline training under UDDS, WLTC, and Japan1015, it is applied in real time under NEDC to verify the adaptability of the proposed strategy. The results show that the proposed strategy can converge quickly in offline training, which proves its stability. Compared with dynamic programming-based strategy, the fuel economy difference in training cycles is only 5.58%, 3.03% and 4.65%, which is close to the optimal, and the promotion is 4.46%, 7.26% and 5.35% compared with reinforcement learning-based strategy. Compared with the DRL-based strategy without a lifetime factor, the proposed strategy reduces the average power fluctuation by 10.27%, 47.95%, and 10.85% under training cycles, which is beneficial to improve the fuel cell lifetime. In the real-time application, the fuel economy of the proposed strategy is improved by 3.39% compared with the reinforcement learningbased strategy, which proves its adaptability to unknown cycles.

    参考文献
    相似文献
    引证文献
引用本文

引文格式
李 卫,郑春花,许德州.基于深度强化学习的燃料电池混合动力汽车能量管理策略研究 [J].集成技术,2021,10(3):47-60

Citing format
LI Wei, ZHENG Chunhua, XU Dezhou. Research on Energy Management Strategy of Fuel Cell Hybrid Vehicles Based on Deep Reinforcement Learning[J]. Journal of Integration Technology,2021,10(3):47-60

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2021-05-26
  • 出版日期:
文章二维码