Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning

In this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner...

Full description

Bibliographic Details
Main Authors: Jiawen Li, Tao Yu, Bo Yang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9312600/
id doaj-c90bb4237b46430c8abe08e03494acd0
record_format Article
spelling doaj-c90bb4237b46430c8abe08e03494acd02021-03-30T15:17:58ZengIEEEIEEE Access2169-35362021-01-0196063607510.1109/ACCESS.2020.30490729312600Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement LearningJiawen Li0https://orcid.org/0000-0003-4097-9922Tao Yu1Bo Yang2College of Electric Power, South China University of Technology, Guangzhou, ChinaCollege of Electric Power, South China University of Technology, Guangzhou, ChinaFaculty of Electric Power Engineering, Kunming University of Science and Technology, Kunming, ChinaIn this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner based on an ambient intelligence exploration multi-delay deep deterministic policy gradient (AIEM-DDPG) algorithm. This algorithm is an improvement on the original deep deterministic police gradient (DDPG) algorithm, which incorporates ambient intelligence exploration. The DDPG algorithm serves as the core, and the AIEM-DDPG algorithm runs on a variety of deep reinforcement learning algorithms, including soft actor-critic (SAC), deep deterministic policy gradient (DDPG), proximal policy optimization (PPO) and double deep Q-network (DDQN) algorithms, to attain distributed exploration in the environment. In addition, a classified priority experience replay mechanism is introduced to improve the exploration efficiency. Clipping multi-Q learning, policy delayed updating, target policy smooth regularization and other methods are utilized to solve the problem of Q-value overestimation. A model-free algorithm with good global searching ability and optimization speed is demonstrated. Simulation results show that the AIEM-DDPG adaptive PI controller attains better robustness and adaptability, as well as a good control effect.https://ieeexplore.ieee.org/document/9312600/Distributed deep reinforcement learningambient intelligence exploration multi-delay deep deterministic policy gradientproton exchange membrane fuel cellair mass flow controlintelligent controller
collection DOAJ
language English
format Article
sources DOAJ
author Jiawen Li
Tao Yu
Bo Yang
spellingShingle Jiawen Li
Tao Yu
Bo Yang
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
IEEE Access
Distributed deep reinforcement learning
ambient intelligence exploration multi-delay deep deterministic policy gradient
proton exchange membrane fuel cell
air mass flow control
intelligent controller
author_facet Jiawen Li
Tao Yu
Bo Yang
author_sort Jiawen Li
title Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
title_short Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
title_full Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
title_fullStr Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
title_full_unstemmed Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
title_sort adaptive controller of pemfc output voltage based on ambient intelligence large-scale deep reinforcement learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description In this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner based on an ambient intelligence exploration multi-delay deep deterministic policy gradient (AIEM-DDPG) algorithm. This algorithm is an improvement on the original deep deterministic police gradient (DDPG) algorithm, which incorporates ambient intelligence exploration. The DDPG algorithm serves as the core, and the AIEM-DDPG algorithm runs on a variety of deep reinforcement learning algorithms, including soft actor-critic (SAC), deep deterministic policy gradient (DDPG), proximal policy optimization (PPO) and double deep Q-network (DDQN) algorithms, to attain distributed exploration in the environment. In addition, a classified priority experience replay mechanism is introduced to improve the exploration efficiency. Clipping multi-Q learning, policy delayed updating, target policy smooth regularization and other methods are utilized to solve the problem of Q-value overestimation. A model-free algorithm with good global searching ability and optimization speed is demonstrated. Simulation results show that the AIEM-DDPG adaptive PI controller attains better robustness and adaptability, as well as a good control effect.
topic Distributed deep reinforcement learning
ambient intelligence exploration multi-delay deep deterministic policy gradient
proton exchange membrane fuel cell
air mass flow control
intelligent controller
url https://ieeexplore.ieee.org/document/9312600/
work_keys_str_mv AT jiawenli adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning
AT taoyu adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning
AT boyang adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning
_version_ 1724179703723458560