Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning
In this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9312600/ |
id |
doaj-c90bb4237b46430c8abe08e03494acd0 |
---|---|
record_format |
Article |
spelling |
doaj-c90bb4237b46430c8abe08e03494acd02021-03-30T15:17:58ZengIEEEIEEE Access2169-35362021-01-0196063607510.1109/ACCESS.2020.30490729312600Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement LearningJiawen Li0https://orcid.org/0000-0003-4097-9922Tao Yu1Bo Yang2College of Electric Power, South China University of Technology, Guangzhou, ChinaCollege of Electric Power, South China University of Technology, Guangzhou, ChinaFaculty of Electric Power Engineering, Kunming University of Science and Technology, Kunming, ChinaIn this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner based on an ambient intelligence exploration multi-delay deep deterministic policy gradient (AIEM-DDPG) algorithm. This algorithm is an improvement on the original deep deterministic police gradient (DDPG) algorithm, which incorporates ambient intelligence exploration. The DDPG algorithm serves as the core, and the AIEM-DDPG algorithm runs on a variety of deep reinforcement learning algorithms, including soft actor-critic (SAC), deep deterministic policy gradient (DDPG), proximal policy optimization (PPO) and double deep Q-network (DDQN) algorithms, to attain distributed exploration in the environment. In addition, a classified priority experience replay mechanism is introduced to improve the exploration efficiency. Clipping multi-Q learning, policy delayed updating, target policy smooth regularization and other methods are utilized to solve the problem of Q-value overestimation. A model-free algorithm with good global searching ability and optimization speed is demonstrated. Simulation results show that the AIEM-DDPG adaptive PI controller attains better robustness and adaptability, as well as a good control effect.https://ieeexplore.ieee.org/document/9312600/Distributed deep reinforcement learningambient intelligence exploration multi-delay deep deterministic policy gradientproton exchange membrane fuel cellair mass flow controlintelligent controller |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jiawen Li Tao Yu Bo Yang |
spellingShingle |
Jiawen Li Tao Yu Bo Yang Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning IEEE Access Distributed deep reinforcement learning ambient intelligence exploration multi-delay deep deterministic policy gradient proton exchange membrane fuel cell air mass flow control intelligent controller |
author_facet |
Jiawen Li Tao Yu Bo Yang |
author_sort |
Jiawen Li |
title |
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning |
title_short |
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning |
title_full |
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning |
title_fullStr |
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning |
title_full_unstemmed |
Adaptive Controller of PEMFC Output Voltage Based on Ambient Intelligence Large-Scale Deep Reinforcement Learning |
title_sort |
adaptive controller of pemfc output voltage based on ambient intelligence large-scale deep reinforcement learning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
In this article, an adaptive Proportion integration (PI) controller for varying the output voltage of a proton exchange membrane fuel cell (PEMFC) is proposed. The PI controller operates on the basis of ambient intelligence large-scale deep reinforcement learning. It functions as a coefficient tuner based on an ambient intelligence exploration multi-delay deep deterministic policy gradient (AIEM-DDPG) algorithm. This algorithm is an improvement on the original deep deterministic police gradient (DDPG) algorithm, which incorporates ambient intelligence exploration. The DDPG algorithm serves as the core, and the AIEM-DDPG algorithm runs on a variety of deep reinforcement learning algorithms, including soft actor-critic (SAC), deep deterministic policy gradient (DDPG), proximal policy optimization (PPO) and double deep Q-network (DDQN) algorithms, to attain distributed exploration in the environment. In addition, a classified priority experience replay mechanism is introduced to improve the exploration efficiency. Clipping multi-Q learning, policy delayed updating, target policy smooth regularization and other methods are utilized to solve the problem of Q-value overestimation. A model-free algorithm with good global searching ability and optimization speed is demonstrated. Simulation results show that the AIEM-DDPG adaptive PI controller attains better robustness and adaptability, as well as a good control effect. |
topic |
Distributed deep reinforcement learning ambient intelligence exploration multi-delay deep deterministic policy gradient proton exchange membrane fuel cell air mass flow control intelligent controller |
url |
https://ieeexplore.ieee.org/document/9312600/ |
work_keys_str_mv |
AT jiawenli adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning AT taoyu adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning AT boyang adaptivecontrollerofpemfcoutputvoltagebasedonambientintelligencelargescaledeepreinforcementlearning |
_version_ |
1724179703723458560 |