Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services

Abstract Ultra-reliable and low-latency communication (URLLC) in mobile networks is still one of the core solutions that require thorough research in 5G and beyond. With the vigorous development of various emerging URLLC technologies, resource shortages will soon occur even in mmWave cells with rich...

Full description

Bibliographic Details
Main Authors: Qian Huang, Xianzhong Xie, Mohamed Cheriet
Format: Article
Language:English
Published: SpringerOpen 2020-12-01
Series:EURASIP Journal on Wireless Communications and Networking
Subjects:
Online Access:https://doi.org/10.1186/s13638-020-01872-5
id doaj-6d2d76342dfd442c82581a257609e827
record_format Article
spelling doaj-6d2d76342dfd442c82581a257609e8272020-12-13T12:23:36ZengSpringerOpenEURASIP Journal on Wireless Communications and Networking1687-14992020-12-012020112110.1186/s13638-020-01872-5Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC servicesQian Huang0Xianzhong Xie1Mohamed Cheriet2School of Computer Science and Technology, Chongqing University of Posts and TelecommunicationsSchool of Computer Science and Technology, Chongqing University of Posts and TelecommunicationsÉcole de Technologiesupérieure, Université du QuébecAbstract Ultra-reliable and low-latency communication (URLLC) in mobile networks is still one of the core solutions that require thorough research in 5G and beyond. With the vigorous development of various emerging URLLC technologies, resource shortages will soon occur even in mmWave cells with rich spectrum resources. As a result of the large radio resource space of mmWave, traditional real-time resource scheduling decisions can cause serious delays. Consequently, we investigate a delay minimization problem with the spectrum and power constraints in the mmWave hybrid access network. To reduce the delay caused by high load and radio resource shortage, a hybrid spectrum and power resource allocation scheme based on reinforcement learning (RL) is proposed. We compress the state space and the action space by temporarily dumping and decomposing the action. The multipath deep neural network and policy gradient method are used, respectively, as the approximater and update method of the parameterized policy. The experimental results reveal that the RL-based hybrid spectrum and the power resource allocation scheme eventually converged after a limited number of iterative learnings. Compared with other schemes, the RL-based scheme can effectively guarantee the URLLC delay constraint when the load does not exceed 130%.https://doi.org/10.1186/s13638-020-01872-5Ultra-reliable and low-latency communicationRadio resource allocationmmWaveHybrid spectrumReinforcement learningMultipath deep neural network
collection DOAJ
language English
format Article
sources DOAJ
author Qian Huang
Xianzhong Xie
Mohamed Cheriet
spellingShingle Qian Huang
Xianzhong Xie
Mohamed Cheriet
Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
EURASIP Journal on Wireless Communications and Networking
Ultra-reliable and low-latency communication
Radio resource allocation
mmWave
Hybrid spectrum
Reinforcement learning
Multipath deep neural network
author_facet Qian Huang
Xianzhong Xie
Mohamed Cheriet
author_sort Qian Huang
title Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
title_short Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
title_full Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
title_fullStr Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
title_full_unstemmed Reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of URLLC services
title_sort reinforcement learning-based hybrid spectrum resource allocation scheme for the high load of urllc services
publisher SpringerOpen
series EURASIP Journal on Wireless Communications and Networking
issn 1687-1499
publishDate 2020-12-01
description Abstract Ultra-reliable and low-latency communication (URLLC) in mobile networks is still one of the core solutions that require thorough research in 5G and beyond. With the vigorous development of various emerging URLLC technologies, resource shortages will soon occur even in mmWave cells with rich spectrum resources. As a result of the large radio resource space of mmWave, traditional real-time resource scheduling decisions can cause serious delays. Consequently, we investigate a delay minimization problem with the spectrum and power constraints in the mmWave hybrid access network. To reduce the delay caused by high load and radio resource shortage, a hybrid spectrum and power resource allocation scheme based on reinforcement learning (RL) is proposed. We compress the state space and the action space by temporarily dumping and decomposing the action. The multipath deep neural network and policy gradient method are used, respectively, as the approximater and update method of the parameterized policy. The experimental results reveal that the RL-based hybrid spectrum and the power resource allocation scheme eventually converged after a limited number of iterative learnings. Compared with other schemes, the RL-based scheme can effectively guarantee the URLLC delay constraint when the load does not exceed 130%.
topic Ultra-reliable and low-latency communication
Radio resource allocation
mmWave
Hybrid spectrum
Reinforcement learning
Multipath deep neural network
url https://doi.org/10.1186/s13638-020-01872-5
work_keys_str_mv AT qianhuang reinforcementlearningbasedhybridspectrumresourceallocationschemeforthehighloadofurllcservices
AT xianzhongxie reinforcementlearningbasedhybridspectrumresourceallocationschemeforthehighloadofurllcservices
AT mohamedcheriet reinforcementlearningbasedhybridspectrumresourceallocationschemeforthehighloadofurllcservices
_version_ 1724384774373507072