Online Learning for Position-Aided Millimeter Wave Beam Training

Accurate beam alignment is essential for the beam-based millimeter wave communications. The conventional beam sweeping solutions often have large overhead, which is unacceptable for mobile applications, such as a vehicle to everything. The learning-based solutions that leverage the sensor data (e.g....

Full description

Bibliographic Details
Main Authors:	Vutha Va, Takayuki Shimizu, Gaurav Bansal, Robert W. Heath
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Millimeter wave beam alignment beam refinement position-aided online learning multi-armed bandit
Online Access:	https://ieeexplore.ieee.org/document/8662770/

id	doaj-6ccfb40efd19457d8cc451e726b9cbdd
record_format	Article
spelling	doaj-6ccfb40efd19457d8cc451e726b9cbdd2021-03-29T22:18:33ZengIEEEIEEE Access2169-35362019-01-017305073052610.1109/ACCESS.2019.29023728662770Online Learning for Position-Aided Millimeter Wave Beam TrainingVutha Va0Takayuki Shimizu1https://orcid.org/0000-0002-1068-8510Gaurav Bansal2Robert W. Heath3https://orcid.org/0000-0002-4666-5628Wireless Networking and Communications Group, The University of Texas at Austin, Austin, TX, USAToyota Infotechnology Center, Mountain View, CA, USAToyota Infotechnology Center, Mountain View, CA, USAWireless Networking and Communications Group, The University of Texas at Austin, Austin, TX, USAAccurate beam alignment is essential for the beam-based millimeter wave communications. The conventional beam sweeping solutions often have large overhead, which is unacceptable for mobile applications, such as a vehicle to everything. The learning-based solutions that leverage the sensor data (e.g., position) to identify the good beam directions are one approach to reduce the overhead. Most existing solutions, though, are supervised learning, where the training data are collected beforehand. In this paper, we use a multi-armed bandit framework to develop the online learning algorithms for beam pair selection and refinement. The beam pair selection algorithm learns coarse beam directions in some predefined beam codebook, e.g., in discrete angles, separated by the 3 dB beamwidths. The beam refinement fine-tunes the identified directions to match the peak of the power angular spectrum at that position. The beam pair selection uses the upper confidence bound with a newly proposed risk-aware feature, while the beam refinement uses a modified optimistic optimization algorithm. The proposed algorithms learn to recommend the good beam pairs quickly. When using $16\times 16$ arrays at both transmitter and receiver, it can achieve, on average, 1-dB gain over the exhaustive search (over $271\times 271$ beam pairs) on the unrefined codebook within 100 time steps with a training budget of only 30 beam pairs.https://ieeexplore.ieee.org/document/8662770/Millimeter wavebeam alignmentbeam refinementposition-aidedonline learningmulti-armed bandit
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Vutha Va Takayuki Shimizu Gaurav Bansal Robert W. Heath
spellingShingle	Vutha Va Takayuki Shimizu Gaurav Bansal Robert W. Heath Online Learning for Position-Aided Millimeter Wave Beam Training IEEE Access Millimeter wave beam alignment beam refinement position-aided online learning multi-armed bandit
author_facet	Vutha Va Takayuki Shimizu Gaurav Bansal Robert W. Heath
author_sort	Vutha Va
title	Online Learning for Position-Aided Millimeter Wave Beam Training
title_short	Online Learning for Position-Aided Millimeter Wave Beam Training
title_full	Online Learning for Position-Aided Millimeter Wave Beam Training
title_fullStr	Online Learning for Position-Aided Millimeter Wave Beam Training
title_full_unstemmed	Online Learning for Position-Aided Millimeter Wave Beam Training
title_sort	online learning for position-aided millimeter wave beam training
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2019-01-01
description	Accurate beam alignment is essential for the beam-based millimeter wave communications. The conventional beam sweeping solutions often have large overhead, which is unacceptable for mobile applications, such as a vehicle to everything. The learning-based solutions that leverage the sensor data (e.g., position) to identify the good beam directions are one approach to reduce the overhead. Most existing solutions, though, are supervised learning, where the training data are collected beforehand. In this paper, we use a multi-armed bandit framework to develop the online learning algorithms for beam pair selection and refinement. The beam pair selection algorithm learns coarse beam directions in some predefined beam codebook, e.g., in discrete angles, separated by the 3 dB beamwidths. The beam refinement fine-tunes the identified directions to match the peak of the power angular spectrum at that position. The beam pair selection uses the upper confidence bound with a newly proposed risk-aware feature, while the beam refinement uses a modified optimistic optimization algorithm. The proposed algorithms learn to recommend the good beam pairs quickly. When using $16\times 16$ arrays at both transmitter and receiver, it can achieve, on average, 1-dB gain over the exhaustive search (over $271\times 271$ beam pairs) on the unrefined codebook within 100 time steps with a training budget of only 30 beam pairs.
topic	Millimeter wave beam alignment beam refinement position-aided online learning multi-armed bandit
url	https://ieeexplore.ieee.org/document/8662770/
work_keys_str_mv	AT vuthava onlinelearningforpositionaidedmillimeterwavebeamtraining AT takayukishimizu onlinelearningforpositionaidedmillimeterwavebeamtraining AT gauravbansal onlinelearningforpositionaidedmillimeterwavebeamtraining AT robertwheath onlinelearningforpositionaidedmillimeterwavebeamtraining
_version_	1724191852068864000

Online Learning for Position-Aided Millimeter Wave Beam Training

Similar Items