Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges

Deep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other areas. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras...

Full description

Bibliographic Details
Main Authors:	Yu Tian, Gaofeng Pan, Mohamed-Slim Alouini
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Open Journal of the Communications Society
Subjects:	Computer vision deep learning multiple-input and multiple-output beamforming beam tracking long short-term memory
Online Access:	https://ieeexplore.ieee.org/document/9305715/

id	doaj-e561fb0e65304838aee9fad201957c74
record_format	Article
spelling	doaj-e561fb0e65304838aee9fad201957c742021-03-29T18:58:10ZengIEEEIEEE Open Journal of the Communications Society2644-125X2021-01-01213214310.1109/OJCOMS.2020.30426309305715Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and ChallengesYu Tian0https://orcid.org/0000-0003-3394-3219Gaofeng Pan1https://orcid.org/0000-0003-1008-5717Mohamed-Slim Alouini2https://orcid.org/0000-0003-4827-1793Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi ArabiaComputer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi ArabiaComputer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi ArabiaDeep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other areas. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras. Therefore, exploring DL-based CV may yield useful information about objects, such as their number, locations, distribution, motion, etc. Intuitively, DL-based CV can also facilitate and improve the designs of wireless communications, especially in dynamic network scenarios. However, so far, such work is rare in the literature. The primary purpose of this article, then, is to introduce ideas about applying DL-based CV in wireless communications to bring some novel degrees of freedom to both theoretical research and engineering applications. To illustrate how DL-based CV can be applied in wireless communications, an example of using a DL-based CV with a millimeter-wave (mmWave) system is given to realize optimal mmWave multiple-input and multiple-output (MIMO) beamforming in mobile scenarios. In this example, we propose a framework to predict future beam indices from previously observed beam indices and images of street views using ResNet, 3-dimensional ResNext, and a long short-term memory network. The experimental results show that our frameworks achieve much higher accuracy than the baseline method, and that visual data can significantly improve the performance of the MIMO beamforming system. Finally, we discuss the opportunities and challenges of applying DL-based CV in wireless communications.https://ieeexplore.ieee.org/document/9305715/Computer visiondeep learningmultiple-input and multiple-outputbeamformingbeam trackinglong short-term memory
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Yu Tian Gaofeng Pan Mohamed-Slim Alouini
spellingShingle	Yu Tian Gaofeng Pan Mohamed-Slim Alouini Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges IEEE Open Journal of the Communications Society Computer vision deep learning multiple-input and multiple-output beamforming beam tracking long short-term memory
author_facet	Yu Tian Gaofeng Pan Mohamed-Slim Alouini
author_sort	Yu Tian
title	Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
title_short	Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
title_full	Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
title_fullStr	Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
title_full_unstemmed	Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
title_sort	applying deep-learning-based computer vision to wireless communications: methodologies, opportunities, and challenges
publisher	IEEE
series	IEEE Open Journal of the Communications Society
issn	2644-125X
publishDate	2021-01-01
description	Deep learning (DL) has seen great success in the computer vision (CV) field, and related techniques have been used in security, healthcare, remote sensing, and many other areas. As a parallel development, visual data has become universal in daily life, easily generated by ubiquitous low-cost cameras. Therefore, exploring DL-based CV may yield useful information about objects, such as their number, locations, distribution, motion, etc. Intuitively, DL-based CV can also facilitate and improve the designs of wireless communications, especially in dynamic network scenarios. However, so far, such work is rare in the literature. The primary purpose of this article, then, is to introduce ideas about applying DL-based CV in wireless communications to bring some novel degrees of freedom to both theoretical research and engineering applications. To illustrate how DL-based CV can be applied in wireless communications, an example of using a DL-based CV with a millimeter-wave (mmWave) system is given to realize optimal mmWave multiple-input and multiple-output (MIMO) beamforming in mobile scenarios. In this example, we propose a framework to predict future beam indices from previously observed beam indices and images of street views using ResNet, 3-dimensional ResNext, and a long short-term memory network. The experimental results show that our frameworks achieve much higher accuracy than the baseline method, and that visual data can significantly improve the performance of the MIMO beamforming system. Finally, we discuss the opportunities and challenges of applying DL-based CV in wireless communications.
topic	Computer vision deep learning multiple-input and multiple-output beamforming beam tracking long short-term memory
url	https://ieeexplore.ieee.org/document/9305715/
work_keys_str_mv	AT yutian applyingdeeplearningbasedcomputervisiontowirelesscommunicationsmethodologiesopportunitiesandchallenges AT gaofengpan applyingdeeplearningbasedcomputervisiontowirelesscommunicationsmethodologiesopportunitiesandchallenges AT mohamedslimalouini applyingdeeplearningbasedcomputervisiontowirelesscommunicationsmethodologiesopportunitiesandchallenges
_version_	1724196206489370624

Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges

Similar Items