A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis

The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive v...

Full description

Bibliographic Details
Main Authors: Muhammad Shahroz Nadeem, Virginia N. L. Franqueira, Xiaojun Zhai, Fatih Kurugollu
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8744516/
id doaj-7aff2a2e5014443db61371d8ba764066
record_format Article
spelling doaj-7aff2a2e5014443db61371d8ba7640662021-03-29T23:22:10ZengIEEEIEEE Access2169-35362019-01-017840038401910.1109/ACCESS.2019.29247338744516A Survey of Deep Learning Solutions for Multimedia Visual Content AnalysisMuhammad Shahroz Nadeem0https://orcid.org/0000-0001-5835-1602Virginia N. L. Franqueira1https://orcid.org/0000-0003-1332-9115Xiaojun Zhai2https://orcid.org/0000-0002-1030-8311Fatih Kurugollu3College of Engineering and Technology, University of Derby, Derby, U.K.College of Engineering and Technology, University of Derby, Derby, U.K.School of Computer Science and Electronics Engineering, University of Essex, Colchester, U.K.College of Engineering and Technology, University of Derby, Derby, U.K.The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive volumes of digital media, including images and videos. Such media is full of visual information and holds immense value in today's world. The volume of data involved calls for automated visual content analysis systems able to meet the demands of practice in terms of efficiency and effectiveness. Deep learning (DL) has recently emerged as a prominent technique for visual content analysis. It is data-driven in nature and provides automatic end-to-end learning solutions without the need to rely explicitly on predefined handcrafted feature extractors. Another appealing characteristic of DL solutions is the performance they can achieve, once the network is trained, under practical constraints. This paper identifies eight problem domains which require analysis of visual artifacts in multimedia. It surveys the recent, authoritative, and the best performing DL solutions and lists the datasets used in the development of these deep methods for the identified types of visual analysis problems. This paper also discusses the challenges that the DL solutions face which can compromise their reliability, robustness, and accuracy for visual content analysis.https://ieeexplore.ieee.org/document/8744516/Visual content analysisdeep learningmachine learningdataset
collection DOAJ
language English
format Article
sources DOAJ
author Muhammad Shahroz Nadeem
Virginia N. L. Franqueira
Xiaojun Zhai
Fatih Kurugollu
spellingShingle Muhammad Shahroz Nadeem
Virginia N. L. Franqueira
Xiaojun Zhai
Fatih Kurugollu
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
IEEE Access
Visual content analysis
deep learning
machine learning
dataset
author_facet Muhammad Shahroz Nadeem
Virginia N. L. Franqueira
Xiaojun Zhai
Fatih Kurugollu
author_sort Muhammad Shahroz Nadeem
title A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
title_short A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
title_full A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
title_fullStr A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
title_full_unstemmed A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
title_sort survey of deep learning solutions for multimedia visual content analysis
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive volumes of digital media, including images and videos. Such media is full of visual information and holds immense value in today's world. The volume of data involved calls for automated visual content analysis systems able to meet the demands of practice in terms of efficiency and effectiveness. Deep learning (DL) has recently emerged as a prominent technique for visual content analysis. It is data-driven in nature and provides automatic end-to-end learning solutions without the need to rely explicitly on predefined handcrafted feature extractors. Another appealing characteristic of DL solutions is the performance they can achieve, once the network is trained, under practical constraints. This paper identifies eight problem domains which require analysis of visual artifacts in multimedia. It surveys the recent, authoritative, and the best performing DL solutions and lists the datasets used in the development of these deep methods for the identified types of visual analysis problems. This paper also discusses the challenges that the DL solutions face which can compromise their reliability, robustness, and accuracy for visual content analysis.
topic Visual content analysis
deep learning
machine learning
dataset
url https://ieeexplore.ieee.org/document/8744516/
work_keys_str_mv AT muhammadshahroznadeem asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT virginianlfranqueira asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT xiaojunzhai asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT fatihkurugollu asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT muhammadshahroznadeem surveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT virginianlfranqueira surveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT xiaojunzhai surveyofdeeplearningsolutionsformultimediavisualcontentanalysis
AT fatihkurugollu surveyofdeeplearningsolutionsformultimediavisualcontentanalysis
_version_ 1724189655557996544