A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis
The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive v...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8744516/ |
id |
doaj-7aff2a2e5014443db61371d8ba764066 |
---|---|
record_format |
Article |
spelling |
doaj-7aff2a2e5014443db61371d8ba7640662021-03-29T23:22:10ZengIEEEIEEE Access2169-35362019-01-017840038401910.1109/ACCESS.2019.29247338744516A Survey of Deep Learning Solutions for Multimedia Visual Content AnalysisMuhammad Shahroz Nadeem0https://orcid.org/0000-0001-5835-1602Virginia N. L. Franqueira1https://orcid.org/0000-0003-1332-9115Xiaojun Zhai2https://orcid.org/0000-0002-1030-8311Fatih Kurugollu3College of Engineering and Technology, University of Derby, Derby, U.K.College of Engineering and Technology, University of Derby, Derby, U.K.School of Computer Science and Electronics Engineering, University of Essex, Colchester, U.K.College of Engineering and Technology, University of Derby, Derby, U.K.The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive volumes of digital media, including images and videos. Such media is full of visual information and holds immense value in today's world. The volume of data involved calls for automated visual content analysis systems able to meet the demands of practice in terms of efficiency and effectiveness. Deep learning (DL) has recently emerged as a prominent technique for visual content analysis. It is data-driven in nature and provides automatic end-to-end learning solutions without the need to rely explicitly on predefined handcrafted feature extractors. Another appealing characteristic of DL solutions is the performance they can achieve, once the network is trained, under practical constraints. This paper identifies eight problem domains which require analysis of visual artifacts in multimedia. It surveys the recent, authoritative, and the best performing DL solutions and lists the datasets used in the development of these deep methods for the identified types of visual analysis problems. This paper also discusses the challenges that the DL solutions face which can compromise their reliability, robustness, and accuracy for visual content analysis.https://ieeexplore.ieee.org/document/8744516/Visual content analysisdeep learningmachine learningdataset |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Muhammad Shahroz Nadeem Virginia N. L. Franqueira Xiaojun Zhai Fatih Kurugollu |
spellingShingle |
Muhammad Shahroz Nadeem Virginia N. L. Franqueira Xiaojun Zhai Fatih Kurugollu A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis IEEE Access Visual content analysis deep learning machine learning dataset |
author_facet |
Muhammad Shahroz Nadeem Virginia N. L. Franqueira Xiaojun Zhai Fatih Kurugollu |
author_sort |
Muhammad Shahroz Nadeem |
title |
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis |
title_short |
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis |
title_full |
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis |
title_fullStr |
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis |
title_full_unstemmed |
A Survey of Deep Learning Solutions for Multimedia Visual Content Analysis |
title_sort |
survey of deep learning solutions for multimedia visual content analysis |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The increasing use of social media networks on handheld devices, especially smartphones with powerful built-in cameras, and the widespread availability of fast and high bandwidth broadband connections, added to the popularity of cloud storage, is enabling the generation and distribution of massive volumes of digital media, including images and videos. Such media is full of visual information and holds immense value in today's world. The volume of data involved calls for automated visual content analysis systems able to meet the demands of practice in terms of efficiency and effectiveness. Deep learning (DL) has recently emerged as a prominent technique for visual content analysis. It is data-driven in nature and provides automatic end-to-end learning solutions without the need to rely explicitly on predefined handcrafted feature extractors. Another appealing characteristic of DL solutions is the performance they can achieve, once the network is trained, under practical constraints. This paper identifies eight problem domains which require analysis of visual artifacts in multimedia. It surveys the recent, authoritative, and the best performing DL solutions and lists the datasets used in the development of these deep methods for the identified types of visual analysis problems. This paper also discusses the challenges that the DL solutions face which can compromise their reliability, robustness, and accuracy for visual content analysis. |
topic |
Visual content analysis deep learning machine learning dataset |
url |
https://ieeexplore.ieee.org/document/8744516/ |
work_keys_str_mv |
AT muhammadshahroznadeem asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT virginianlfranqueira asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT xiaojunzhai asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT fatihkurugollu asurveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT muhammadshahroznadeem surveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT virginianlfranqueira surveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT xiaojunzhai surveyofdeeplearningsolutionsformultimediavisualcontentanalysis AT fatihkurugollu surveyofdeeplearningsolutionsformultimediavisualcontentanalysis |
_version_ |
1724189655557996544 |