A Survey on Semi-, Self- and Unsupervised Learning for Image Classification

While deep learning strategies achieve outstanding results in computer vision tasks, one issue remains: The current strategies rely heavily on a huge amount of labeled data. In many real-world problems, it is not feasible to create such an amount of labeled training data. Therefore, it is common to...

Full description

Bibliographic Details
Main Authors: Lars Schmarje, Monty Santarossa, Simon-Martin Schroder, Reinhard Koch
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9442775/
id doaj-19eacb9697b84a03ab19befee1c03787
record_format Article
spelling doaj-19eacb9697b84a03ab19befee1c037872021-06-10T23:00:41ZengIEEEIEEE Access2169-35362021-01-019821468216810.1109/ACCESS.2021.30843589442775A Survey on Semi-, Self- and Unsupervised Learning for Image ClassificationLars Schmarje0https://orcid.org/0000-0002-6945-5957Monty Santarossa1https://orcid.org/0000-0002-4159-1367Simon-Martin Schroder2https://orcid.org/0000-0002-6603-9907Reinhard Koch3https://orcid.org/0000-0003-4398-1569Multimedia Information Processing Group, Kiel University, Kiel, GermanyMultimedia Information Processing Group, Kiel University, Kiel, GermanyMultimedia Information Processing Group, Kiel University, Kiel, GermanyMultimedia Information Processing Group, Kiel University, Kiel, GermanyWhile deep learning strategies achieve outstanding results in computer vision tasks, one issue remains: The current strategies rely heavily on a huge amount of labeled data. In many real-world problems, it is not feasible to create such an amount of labeled training data. Therefore, it is common to incorporate unlabeled data into the training process to reach equal results with fewer labels. Due to a lot of concurrent research, it is difficult to keep track of recent developments. In this survey, we provide an overview of often used ideas and methods in image classification with fewer labels. We compare 34 methods in detail based on their performance and their commonly used ideas rather than a fine-grained taxonomy. In our analysis, we identify three major trends that lead to future research opportunities. 1. State-of-the-art methods are scalable to real-world applications in theory but issues like class imbalance, robustness, or fuzzy labels are not considered. 2. The degree of supervision which is needed to achieve comparable results to the usage of all labels is decreasing and therefore methods need to be extended to settings with a variable number of classes. 3. All methods share some common ideas but we identify clusters of methods that do not share many ideas. We show that combining ideas from different clusters can lead to better performance.https://ieeexplore.ieee.org/document/9442775/Semi-supervisedself-supervisedunsupervisedimage classificationdeep learningsurvey
collection DOAJ
language English
format Article
sources DOAJ
author Lars Schmarje
Monty Santarossa
Simon-Martin Schroder
Reinhard Koch
spellingShingle Lars Schmarje
Monty Santarossa
Simon-Martin Schroder
Reinhard Koch
A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
IEEE Access
Semi-supervised
self-supervised
unsupervised
image classification
deep learning
survey
author_facet Lars Schmarje
Monty Santarossa
Simon-Martin Schroder
Reinhard Koch
author_sort Lars Schmarje
title A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
title_short A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
title_full A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
title_fullStr A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
title_full_unstemmed A Survey on Semi-, Self- and Unsupervised Learning for Image Classification
title_sort survey on semi-, self- and unsupervised learning for image classification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description While deep learning strategies achieve outstanding results in computer vision tasks, one issue remains: The current strategies rely heavily on a huge amount of labeled data. In many real-world problems, it is not feasible to create such an amount of labeled training data. Therefore, it is common to incorporate unlabeled data into the training process to reach equal results with fewer labels. Due to a lot of concurrent research, it is difficult to keep track of recent developments. In this survey, we provide an overview of often used ideas and methods in image classification with fewer labels. We compare 34 methods in detail based on their performance and their commonly used ideas rather than a fine-grained taxonomy. In our analysis, we identify three major trends that lead to future research opportunities. 1. State-of-the-art methods are scalable to real-world applications in theory but issues like class imbalance, robustness, or fuzzy labels are not considered. 2. The degree of supervision which is needed to achieve comparable results to the usage of all labels is decreasing and therefore methods need to be extended to settings with a variable number of classes. 3. All methods share some common ideas but we identify clusters of methods that do not share many ideas. We show that combining ideas from different clusters can lead to better performance.
topic Semi-supervised
self-supervised
unsupervised
image classification
deep learning
survey
url https://ieeexplore.ieee.org/document/9442775/
work_keys_str_mv AT larsschmarje asurveyonsemiselfandunsupervisedlearningforimageclassification
AT montysantarossa asurveyonsemiselfandunsupervisedlearningforimageclassification
AT simonmartinschroder asurveyonsemiselfandunsupervisedlearningforimageclassification
AT reinhardkoch asurveyonsemiselfandunsupervisedlearningforimageclassification
AT larsschmarje surveyonsemiselfandunsupervisedlearningforimageclassification
AT montysantarossa surveyonsemiselfandunsupervisedlearningforimageclassification
AT simonmartinschroder surveyonsemiselfandunsupervisedlearningforimageclassification
AT reinhardkoch surveyonsemiselfandunsupervisedlearningforimageclassification
_version_ 1721384347120959488