A study of self-training variants for semi-supervised image classification

Sahito, Attaullah

A study of self-training variants for semi-supervised image classification

Authors

Sahito, Attaullah

Files

thesis.pdf (5.89 MB)

Permanent Link

https://hdl.handle.net/10289/14678

Rights

Abstract

Artificial neural networks achieve state-of-the-art performance when trained on a vast number of labelled examples. Still, they can easily overfit training examples when few labelled examples are available. The requirement to have labels for all training examples is a strong limitation of standard supervised machine learning. This can be addressed by applying semi-supervised learning methods that extend supervised learning and use unlabelled examples. Self-training is the most basic and generic semi-supervised approach. In self-training, a model is trained iteratively on both labelled and pseudo-labelled examples obtained from previous iterations. This thesis focuses on the task of investigating different variants of self-training by applying metric learning, transfer learning, and self-supervised learning. The first part of this thesis investigates how metric learning can be applied to self-training. This is achieved by applying several metric learning losses for the training of feedforward neural networks. Experimental results show that triplet loss – a metric learning loss – can achieve better results than cross-entropy loss with simple neural networks. For improving the performance of self-training, the second part of the thesis investigates applying large neural networks and pretraining on various image sizes of ImageNet with different loss functions. Experimental results show that pretraining always improves the predictive performance of the model. Pretraining on smaller image sizes with cross-entropy loss provides the highest performance. In the third part of this thesis, several self-training methods are developed using self-supervised learning. Geometric transformation-based self-supervised learning is applied to unlabelled examples. The experimental results indicate that applying self-supervised learning for only the first iteration achieves better performance than using it in all iterations of self-training.

Type

Thesis

Date

2021

Publisher

The University of Waikato

Degree

Doctor of Philosophy (PhD)

Supervisor

Pfahringer, Bernhard
Frank, Eibe

A study of self-training variants for semi-supervised image classification

Authors

Files

Permanent Link

Publisher link

Rights

Abstract

Citation

Type

Series name

Date

Publisher

Degree

Type of thesis

Supervisor