dc.contributor.advisor | Jenssen, Robert | |
dc.contributor.advisor | Kampffmeyer, Michael C. | |
dc.contributor.author | Trosten, Daniel Johansen | |
dc.date.accessioned | 2019-07-03T12:57:35Z | |
dc.date.available | 2019-07-03T12:57:35Z | |
dc.date.issued | 2019-05-31 | |
dc.description.abstract | Deep image clustering is a rapidly growing branch of machine learning and computer vision, in which deep neural networks are trained to discover groups within a set of images, in an unsupervised manner. Deep neural networks have proven to be immensely successful in several machine learning tasks, but the majority of these advances have been in supervised settings. The process of labeling data for supervised applications can be extremely time-consuming, or even completely infeasible in many domains. This has led researchers to shift their focus towards the deep clustering field. However, this field is still in its infancy, meaning that it includes several open research questions, regarding e.g. the design and optimization of the algorithms, the discovery of meaningful clusters, and the initialization of model parameters.
In an attempt to address some of these open questions, a new algorithm for deep image clustering is developed in this thesis. The proposed Deep Tensor Kernel Clustering (DTKC) consists of a convolutional neural network (CNN), which is trained to reflect a common cluster structure at the output of all its intermediate layers. Encouraging a consistent cluster structure throughout the network has the potential to guide it towards meaningful clusters, even though these clusters might appear to be nonlinear in the input space. The cluster structure is enforced through the idea of companion objectives, where separate loss functions are attached to each of the layers in the network. These companion objectives are constructed based on a proposed generalization of the Cauchy-Schwarz (CS) divergence, from vectors to tensors of arbitrary rank. Generalizing the CS divergence to tensor-valued data is a crucial step, due to the tensorial nature of the intermediate representations in the CNN. Furthermore, an alternate initialization strategy based on self-supervised learning, is also employed. To the author's best knowledge, this is the first attempt at using this particular self-supervised learning approach to initialize a deep clustering algorithm.
Several experiments are conducted to thoroughly assess the performance of the proposed DTKC model, with and without self-supervised pre-training. The results show that the models outperform, or perform comparable to, a wide range of benchmark algorithms from the literature. | en_US |
dc.identifier.uri | https://hdl.handle.net/10037/15661 | |
dc.language.iso | eng | en_US |
dc.publisher | UiT Norges arktiske universitet | en_US |
dc.publisher | UiT The Arctic University of Norway | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2019 The Author(s) | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0 | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | en_US |
dc.subject.courseID | FYS-3941 | |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Anvendt matematikk: 413 | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Mathematics: 410::Applied mathematics: 413 | en_US |
dc.subject | VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412 | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412 | en_US |
dc.title | Deep Image Clustering with Tensor Kernels and Unsupervised Companion Objectives | en_US |
dc.type | Master thesis | en_US |
dc.type | Mastergradsoppgave | en_US |