ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Improving Image Classification Accuracy Through Cluster Optimization in Latent Space
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Soonyong Song, Donghun Lee, Heechul Bae, Chan-Won Park
Issue Date
2024-10
Citation
International Conference on Information and Communication Technology Convergence (ICTC) 2024, pp.1342-1346
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICTC62082.2024.10827620
Abstract
In this paper, we propose a novel image classification method that combines a variational encoder and triplet loss. The method aims to improve classification accuracy by minimizing the variance of clusters in the latent space. The variational encoder transforms data into the latent space to form probabilistic distributions, which represent clusters of the categories we want to classify. Triplet loss reduces the overlap between these clusters, leading to accurate boundary setting and ensuring that samples move toward the cluster centers. This research contributes to improving the performance of image classification tasks by combining the strengths of these two approaches. The proposed method was implemented using PyTorch to create a neural network for experimentation, utilizing the simplest linear layer for proof of concept, and was evaluated on the MNIST dataset. The results show that the proposed method outperforms existing methods by achieving an average accuracy of 98.10%, demonstrating a performance gain of more than 0.2% compared to previous methods.
KSP Keywords
Boundary setting, Classification method, Cluster center, Cluster optimization, Image Classification, Latent space, Linear Layer, MNIST Dataset, Network for Experimentation(NfExp), Performance gain, Probabilistic distributions