ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper DISCO – U-Net based Autoencoder Architecture with Dual Input Streams for Skeleton Image Drawing
Cited 6 time in scopus Download 205 time Share share facebook twitter linkedin kakaostory
Authors
Soonyong Song, Heechul Bae, Junhee Park
Issue Date
2021-10
Citation
International Conference on Computer Vision Workshops (ICCVW) 2021, pp.2128-2135
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICCVW54120.2021.00241
Abstract
In this paper, we propose a DISCO, which is a manner of designing autoencoder architecture to process dual input streams for skeletal image generation. The DISCO was designed to be dealing with binary masks and skeletonized images concurrently at the input side. We expected the skeletonized images using traditional thinning algorithms could help to boost skeleton prediction performances. Inside the DISCO architecture, there exist two encoders and a single decoder. Each functional block is stacked with multiple logical layers. We designed that logical layer outputs of encoders transferred corresponding counterpart layers in a decoder referring to U-Net architecture. In addition, we proposed hybrid-type encoder models based on the DISCO architecture to capitalize on the effect of the model ensemble. We demonstrated performances of the DISCO-A and DISCO-B models derived from the proposed architecture in terms of f1-score and loss convergence per each epoch. We confirmed the DISCO-B had produced the best performance under symbolic label usage. In the development phase, our best score reached 0.7386 with 500 epochs.
KSP Keywords
Best performance, F1-score, Hybrid-type, Input side, Model ensemble, Skeleton image, Thinning algorithms, development phase, image generation