ETRI Knowledge Sharing Platform : DISCO – U-Net based Autoencoder Architecture with Dual Input Streams for Skeleton Image Drawing

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper DISCO – U-Net based Autoencoder Architecture with Dual Input Streams for Skeleton Image Drawing

Cited 6 time in scopus

Download 236 time Share share

Citation: International Conference on Computer Vision Workshops (ICCVW) 2021, pp.2128-2135

Abstract: In this paper, we propose a DISCO, which is a manner of designing autoencoder architecture to process dual input streams for skeletal image generation. The DISCO was designed to be dealing with binary masks and skeletonized images concurrently at the input side. We expected the skeletonized images using traditional thinning algorithms could help to boost skeleton prediction performances. Inside the DISCO architecture, there exist two encoders and a single decoder. Each functional block is stacked with multiple logical layers. We designed that logical layer outputs of encoders transferred corresponding counterpart layers in a decoder referring to U-Net architecture. In addition, we proposed hybrid-type encoder models based on the DISCO architecture to capitalize on the effect of the model ensemble. We demonstrated performances of the DISCO-A and DISCO-B models derived from the proposed architecture in terms of f1-score and loss convergence per each epoch. We confirmed the DISCO-B had produced the best performance under symbolic label usage. In the development phase, our best score reached 0.7386 with 500 epochs.

KSP Keywords: Best performance, F1-score, Hybrid-type, Input side, Model ensemble, Skeleton image, Thinning algorithms, development phase, image generation

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.