ETRI Knowledge Sharing Platform : CitiusSynapse: A Deep Learning Framework for Embedded Systems

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Journal Article CitiusSynapse: A Deep Learning Framework for Embedded Systems

Cited 1 time in scopus

Download 486 time Share share

Abstract: As embedded systems, such as smartphones with limited resources, have become increas-ingly popular, active research has recently been conducted on performing on-device deep learning in such systems. Therefore, in this study, we propose a deep learning framework that is specialized for embedded systems with limited resources, the operation processing structure of which differs from that of standard PCs. The proposed framework supports an OpenCL-based accelerator engine for accelerator deep learning operations in various embedded systems. Moreover, the parallel processing performance of OpenCL is maximized through an OpenCL kernel that is optimized for embedded GPUs, and the structural characteristics of embedded systems, such as unified memory. Furthermore, an on-device optimizer for optimizing the performance in on-device environments, and model con-verters for compatibility with conventional frameworks, are provided. The results of a performance evaluation show that the proposed on-device framework outperformed conventional methods.

KSP Keywords: Conventional methods, Deep learning framework, Limited resources, Parallel Processing, Performance evaluation, Structural characteristic, Unified memory, deep learning(DL), device framework, embedded system, processing performance

This work is distributed under the term of Creative Commons License (CCL)
(CC BY)

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.