ETRI Knowledge Sharing Platform : Evaluating the Performance of Deep Learning Inference Service on Edge Platform

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper Evaluating the Performance of Deep Learning Inference Service on Edge Platform

Cited 0 time in scopus

Authors: Hyun-Hwa Choi, Jae-Geun Cha, Seung-Hyun Yun, Dae Won Kim, Sumin Jang, Sun Wook Kim

Citation: International Conference on Information and Communication Technology Convergence (ICTC) 2021, pp.1789-1793

Abstract: Deep learning inference requires tremendous amount of computation and typically is offloaded the cloud for execution. Recently, edge computing, which processes and stores data at the edge of the Internet closest to the mobile devices or sensors, has been considered as new computing paradigm. We have studied the performance of the deep neural network (DNN) inference service based on different configurations of resources assigned to a container. In this work, we measured and analyzed a real-world edge service on containerization platform. An edge service is named A!Eye, an application with various DNN inferences. The edge service has both CPU-friendly and GPU-friendly tasks. CPU tasks account for more than half of the latency of the edge service. Our analyses reveal interesting findings about running the DNN inference service on the container-based execution platform; (a) The latency of DNN inference-based edge services is affected by CPU-based operation performance. (b) Pinning CPUs can reduce the latency of an edge service. (c) In order to improve the performance of an edge service, it is very important to avoid PCIe bottleneck shared by resources like CPUs, GPUs and NICs.

KSP Keywords: Deep neural network(DNN), Edge Computing, Edge services, Mobile devices, Operation performance, Real-world, deep learning(DL), neural network(NN)

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.