ETRI Knowledge Sharing Platform

ACLTuner: A Profiling-Driven Fast Tuning to Optimized Deep Learning Inference
Authors
Yongin Kwon, Joo Hyoung Cha, Jubin Lee, Misun Yu, Jeman Park, Jemin Lee
Issue Date
2023-12
Citation
Conference on Neural Information Processing Systems (NeurIPS) 2023 Workshop, pp. 1-12
Language
English
Type
Conference Paper
Abstract
Deep learning has expanded its footprint across diverse domains. The performance of these computations hinges on the interplay between deep learning compilers and inference libraries. While compilers adapt readily to new deep learning operations or models, their tuning processes are too time-consuming. In contrast, inference libraries offer quick execution but limited adaptability. To address these challenges, we propose ACLTuner, which optimizes execution configurations using existing inference library kernels. ACLTuner identifies and assigns the optimal kernel through targeted device profiling. Compared to ArmNN, AutoTVM, Ansor, ONNXRuntime, and TFLite, ACLTuner not only achieves up to 2.0x faster execution time across seven deep learning models, but also reduces the average tuning time by 95%.
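
The core idea stated in the abstract, profiling a fixed set of existing library kernels on the target device and assigning the fastest one, can be illustrated with a minimal sketch. Everything below (the candidate-kernel dictionary, the function names, the stand-in workloads) is a hypothetical illustration under assumed names, not ACLTuner's actual code or the Arm Compute Library API:

```python
import time

def profile_kernel(kernel, inputs, warmup=3, repeats=10):
    """Measure the average execution time of one candidate kernel.

    Warmup runs are discarded so caches and lazy initialization
    do not distort the timing.
    """
    for _ in range(warmup):
        kernel(*inputs)
    start = time.perf_counter()
    for _ in range(repeats):
        kernel(*inputs)
    return (time.perf_counter() - start) / repeats

def select_best_kernel(candidate_kernels, inputs):
    """Pick the fastest configuration from a small, fixed set of
    library-provided kernels, rather than searching the huge
    configuration space a compiler auto-tuner would explore."""
    timings = {name: profile_kernel(k, inputs)
               for name, k in candidate_kernels.items()}
    best = min(timings, key=timings.get)
    return best, timings[best]

if __name__ == "__main__":
    # Stand-in workloads; in practice these would be, e.g., different
    # convolution algorithms an inference library exposes for one layer.
    candidates = {
        "gemm_direct": lambda x: [v * 2.0 for v in x],
        "gemm_winograd": lambda x: [v + v for v in x],
    }
    best, t = select_best_kernel(candidates, ([1.0] * 1024,))
    print(f"best kernel: {best}, avg time: {t * 1e6:.1f} us")
```

Because the candidate set is small and fixed, this style of targeted profiling finishes in seconds, which is consistent with the abstract's claim of a 95% reduction in average tuning time relative to search-based compiler tuners.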
KSP Keywords
deep learning (DL), deep learning models, execution time, optimal kernel, tuning time