ETRI Knowledge Sharing Platform

Accelerating Training of DNN in Distributed Machine Learning System with Shared Memory
Authors: Eun-Ji Lim, Shin-Young Ahn, Wan Choi
Issue Date: 2017-10
Citation: International Conference on Information and Communication Technology Convergence (ICTC) 2017, pp. 1210-1213
Language: English
Type: Conference Paper
DOI: https://dx.doi.org/10.1109/ICTC.2017.8190900
Project Code: 17HS1900, Development of HPC System for Accelerating Large-scale Deep Learning, Choi Wan
Abstract
In distributed DNN training, the speed of reading and updating model parameters strongly affects training time. In this paper, we investigate the performance of deep neural network training with parameter sharing based on shared memory for distributed machine learning. We propose a shared-memory-based modification of a deep learning framework in which remote shared memory maintains the global shared parameters of the parallel deep learning workers. Our framework accelerates DNN training by speeding up the parameter sharing performed in every iteration of distributed model training. We evaluated the proposed framework by training three different deep learning models. The experimental results show that our framework reduces training time for deep learning models in a distributed system.
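The sketch below illustrates the general parameter-sharing pattern the abstract describes: each worker reads the global parameters at the start of an iteration, computes its local update, and writes the result back to a shared region. It is a minimal single-node analogue only; the paper's framework uses remote shared memory across distributed nodes, and the model size, gradient computation, and locking scheme here are hypothetical stand-ins, not the authors' implementation.

```python
# Minimal sketch of shared-memory parameter sharing (illustrative only).
# A local multiprocessing.shared_memory segment stands in for the paper's
# remote shared memory; the "gradient" is random noise, not a real model.
import numpy as np
from multiprocessing import Process, Lock, shared_memory

PARAM_COUNT = 1024      # hypothetical flattened parameter vector size
ITERATIONS = 100
LEARNING_RATE = 0.01


def worker(shm_name, lock, seed):
    """One training worker: read shared parameters, update, write back."""
    shm = shared_memory.SharedMemory(name=shm_name)
    params = np.ndarray((PARAM_COUNT,), dtype=np.float32, buffer=shm.buf)
    rng = np.random.default_rng(seed)
    for _ in range(ITERATIONS):
        # Stand-in for a gradient computed on this worker's mini-batch.
        fake_grad = rng.standard_normal(PARAM_COUNT).astype(np.float32)
        with lock:
            # Per-iteration parameter sharing: apply the local update
            # directly to the globally shared parameter vector.
            params -= LEARNING_RATE * fake_grad
    shm.close()


if __name__ == "__main__":
    # Create the shared parameter region and initialize it.
    shm = shared_memory.SharedMemory(create=True, size=PARAM_COUNT * 4)
    global_params = np.ndarray((PARAM_COUNT,), dtype=np.float32, buffer=shm.buf)
    global_params[:] = 0.0
    lock = Lock()

    workers = [Process(target=worker, args=(shm.name, lock, s)) for s in range(4)]
    for p in workers:
        p.start()
    for p in workers:
        p.join()

    print("final parameter norm:", float(np.linalg.norm(global_params)))
    shm.close()
    shm.unlink()
```

In this toy version every worker updates the same parameter buffer under a lock; the point of the paper is that placing those shared parameters in (remote) shared memory makes this read/update step in each iteration faster than exchanging them over a conventional parameter-server network path.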
KSP Keywords
Deep learning framework, Deep neural network (DNN), Distributed system (DS), Distributed machine learning, Experiment results, Machine learning system, Memory-based, Model parameter, Neural network training, Shared memory, Shared parameters