ETRI-Knowledge Sharing Plaform



논문 검색
구분 SCI
연도 ~ 키워드


학술대회 Development and Evaluation of High-Density Multi-GPU Sub-System Using PCI Express Expansion Hardware
Cited 3 time in scopus Download 1 time Share share facebook twitter linkedin kakaostory
김영우, 오명훈, 박찬열
International Conference on Consumer Electronics (ICCE) 2020 : Asia, pp.244-247
In this paper, we develop and evaluate a high-density multi-GPU hardware sub-system for high performance computing and deep learning. The high-density multi-GPU hardware is implemented as an out-of-box hardware by extending PCI Express system bus via multi-Gbps class cable assemblies from a server. The high-density multi-GPU hardware extends the PCI Express in a multiple-ways and provides multiple GPUs for a server. The high-density multi-GPU hardware is developed to fit into 21 inches OCP Open Rack Vl and V2 rack and chassis having density of 12 GPUs in 4OU (Open rack Unit) height. The HPL and deep learning applications are tested to evaluate the performance of the developed high-density multi-GPU hardware. The initial test result of HPL is shown that the maximum performance of 36.22 TFLOPS (efficiency of 63.4%) with 12 NVIDIA P100 on triple sever nodes. For deep learning application, the resnet50 result of 3, 187.2 images/sec is obtained with 12 NVIDIA P100 on single sever node with synthetic data. The experimental results show that the developed multi-GPU hardware sub-system exhibits relatively good scalability in a single node especially deep learning applications than the HPL.
HPL, Multi-GPU, OCP, Open Rack, PCI Express Expansion, Resnet50
KSP 제안 키워드
Deep learning application, Development and Evaluation, GPU hardware, High Performance Computing, High-density, Multi-GPU, Multiple GPUs, PCI-Express(PCIe), Sub-system, Synthetic data, System bus