ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Exploring GPU sharing techniques for edge AI smart city applications
Cited 1 time in scopus Download 180 time Share share facebook twitter linkedin kakaostory
Authors
Sooyeon Woo, Jihwan Yeo, Jinhong Kim, Kyungwoon Lee
Issue Date
2025-10
Citation
ETRI Journal, v.47, no.5, pp.855-864
ISSN
1225-6463
Publisher
한국전자통신연구원
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.4218/etrij.2025-0065
Abstract
The growing adoption of edge AI in smart city applications such as traffic management, surveillance, and environmental monitoring necessitates efficient computational strategies to satisfy the requirements for low latency and high accuracy. This study investigated GPU sharing techniques to improve resource utilization and throughput when running multiple AI applications simultaneously on edge devices. Using the NVIDIA Jetson AGX Orin platform and object detection workloads with the YOLOv8 model, we explored the performance tradeoffs of the threading and multiprocessing approaches. Our findings reveal distinct advantages and limitations. Threading minimizes memory usage by sharing CUDA contexts, whereas multiprocessing achieves higher GPU utilization and shorter inference times by leveraging independent CUDA contexts. However, scalability challenges arise from resource contention and synchronization overheads. This study provides insights into optimizing GPU sharing for edge AI applications, highlighting key tradeoffs and opportunities for enhancing performance in resource-constrained environments.
KSP Keywords
AI Applications, Computational strategies, Edge devices, Environmental Monitoring, GPU utilization, High accuracy, Low latency, Performance tradeoffs, Resource utilization, Resource-Constrained Environments, Traffic management
This work is distributed under the term of Korea Open Government License (KOGL)
(Type 4: : Type 1 + Commercial Use Prohibition+Change Prohibition)
Type 4: