ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Optimizing Memory Scheduling for Efficient Image Patch Inference on Resource-Constrained Devices
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Seungtae Hong, Gunju Park, Jeong-Si Kim
Issue Date
2024-10
Citation
International Conference on Information and Communication Technology Convergence (ICTC) 2024, pp.677-679
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICTC62082.2024.10827245
Abstract
In this paper, we propose an optimized memory scheduling technique for efficient image patch inference on resource-constrained devices. Our approach addresses the inefficiencies in the existing MCUNet v2's memory scheduling algorithm, which maintains buffer tensors longer than necessary, leading to suboptimal memory usage. We designed an algorithm to dynamically adjust memory blocks and retain tensors only for the necessary duration during patch inference. This method significantly reduces the memory footprint, particularly for the maximum input/output tensor. The results demonstrate substantial improvements in memory utilization, enabling more complex models to be deployed on resource-constrained devices. Our findings highlight the potential of this scheduling technique to enhance the feasibility and performance of deep learning applications on MCUs.
KSP Keywords
Complex models, Memory scheduling, Memory utilization, Resource Constrained Devices(RCD), Scheduling algorithm, Scheduling technique, deep learning(DL), image patch, learning application, memory footprint, memory usage