ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Prefetching-Based Metadata Management in Advanced Multitenant Hadoop
Cited 13 time in scopus Download 25 time Share share facebook twitter linkedin kakaostory
Authors
Minh Chau Nguyen, Heesun Won, Siwoon Son, Myeong-Seon Gil, Yang-Sae Moon
Issue Date
2019-02
Citation
Journal of Supercomputing, v.75, no.2, pp.533-553
ISSN
0920-8542
Publisher
Springer
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1007/s11227-017-2019-5
Project Code
16MH1700, Smart Networking Core Technology Development, Sunhee Yang
Abstract
Metadata management is an essential part in Apache Hadoop. Performing optimization of metadata accesses enhances big data storing, processing and analyzing, especially in multitenant environments. Nevertheless, as environmental complexity increases, metadata management is becoming more challenging and costly because of the heavy performance issues. In this paper, we propose a novel approach to improve the performance of metadata management for Hadoop in the multitenant environment based on the prefetching mechanism. We create metadata access graphs based on historical access values, define access patterns and then perform prefetching potential items for the near-future requests to minimize the latency. We present a formal algorithm to apply the prefetching mechanism into the Hadoop system and perform the actual implementation on a recent Hadoop system. Experimental results show that the proposed approach can enable the high performance for metadata management as well as maintain advanced multitenancy features.
KSP Keywords
Access graphs, Access pattern, Apache Hadoop, Big Data, Hadoop system, High performance, Novel approach, metadata management, multitenant environment