ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article A Dynamic and Parallel Approach for Repetitive Prime Labeling of XML with MapReduce
Cited 4 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Jinhyun Ahn, Dong-Hyuk Im, Taewhi Lee, Hong-Gee Kim
Issue Date
2017-02
Citation
Journal of Supercomputing, v.73, no.2, pp.810-836
ISSN
0920-8542
Publisher
Springer
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1007/s11227-016-1803-y
Abstract
A massive amount of extensible markup language (XML) data from various areas is available on the Web. Answering structural queries against XML data is important, as it is the core of information retrieval systems for XML data. Labeling scheme has been suggested for rapid query processing of massive XML data. Interval-based, prefix-based, and prime number labeling scheme exist. Of these, the prime number labeling scheme has the advantage of query processing by arithmetic operations. Recently, the repetitive prime number labeling scheme was proposed; this scheme produces a smaller label size than conventional prime number labeling using prime numbers repetitively. However, a parallel algorithm for the repetitive prime number labeling scheme does not exist; therefore, this scheme is difficult to apply to massive XML data. In this paper, a dynamic and parallel approach of XML labeling algorithm that works with MapReduce is proposed for, particularly, the repetitive prime number labeling scheme. Two optimization techniques are devised: the label assignment order adjustment to further reduce the label size and the upper tree compressing technique to reduce the memory requirements during the labeling process. Experiments over real-world XML data confirmed that the techniques are effective than the previous works.
KSP Keywords
Arithmetic operations, Label assignment, Labeling algorithm, Labeling scheme, Optimization techniques, Parallel Algorithm, Parallel approach, Prime number labeling, Query processing, Real-world, XML data