ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 A Dynamic and Parallel Approach for Repetitive Prime Labeling of XML with MapReduce
Cited 4 time in scopus Download 8 time Share share facebook twitter linkedin kakaostory
저자
안진현, 임동혁, 이태휘, 김홍기
발행일
201702
출처
Journal of Supercomputing, v.73 no.2, pp.810-836
ISSN
0920-8542
출판사
Springer
DOI
https://dx.doi.org/10.1007/s11227-016-1803-y
협약과제
16ZS1400, 듀얼모드 배치.쿼리 분석을 제공하는 빅데이터 플랫폼 핵심기술 개발, 원종호
초록
A massive amount of extensible markup language (XML) data from various areas is available on the Web. Answering structural queries against XML data is important, as it is the core of information retrieval systems for XML data. Labeling scheme has been suggested for rapid query processing of massive XML data. Interval-based, prefix-based, and prime number labeling scheme exist. Of these, the prime number labeling scheme has the advantage of query processing by arithmetic operations. Recently, the repetitive prime number labeling scheme was proposed; this scheme produces a smaller label size than conventional prime number labeling using prime numbers repetitively. However, a parallel algorithm for the repetitive prime number labeling scheme does not exist; therefore, this scheme is difficult to apply to massive XML data. In this paper, a dynamic and parallel approach of XML labeling algorithm that works with MapReduce is proposed for, particularly, the repetitive prime number labeling scheme. Two optimization techniques are devised: the label assignment order adjustment to further reduce the label size and the upper tree compressing technique to reduce the memory requirements during the labeling process. Experiments over real-world XML data confirmed that the techniques are effective than the previous works.
KSP 제안 키워드
Arithmetic operations, Extensible markup language(XML), Information Retrieval System, Information retrieval(IR), Label assignment, Labeling algorithm, Labeling scheme, Optimization techniques(OT), Parallel Algorithm, Parallel approach, Prime number labeling