ETRI Knowledge Sharing Platform : Parallel Prime Number Labeling of Large XML Data Using MapReduce

Titles

논문 검색
Type		SCI
Year	~	Keyword

List

Conference Paper Parallel Prime Number Labeling of Large XML Data Using MapReduce

Cited 3 time in scopus

Citation: International Conference on IT Convergence and Security (ICITCS) 2016, pp.1-2

Abstract: Massive XML (Extensible Markup Language) data are available on the web. XML data labeling schemes have been suggested for structural query processing of massive XML data. Notable schemes include interval- based, prefix-based, and prime number-based labeling schemes. Of these, the prime number labeling scheme has the advantage of query processing by simple arithmetic operations. However, a parallel algorithm for this scheme does not exist. The requirement that all parents' labels have to be multiplied to obtain the label of a node makes it difficult to label XML data in a parallel fashion. To address the issue, in this paper, we propose a cluster-based technique wherein all parent nodes for a node are aggregated to compute its label by two-step MapReduce jobs. Our experiments on real-world XML datasets showed the advantages over a single machine-based system.

KSP Keywords: Arithmetic operations, Data Labeling, Labeling scheme, Parallel Algorithm, Prime number labeling, Query processing, Real-world, Two-Step, XML data, based system, cluster-based

218 Gajeong-ro, Yuseong-gu, Daejeon, 34129, KOREA, Contact: sh.kim@etri.re.kr

Please refrain from automatic collection of e-mail addresses posted on this homepage.