ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article MapReduce-Based D_ELT Framework to Address the Challenges of Geospatial Big Data
Cited 9 time in scopus Download 158 time Share share facebook twitter linkedin kakaostory
Authors
Junghee Jo, Kang-Woo Lee
Issue Date
2019-11
Citation
ISPRS International Journal of Geo-Information, v.8, no.11, pp.1-15
ISSN
2220-9964
Publisher
MDPI
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.3390/ijgi8110475
Abstract
The conventional extracting-transforming-loading (ETL) system is typically operated on a single machine not capable of handling huge volumes of geospatial big data. To deal with the considerable amount of big data in the ETL process, we propose D-ELT (delayed extracting-loading-transforming) by utilizing MapReduce-based parallelization. Among various kinds of big data, we concentrate on geospatial big data generated via sensors using Internet of Things (IoT) technology. In the IoT environment, update latency for sensor big data is typically short and old data are not worth further analysis, so the speed of data preparation is even more significant. We conducted several experiments measuring the overall performance of D-ELT and compared it with both traditional ETL and extracting-loading- transforming (ELT) systems, using different sizes of data and complexity levels for analysis. The experimental results show that D-ELT outperforms the other two approaches, ETL and ELT. In addition, the larger the amount of data or the higher the complexity of the analysis, the greater the parallelization effect of transform in D-ELT, leading to better performance over the traditional ETL and ELT approaches.
KSP Keywords
Different sizes, ETL process, Geospatial big data, Internet of thing(IoT), IoT environment, Overall performance, Sensor Big Data, data preparation, single machine
This work is distributed under the term of Creative Commons License (CCL)
(CC BY)
CC BY