ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 VPC: Scalable, Low Downtime Checkpointing for Virtual Clusters
Cited 6 time in scopus Download 0 time Share share facebook twitter linkedin kakaostory
저자
Peng Lu, Ravindran, 김창수
발행일
201210
출처
International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) 2012, pp.203-210
DOI
https://dx.doi.org/10.1109/SBAC-PAD.2012.31
협약과제
12VS1400, 차세대 메모리 기반의 빅데이터 분석 관리 소프트웨어 원천기술 개발, 허성진
초록
A virtual cluster (VC) consists of multiple virtual machines (VMs) running on different physical hosts, inter-connected by a virtual network. A fault-tolerant protocol and mechanism are essential to the VC's availability and usability. We present Virtual Predict Check pointing (or VPC), a lightweight, globally consistent check pointing mechanism, which checkpoints the VC for immediate restoration after VM failures. By predicting the checkpoint-caused page faults during each check pointing interval, VPC further reduces the solo VM downtime than traditional incremental check pointing approaches. Besides, VPC uses a globally consistent check-pointing algorithm, which preserves the global consistency of the VMs' execution and communication states, and only saves the updated memory pages during each check pointing interval to reduce the entire VC downtime. Our implementation reveals that, compared with past VC check pointing/migration solutions including VNsnap, VPC reduces the solo VM downtime by as much as 45%, under the NPB benchmark, and reduces the entire VC downtime by as much as 50%, under the NPB distributed program. Additionally, VPC incurs a memory overhead of no more than 9%. In all cases, VPC's performance overhead is less than 16%. © 2012 IEEE.
KSP 제안 키워드
As 4, Check-pointing, Fault-tolerant, Global consistency, Memory overhead, Performance Overhead, VM failures, Virtual Network, Virtual cluster, Virtual machine(VM)