ETRI-Knowledge Sharing Platform

Neural Volumetric Video Coding with Hierarchical Coded Representation of Dynamic Volume
Cited 0 times in Scopus
Authors
Ju-Yeon Shin, Jung-Kyung Lee, Gun Bang, Jun-Sik Kim, Je-Won Kang
Issue Date
2025-07
Citation
IEEE Transactions on Multimedia, v.27, pp.4412-4426
ISSN
1520-9210
Publisher
IEEE
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/TMM.2025.3544415
Abstract
This paper proposes a novel multi-view (MV) video coding technique that leverages a four-dimensional (4D) voxel-grid representation to enhance coding efficiency, particularly in novel view synthesis. Although the voxel grid approximation provides a continuous representation for dynamic scenes, its volumetric nature requires substantial storage. The compression of MV videos can be interpreted as the compression of dense features. However, the substantial size of these features poses a significant problem relative to the generation of dynamic scenes at arbitrary viewpoints. To address this challenge, this study introduces a hierarchical coded representation of dynamic volumes based on low-rank tensor decomposition of volumetric features and develops effective coding techniques based on this representation. The proposed method employs a two-level coding strategy to capture the temporal characteristics of the decomposed features. At a higher level, spatial features are encoded, representing 3D structural information, with time-invariant components over short intervals of an MV video sequence. At a lower level, temporal features are encoded to capture the dynamics of current scenes. The spatial features are shared in a group, and temporal features are encoded at each time step. The experimental results demonstrate that the proposed technique outperforms existing MV video coding standards and current state-of-the-art methods, providing superior rate-distortion performance in the novel view synthesis of MV video compression.
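The two-level idea in the abstract (spatial features shared within a group of frames, temporal features coded at every time step) can be illustrated with a rough storage count. The sketch below is only an illustration under stated assumptions, not the paper's method: the abstract does not specify the decomposition, so the rank-R factorization, the function names, and the parameters `rank` and `group_size` are all hypothetical.

```python
# Illustrative sketch only: a generic group-shared low-rank factorization.
# The abstract does not give the actual decomposition; every name and
# parameter here is an assumption made for illustration.

def dense_size(frames, voxels, channels):
    """Values stored if every frame keeps its own full feature grid."""
    return frames * voxels * channels

def hierarchical_size(frames, voxels, channels, rank, group_size):
    """Values stored when each group of frames shares `rank` spatial bases
    and every frame keeps only `rank` temporal coefficients."""
    groups = -(-frames // group_size)            # ceiling division
    spatial = groups * rank * voxels * channels  # higher level: shared per group
    temporal = frames * rank                     # lower level: one set per time step
    return spatial + temporal

def reconstruct_frame(spatial_bases, temporal_coeffs):
    """Rebuild one frame as a weighted sum of the group's spatial bases."""
    n = len(spatial_bases[0])
    return [
        sum(c * basis[i] for c, basis in zip(temporal_coeffs, spatial_bases))
        for i in range(n)
    ]

if __name__ == "__main__":
    print(dense_size(8, 1000, 4))
    print(hierarchical_size(8, 1000, 4, rank=2, group_size=4))
    print(reconstruct_frame([[1, 0, 2], [0, 1, 1]], [2, 3]))
```

In this toy setting the saving comes from paying for the large spatial term once per group instead of once per frame; the per-frame temporal term stays small as long as the assumed rank is small.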
KSP Keywords
Arbitrary Viewpoints, Coding efficiency, Coding strategy, Coding techniques, Current state, Dynamic scene, Dynamic volume, Four-Dimensional (4D), Level coding, Low-rank tensor, Multi-view