ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Attention-Based Bi-Prediction Network for Versatile Video Coding (VVC) over 5G Network
Cited 4 time in scopus Download 80 time Share share facebook twitter linkedin kakaostory
Authors
Young-Ju Choi, Young-Woon Lee, Jongho Kim, Se Yoon Jeong, Jin Soo Choi, Byung-Gyu Kim
Issue Date
2023-03
Citation
Sensors, v.23, no.5, pp.1-19
ISSN
1424-8220
Publisher
Multidisciplinary Digital Publishing Institute (MDPI)
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.3390/s23052631
Abstract
As the demands of various network-dependent services such as Internet of things (IoT) applications, autonomous driving, and augmented and virtual reality (AR/VR) increase, the fifthgeneration (5G) network is expected to become a key communication technology. The latest video coding standard, versatile video coding (VVC), can contribute to providing high-quality services by achieving superior compression performance. In video coding, inter bi-prediction serves to improve the coding efficiency significantly by producing a precise fused prediction block. Although block-wise methods, such as bi-prediction with CU-level weight (BCW), are applied in VVC, it is still difficult for the linear fusion-based strategy to represent diverse pixel variations inside a block. In addition, a pixel-wise method called bi-directional optical flow (BDOF) has been proposed to refine bi-prediction block. However, the non-linear optical flow equation in BDOF mode is applied under assumptions, so this method is still unable to accurately compensate various kinds of bi-prediction blocks. In this paper, we propose an attention-based bi-prediction network (ABPN) to substitute for the whole existing bi-prediction methods. The proposed ABPN is designed to learn efficient representations of the fused features by utilizing an attention mechanism. Furthermore, the knowledge distillation (KD)- based approach is employed to compress the size of the proposed network while keeping comparable output as the large model. The proposed ABPN is integrated into the VTM-11.0 NNVC-1.0 standard reference software. When compared with VTM anchor, it is verified that the BD-rate reduction of the lightweighted ABPN can be up to 5.89% and 4.91% on Y component under random access (RA) and low delay B (LDB), respectively.
KSP Keywords
5G networks, Attention mechanism, Augmented and Virtual Reality, Based Approach, Bi-prediction, Coding efficiency, Compression performance, Flow equation, Fused features, High-quality, Knowledge Distillation
This work is distributed under the term of Creative Commons License (CCL)
(CC BY)
CC BY