ETRI Knowledge Sharing Platform : Attention-Based Bi-Prediction Network for Versatile Video Coding (VVC) over 5G Network

Journal Article Attention-Based Bi-Prediction Network for Versatile Video Coding (VVC) over 5G Network

Cited 5 times in Scopus

Authors: Young-Ju Choi, Young-Woon Lee, Jongho Kim, Se Yoon Jeong, Jin Soo Choi, Byung-Gyu Kim

Issue Date: 2023-03

Citation: Sensors, v.23, no.5, pp.1-19

ISSN: 1424-8220

Publisher: Multidisciplinary Digital Publishing Institute (MDPI)

Language: English

Type: Journal Article

DOI: https://dx.doi.org/10.3390/s23052631

Abstract: As demand grows for network-dependent services such as Internet of Things (IoT) applications, autonomous driving, and augmented and virtual reality (AR/VR), the fifth-generation (5G) network is expected to become a key communication technology. The latest video coding standard, versatile video coding (VVC), can contribute to providing high-quality services by achieving superior compression performance. In video coding, inter bi-prediction significantly improves coding efficiency by producing a precise fused prediction block. Although block-wise methods, such as bi-prediction with CU-level weight (BCW), are applied in VVC, it is still difficult for a linear fusion-based strategy to represent diverse pixel variations inside a block. In addition, a pixel-wise method called bi-directional optical flow (BDOF) has been proposed to refine the bi-prediction block. However, the non-linear optical flow equation in BDOF mode holds only under certain assumptions, so this method is still unable to accurately compensate various kinds of bi-prediction blocks. In this paper, we propose an attention-based bi-prediction network (ABPN) to replace all of the existing bi-prediction methods. The proposed ABPN is designed to learn efficient representations of the fused features by utilizing an attention mechanism. Furthermore, a knowledge distillation (KD)-based approach is employed to compress the size of the proposed network while keeping its output comparable to that of the large model. The proposed ABPN is integrated into the VTM-11.0 NNVC-1.0 standard reference software. Compared with the VTM anchor, the lightweight ABPN achieves a BD-rate reduction of up to 5.89% and 4.91% on the Y component under random access (RA) and low delay B (LDB), respectively.
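To make the limitation of block-wise linear fusion mentioned in the abstract concrete, the following is a minimal sketch of BCW-style weighted bi-prediction. It is an illustration only, not the paper's ABPN and not the VTM implementation: the function name `bcw_fuse` and the list-of-lists block layout are assumptions made here, and the sketch ignores the intermediate bit depth and final clipping that a real codec applies. The weight set and the rounding formula follow the BCW design in VVC, where a single weight `w` is chosen per coding unit.

```python
# Illustrative sketch of BCW-style fusion (hypothetical helper, not VTM code).
# BCW weight candidates in VVC; w is applied to the second (list-1) prediction.
BCW_WEIGHTS = (-2, 3, 4, 5, 10)


def bcw_fuse(p0, p1, w=4):
    """Fuse two prediction blocks with one block-level weight.

    p0, p1: equally sized 2-D lists of integer sample values.
    w:      BCW weight; w=4 is the plain average of the two predictions.
    """
    if w not in BCW_WEIGHTS:
        raise ValueError(f"unsupported BCW weight: {w}")
    # Every sample in the block shares the same weight pair (8 - w, w);
    # the +4 offset rounds before the division by 8 (right shift by 3).
    return [
        [((8 - w) * a + w * b + 4) >> 3 for a, b in zip(row0, row1)]
        for row0, row1 in zip(p0, p1)
    ]


if __name__ == "__main__":
    block0 = [[100, 120], [140, 160]]
    block1 = [[110, 130], [150, 170]]
    print(bcw_fuse(block0, block1))        # equal weights
    print(bcw_fuse(block0, block1, w=5))   # slightly favors block1
```

Because the same weight pair is applied to every sample, this kind of fusion cannot adapt to variations between pixels inside the block, which is the gap the abstract says the proposed network aims to address.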

KSP Keywords: 5G networks, Attention mechanism, Augmented and Virtual Reality, Based Approach, Bi-prediction, Coding efficiency, Compression performance, Flow equation, Fused features, High-quality, Knowledge Distillation

This work is distributed under the terms of the Creative Commons License (CC BY).
