Deep bi-prediction blending. This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, block-based motion compensation blocks from already decoded reference pictures stand out as the main tool used to predict the current frame. Especially, bi-predicted blocks, i.e. blocks that combine two different motion compensated prediction blocks, greatly improve the final temporal prediction accuracy by averaging together the 2 predictions. In recent codecs generation such as VVC, the blending process has been improved, for example by performing weighted blending or refining the predicted block by adding a correction offset derived from the two blocks' optical flow. In this context, we introduce a simple neural network that further improves the blending operation. A complexity balance, both in terms of network size and encoder mode selection, is carried out. Extensive tests on top of the recently standardized VVC codec are performed and show a BD-rate improvement of -1.4% in random access configuration, for a network size of about 10k parameters. We also propose a simple CPU-based implementation and network quantization to assess the complexity/gains tradeoff in a conventional codec framework.
Deep bi-prediction blending
Deep bi-prediction blending
Deep bi-prediction blending
Research Paper / Aug 2021 / Video coding, Compression, Machine learning/ Deep learning /Artificial Intelligence
Related Content
Research Paper /Feb 2024 / Wireless communication, 5G, Machine learning/ Deep learning /Artificial Intelligence
The ubiquitous deployment of 4G/5G technology has made it a critical infrastructure for society that will facilitate the delivery and adoption of emerging applications and use cases (extended reality, automation, robotics, to name but a few). These new applications require high throughput and low latency in both uplink and downlink for optimal performance, while…
Research Paper /Apr 2024 / Compression, Volumetric Imaging, Machine learning/ Deep learning /Artificial Intelligence
"Learning-based point cloud (PC) compression is a promising research avenue to reduce the transmission and storage costs for PC applications. Existing learning-based methods to compress PCs attributes employ variational autoencoders (VAE) or normalizing flows (NF) to learn compact signal representations. However, VAEs leverage a lower-dimensional bottleneck that…
Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional auto-encoders, channel-adaptive gains for the latent tensor or uniformly quantizing all elements of the latent tensor. This paper …
Webinar /Jun 2024
Blog Post /Jun 2025
Blog Post /Jun 2025