"JVET has developed a new Enhanced Compression Model (ECM) for testing future video coding algorithms on top of the Versatile Video Coding (VVC) standard. Reference Picture Resampling (RPR) is a powerful tool that improves video coding efficiency of next generation like Versatile Video Coding (VVC). This feature is well designed...
In recent video coding standards, the introduction of multiple transform selection (MTS) has significantly improved coding efficiency. The latest standard, Versatile Video Coding (VVC), adopted implicit MTS as an alternative tool to explicit MTS. This tool reduces the encoder complexity by inferring the optimal transform type from decoder-side information instead...
RESEARCH PAPER / Apr 2024
/
["Compression",
"Volumetric Imaging",
"Machine learning/ Deep learning /Artificial Intelligence"]
"Learning-based point cloud (PC) compression is a promising research avenue to reduce the transmission and storage costs for PC applications. Existing learning-based methods to compress PCs attributes employ variational autoencoders (VAE) or normalizing flows (NF) to learn compact signal representations. However, VAEs leverage a lower-dimensional bottleneck that limit the maximum...
RESEARCH PAPER / Apr 2024
/
["Compression",
"Video coding",
"Machine learning/ Deep learning /Artificial Intelligence"]
"The last standard Versatile Video Codec (VVC), aims to im- prove the compression efficiency by saving around 50% of bitrate at the same quality compared to its predecessor High Efficiency Video Codec (HEVC). However, this comes with a significant rise in computational complexity due to the new added tools in...
RESEARCH PAPER / Mar 2024
/
["Compression",
"Machine learning/ Deep learning /Artificial Intelligence"]
Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional auto-encoders, channel-adaptive gains for the latent tensor or uniformly quantizing all elements of the latent tensor. This paper follows the traditional...
RESEARCH PAPER / Sep 2021
/
Video coding,
Compression,
Machine learning/ Deep learning /Artificial Intelligence Neural network
Despite many modern applications of Deep Neural Networks (DNNs), the large number of parameters in the hidden layers makes them unattractive for deployment on devices with storage capacity constraints. In this paper we propose a Data-Driven Low-rank (DDLR) method to reduce the number of parameters of pretrained DNNs and expedite...
RESEARCH PAPER / Aug 2021
/
Video coding,
Compression,
Machine learning/ Deep learning /Artificial Intelligence
Deep bi-prediction blending. This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, block-based motion compensation blocks from already decoded reference pictures stand out as the main tool used to predict the current frame. Especially, bi-predicted blocks, i.e. blocks that combine two different...
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain....
This paper describes a light field scalable compression scheme based on the sparsity of the angular Fourier transform of the light field. A subset of sub-aperture images (or views) is compressed using HEVC as a base layer and transmitted to the decoder. An entire light field is reconstructed from this...