In recent video coding standards, the introduction of multiple transform selection (MTS) has significantly improved coding efficiency. The latest standard, Versatile Video Coding (VVC), adopted implicit MTS as an alternative tool to explicit MTS. This tool reduces the encoder complexity by inferring the optimal transform type from decoder-side information instead of performing an intensive rate-distortion search. However, implicit MTS became less efficient in the enhanced compression model (ECM) due to the introduction of new primary transforms to explicit MTS, which drastically increased the number of possible horizontal and vertical transform pairs compared to VVC. This paper presents a new implicit MTS method that leverages a lookup table (LUT) to infer the best transform pair based on the block size and intra-prediction mode. This LUT is used in a hybrid MTS scheme where explicit MTS is enabled only for MIP blocks. The proposed LUT-based hybrid method reduces the ECM complexity by 15% while retaining 60% and 83% of the MTS gain compared to the MTS method of ECM and VVC, respectively.
Low-Complexity Transform Design Using Hybrid Intra MTS
Low-Complexity Transform Design Using Hybrid Intra MTS
Low-Complexity Transform Design Using Hybrid Intra MTS
Related Content
Research Paper /Apr 2024 / Compression, Volumetric Imaging, Machine learning/ Deep learning /Artificial Intelligence
"Learning-based point cloud (PC) compression is a promising research avenue to reduce the transmission and storage costs for PC applications. Existing learning-based methods to compress PCs attributes employ variational autoencoders (VAE) or normalizing flows (NF) to learn compact signal representations. However, VAEs leverage a lower-dimensional bottleneck that…
Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional auto-encoders, channel-adaptive gains for the latent tensor or uniformly quantizing all elements of the latent tensor. This paper …
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain. However, point clou…
Webinar /Jun 2024
Blog Post /Jul 2025
Blog Post /Jun 2025
Blog Post /Jun 2025