Current video coding standards like HEVC, VP9, VVC, AV1, etc., involve partitioning a picture into coding tree units (CTU), typically corresponding to 64x64 or 128x128 picture areas. Each CTU is partitioned into coding blocks following a recursive coding tree. In recently published perceptual video encoding methods, the CTU is used as the spatial unit to assign a QP value in a given picture area. Such an approach fits well with the usual rate distortion optimization used to decide the coding tree representation of a CTU since a constant QP is used inside the CTU. Thus Lagrangian rate distortion optimization works in such a situation. However, for some applications, finer spatial granularity may be desired with an adaptive QP. A perceptual video coding scheme may use a codec agnostic QP allocation process that proceeds on a 16x16 block basis. The issue raised in such a case is that the rate distortion trade-off among split modes no more works with the Lagrangian method. This paper proposes several methods to perform the rate distortion optimization of a coding tree in the situation where multiple QPs may be assigned inside the same CTU. First a theoretical method to solve the problem is described. It consists in a coding tree RD optimization using multiple Lagrange parameters. Then some simpler empirical methods which emulate the theoretical approach are proposed. Experimental results show the benefit of the proposed methods on top of VP9 and HEVC video encoders.
Multiple Constraints Rate Distortion Optimization for a Video Encoder Control
Multiple Constraints Rate Distortion Optimization for a Video Encoder Control
Multiple Constraints Rate Distortion Optimization for a Video Encoder Control
Related Content
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain. However, point clou…
In recent video coding standards, the introduction of multiple transform selection (MTS) has significantly improved coding efficiency. The latest standard, Versatile Video Coding (VVC), adopted implicit MTS as an alternative tool to explicit MTS. This tool reduces the encoder complexity by inferring the optimal transform type from decoder-side information instea…
Representation of 3D scenes is gaining popularity in industry, notably for Virtual Reality, Augmented Reality, and 360° Video. The point cloud format is well suited for such representations. Indeed, point clouds can be created with a simple capture process and modest processing, enabling a real-time, end-to-end point cloud distribution chain. However, point clou…
Webinar /Jun 2024
Blog Post /Jul 2025
Blog Post /Jun 2025
Blog Post /Jun 2025