NN-based decision predictor of Reference Picture Resampling for video coding




NN-based decision predictor of Reference Picture Resampling for video coding

NN-based decision predictor of Reference Picture Resampling for video coding
Research Paper / SPIE. Optics+Photonics / Jul 2024 / ["Video coding", "Streaming/OTT", "Compression"]

"JVET has developed a new Enhanced Compression Model (ECM) for testing future video coding algorithms on top of the Versatile Video Coding (VVC) standard. Reference Picture Resampling (RPR) is a powerful tool that improves video coding efficiency of next generation like Versatile Video Coding (VVC). This feature is well designed to support frame changing resolution without inserting intra refresh picture. Video streaming and low delay scenarios can take advantage of RPR to ensure a smooth frame-based bit-rate adaptation, compared to traditional techniques that can generates bitrate leaps. In this paper, a neural network regressor to predict RPR decision is discussed, and adaptation of downscaling factor is proposed to improve VVC coding efficiency in the context of random access and all intra modes configurations. "