A Novel Approach to Improve Rate-Distortion-Complexity in Versatile Video Coding Standard
Abstract
Versatile Video Coding (VVC) achieves up to 30% bitrate reduction at the same quality level compared to its predecessor, High Efficiency Video Coding (HEVC). It supports resolutions from 4K to 16K as well as 360° video. Several new coding tools, such as AFFINE, Integer Motion Vector (IMV), Decoder-side Motion Vector Refinement (DMVR), and Triangle, have been proposed for VVC to improve coding efficiency. However, these tools usually impose high computational complexity on the encoder. In this paper, we propose a new approach to reduce the computational complexity of the Rate-Distortion Optimization (RDO) process in the VVC encoder. First, the effectiveness of each coding tool in various parts of the scene is estimated. Our experiments show that some coding tools, namely AFFINE and IMV, perform much better in borderline Coding Tree Units (CTUs). The proposed approach therefore considers these coding tools in the RDO process only for borderline CTUs, which decreases the computational complexity considerably with little effect on coding performance. Simulation results show that disabling the AFFINE and IMV coding tools in the RDO process of non-borderline CTUs reduces the encoding gain by only 0.88% and 0.72% BD-rate, while reducing the processing time by 11.70% and 63.91%, respectively. Second, the correlation between the various coding tools is investigated. Our simulation results show that the AFFINE and Triangle coding tools are highly correlated. Hence, if the encoder decides to disable the AFFINE coding tool during the RDO process, the Triangle coding tool can also be considered disabled without running the rate-distortion check for it. This reduces the computational complexity by 4.96% on average without considerably affecting the encoding gain.
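The two decision rules described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not use any VTM API: the function names, the tool-name strings, the `is_borderline` flag (how a CTU is classified as borderline is not specified here), and the `rd_check` callback are all hypothetical. The sketch also assumes that Triangle is skipped only when AFFINE was actually evaluated and rejected, so the two rules stay independent.

```python
from typing import Callable, Dict, List, Sequence, Tuple

# Tools that, per the abstract, are only worth testing in borderline CTUs.
BORDER_ONLY_TOOLS = frozenset({"AFFINE", "IMV"})


def tools_for_ctu(tools: Sequence[str], is_borderline: bool) -> List[str]:
    """Rule 1: drop AFFINE and IMV from the RDO candidates of non-borderline CTUs."""
    return [t for t in tools if is_borderline or t not in BORDER_ONLY_TOOLS]


def decide_tools(
    tools: Sequence[str],
    is_borderline: bool,
    rd_check: Callable[[str], bool],  # hypothetical: True if the tool wins the RD test
) -> Tuple[Dict[str, bool], List[str]]:
    """Return (enabled flag per tool, list of tools whose RD test was actually run)."""
    decisions = {t: False for t in tools}
    evaluated: List[str] = []
    # Stable sort that moves TRIANGLE to the end, so AFFINE is decided first.
    ordered = sorted(tools_for_ctu(tools, is_borderline), key=lambda t: t == "TRIANGLE")
    for tool in ordered:
        # Rule 2: if AFFINE was tested and rejected, skip the RD test for TRIANGLE.
        if tool == "TRIANGLE" and "AFFINE" in evaluated and not decisions["AFFINE"]:
            continue
        decisions[tool] = rd_check(tool)
        evaluated.append(tool)
    return decisions, evaluated


if __name__ == "__main__":
    tools = ["TRIANGLE", "AFFINE", "IMV", "DMVR"]
    # Toy RD outcome (assumption): every tool except AFFINE wins its RD test.
    print(decide_tools(tools, True, lambda t: t != "AFFINE"))
    print(decide_tools(tools, False, lambda t: t != "AFFINE"))
```

Each RD test that is skipped stands for encoder work saved; in this toy setup a borderline CTU that rejects AFFINE never runs the Triangle test, and a non-borderline CTU never runs the AFFINE or IMV tests.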
Keywords
Versatile Video Coding standard (VVC), AFFINE coding tool, DMVR coding tool, GBI coding tool, BIO coding tool, Triangle coding tool, IMV coding tool