Abstract:
Entropy coding is a fundamental technology in video coding that removes statistical redundancy among syntax elements. In high efficiency video coding (HEVC), context-adap...Show MoreMetadata
Abstract:
Entropy coding is a fundamental technology in video coding that removes statistical redundancy among syntax elements. In high efficiency video coding (HEVC), context-adaptive binary arithmetic coding (CABAC) is adopted as the primary entropy coding method. The CABAC consists of three steps: binarization, context modeling, and binary arithmetic coding. As the binarization processes and context models are both manually designed in CABAC, the probability of the syntax elements may not be estimated accurately, which restricts the coding efficiency of CABAC. To address the problem, we propose a convolutional neural network-based arithmetic coding (CNNAC) method and apply it to compress the syntax elements of the intra-predicted residues in HEVC. Instead of manually designing the binarization processes and context models, we propose directly estimating the probability distribution of the syntax elements with a convolutional neural network (CNN), as CNNs can adaptively build complex relationships between inputs and outputs by training with a lot of data. Then, the values of the syntax elements, together with their estimated probability distributions, are fed into a multi-level arithmetic codec to perform entropy coding. In this paper, we have utilized the CNNAC to code the syntax elements of the DC coefficient; the lowest frequency AC coefficient; the second, third, fourth, and fifth lowest frequency AC coefficients; and the position of the last non-zero coefficient in the HEVC intra-predicted residues. The experimental results show that our proposed method achieves up to 6.7% BD-rate reduction and an average of 4.7% BD-rate reduction compared to the HEVC anchor under all intra (AI) configuration.
Published in: IEEE Transactions on Circuits and Systems for Video Technology ( Volume: 30, Issue: 7, July 2020)
Funding Agency:
Citations are not available for this document.
Cites in Papers - |
Cites in Papers - IEEE (21)
Select All
1.
Bolin Chen, Zhao Wang, Binzhe Li, Shurun Wang, Shiqi Wang, Yan Ye, "Interactive Face Video Coding: A Generative Compression Framework", IEEE Transactions on Image Processing, vol.34, pp.2910-2925, 2025.
2.
Junbin Zhuang, Yan Zheng, Baolong Guo, Yunyi Yan, "Globally Deformable Information Selection Transformer for Underwater Image Enhancement", IEEE Transactions on Circuits and Systems for Video Technology, vol.35, no.1, pp.19-32, 2025.
3.
Ayad M. Dalloo, Amjad Jaleel Humaidi, Ammar K. Al Mhdawi, Hamed Al-Raweshidy, "Approximate Computing: Concepts, Architectures, Challenges, Applications, and Future Directions", IEEE Access, vol.12, pp.146022-146088, 2024.
4.
Michael Schäfer, Jonathan Pfaff, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, "Nonlinear Transform Coding for VVC Intra Coding", 2024 Picture Coding Symposium (PCS), pp.1-5, 2024.
5.
Yixuan Li, Bolin Chen, Baoliang Chen, Meng Wang, Shiqi Wang, Weisi Lin, "Perceptual Quality Assessment of Face Video Compression: A Benchmark and An Effective Method", IEEE Transactions on Multimedia, vol.26, pp.8596-8608, 2024.
6.
Helen. K. Joy, Manjunath. R Kounte, "Deep CNN Based Interpolation Filter for High Efficiency Video Coding", 2024 2nd International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT), pp.519-524, 2024.
7.
Xiandong Meng, Shuyuan Zhu, Siwei Ma, Bing Zeng, "Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation", 2023 IEEE International Conference on Image Processing (ICIP), pp.1640-1644, 2023.
8.
Hengyu Man, Xiaopeng Fan, Ruiqin Xiong, Debin Zhao, "Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding", IEEE Transactions on Image Processing, vol.32, pp.3493-3506, 2023.
9.
Bolin Chen, Zhao Wang, Binzhe Li, Shiqi Wang, Yan Ye, "Compact Temporal Trajectory Representation for Talking Face Video Compression", IEEE Transactions on Circuits and Systems for Video Technology, vol.33, no.11, pp.7009-7023, 2023.
10.
Yunhao Mao, Meng Wang, Zhangkai Ni, Shiqi Wang, Sam Kwong, "Neural Network Based Rate Control for Versatile Video Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol.33, no.10, pp.6072-6085, 2023.
11.
Jing Zhang, Yonghong Hou, Bo Peng, Zhaoqing Pan, Ge Li, "Global-Context Aggregated Intra Prediction Network for Depth Video Coding", IEEE Transactions on Circuits and Systems II: Express Briefs, vol.70, no.8, pp.3159-3163, 2023.
12.
Chao Liu, Heming Sun, Jiro Katto, Xiaoyang Zeng, Yibo Fan, "QA-Filter: A QP-Adaptive Convolutional Neural Network Filter for Video Coding", IEEE Transactions on Image Processing, vol.31, pp.3032-3045, 2022.
13.
Ge Li, Jianjun Lei, Zhaoqing Pan, Bo Peng, Nam Ling, "Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol.32, no.9, pp.6337-6346, 2022.
14.
Hajar Maseeh Yasin, Siddeeq Yosef Ameen, "Review and Evaluation of End-to-End Video Compression with Deep-Learning", 2021 International Conference of Modern Trends in Information and Communication Technology Industry (MTICTI), pp.1-8, 2021.
15.
Wen Gao, Siwei Ma, Lingyu Duan, Yonghong Tian, Peiyin Xing, Yaowei Wang, Shanshe Wang, Huizhu Jia, Tiejun Huang, "Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol.31, no.11, pp.4147-4161, 2021.
16.
Han Zhang, Li Song, Yan Huang, Rong Xie, "Current Frame Priors Assisted Neural Network for Intra Prediction", IEEE Access, vol.9, pp.112359-112371, 2021.
17.
Xiandong Meng, Xuan Deng, Shuyuan Zhu, Xinfeng Zhang, Bing Zeng, "A Robust Quality Enhancement Method Based on Joint Spatial-Temporal Priors for Video Coding", IEEE Transactions on Circuits and Systems for Video Technology, vol.31, no.6, pp.2401-2414, 2021.
18.
Han Zhang, Li Song, Li Li, Zhu Li, Xiaokang Yang, "Compression Priors Assisted Convolutional Neural Network for Fractional Interpolation", IEEE Transactions on Circuits and Systems for Video Technology, vol.31, no.5, pp.1953-1967, 2021.
19.
Changyue Ma, Dong Liu, Li Li, Yao Wang, Feng Wu, "Convolutional Neural Network-Based Coefficients Prediction for HEVC Intra-Predicted Residues", 2020 Data Compression Conference (DCC), pp.183-192, 2020.
20.
Boyang Chen, Kai Liu, Evgeny Belyaev, "An Efficient Hardware Implementation of Multialphabet Adaptive Arithmetic Encoder Based on Generalized Virtual Sliding Window", IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.28, no.5, pp.1326-1330, 2020.
21.
Dong Liu, Zhenzhong Chen, Shan Liu, Feng Wu, "Deep Learning-Based Technology in Responses to the Joint Call for Proposals on Video Compression With Capability Beyond HEVC", IEEE Transactions on Circuits and Systems for Video Technology, vol.30, no.5, pp.1267-1280, 2020.
Cites in Papers - Other Publishers (2)
1.
Partha Das, Sezer Karaoglu, Theo Gevers, "Intrinsic image decomposition using physics-based cues and CNNs", Computer Vision and Image Understanding, vol.223, pp.103538, 2022.
2.
Hyung-Hwa Ko, "Enhanced Binary MQ Arithmetic Coder with Look-Up Table", Information, vol.12, no.4, pp.143, 2021.