I. Introduction
The share of video on the Internet traffic is expected to reach 82% by 2022 [1], which raises the need for highly efficient video codecs. The compression efficiency of HEVC has been improved greatly with the adoption of numerous advanced tools [2]. HEVC introduces a more flexible quad-tree coding block partitioning structure and uses the Coding Units (CUs) and Prediction Units (PUs) concept to manage the partition [3]. In HEVC, each video frame is divided into non-overlapping predefined size Coding Tree Units (CTUs), and each CTU is then recursively divided into CUs. CUs can be of various sizes, namely, 64×64, 32×32, 16×16 which and 8×8 pixels, correspond to CU depth levels 0, 1, 2 and 3, respectively. A CU can be further partitioned into PUs and Transform Units (TUs). The partitioning is recursive within a quad-tree hierarchy. PU is obtained by partitioning a CU, and it is used for the prediction. To determine the best coding mode for each PU, all candidate modes are examined by calculating the RD cost. Finally, based on the RD costs, optimal CTU partitioning is obtained. A TU is a square block, which shares the same transform and quantization processes. A TU tree structure has its root at the CU level. The TU shape is always square, and it may take a size from 32×32 to 4×4 samples. Fig. 1 exemplary shows an optimal CTU partitioning into CUs with its corresponding quad-tree structure.