1. INTRODUCTION
The popular lossy image and video compression standards (e.g., JPEG [1] and HEVC [2]) adopt the block-based compression architecture to remove the redundancy. Despite the great success achieved over the past decade, they employed signal level visual representations without fully considering visual perception. Following the insight of Marr [3], geometric structures (e.g., edges and ridges) and stochastic textures are two prominent components that compose the visual scene. As such, it is highly expected that the compression algorithms follow such representations of an image, which can greatly promote the coding efficiency and representation capability.