Introduction
The purpose of steganography is to embed secret messages into cover objects without arousing a warder’s attention. Because of the wide usage of JPEG images, increasingly much attention has been paid to JPEG steganography. Recently, with the advent of the STCs (Syndrome-Trellis Codes) coding technique [1], some advanced adaptive JPEG steganographic schemes have been designed, such as UED (Uniform Embedding Distortion) [2] and J-UNIWARD (JPEG UNIversal WAvelet Relative Distortion) [3]. These adaptive schemes are difficult to detect because their embedding changes are constrained to complex regions which are hard to model.
Although the previous JPEG steganalysis features (e.g., PEV [4] and JRM [5]) can effectively detect early embedding schemes (e.g., F5 [6], OutGuess [7], and model-based steganography [8]), they are less effective for detecting modern adaptive JPEG steganography. To better detect those adaptive embedding schemes, some JPEG phase-aware feature sets are proposed, such as DCTR (Discrete Cosine Transform Residual) [9], GFR (Gabor Filter Residual) [10], and PHARM (PHase-Aware pRojection Model) [11]. The success of JPEG phase-aware features can be summarized as follows. First, unlike the previous JPEG features formed from quantized DCT coefficients, JPEG phase-aware features are built in the spatial domain. This is probably because the statistical characteristics captured in the spatial domain are more sensitive to those adaptive JPEG embedding algorithms [12]. Second, JPEG phase-aware features compute histograms from the subsets of an image split by their JPEG phase rather than directly from the whole one. This is effective because the statistical properties of pixels in a decompressed JPEG image depend on their positions within the
The DCTR and GFR features compute histograms from the subsets of noise residuals. The difference between them is that DCTR uses 64 DCT bases to generate noise residuals whereas GFR uses 256 Gabor filters. But unlike DCTR and GFR, PHARM projects a large number of random matrices on residuals before the histogram computation to diversify the features. Meanwhile, however, PHARM is so computationally expensive because of these complicated random projections.
In this paper, we review the design of PHARM and propose three improvements for less computation time and higher detection accuracy.
First, the maximum size of projection matrices is reduced. To generate diverse projections in PHARM, residuals are convolved with many projection matrices of random sizes. Compared to large size projection matrices, moderate size ones not only reduce the computational complexity of convolution but also more effectively capture the embedding changes caused by adaptive JPEG steganography. In this paper, hence, compared to the original PHARM, the maximum projection matrix size decreases from eight to four.
Second, for each projection, multiple phase pairs are selected to compute phase-aware histograms. The original PHARM computes phase-aware histograms in a wasteful manner, choosing only one phase pair to compute a histogram for each projection. That is, one projection corresponds to only one phase-aware histogram. To obtain the original 12600-dimensional PHARM, a large number of (900) projections have to be generated for each residual, which is very time consuming. In this paper, we randomly select three different phase pairs per projection to compute phase-aware histograms and correspondingly reduce the number of projections per residual to keep the same feature dimension, which greatly reduces the computation time while maintaining the detection accuracy.
Third, the transposition symmetry is taken into account. The original PHARM applies three geometrical transformations (horizontal mirroring, vertical mirroring, and rotation by 180 degrees) to projection matrices for symmetrization. However, the transposed transformation is not considered. In this paper, besides the existing projection matrices, we also utilize their transposed versions to generate projections, and then merge the corresponding histograms based on the proper symmetries. This method further improves the detection accuracy while preserving the feature dimensionality.
The rest of this paper is organized as follows. In Section II, we briefly introduce the original PHARM to make this paper self-contained. In Section III, we describe the improved PHARM in detail. In Section IV, experiments are conducted to demonstrate the efficiency and effectiveness of our improved PHARM. Conclusions are drawn in Section V.
Original PHARM Features
In this section, we briefly describe how the original PHARM features are calculated to help readers better understand our proposed improvements. The calculation procedures of PHARM are described as follows (see Fig. 1).
Decompress a JPEG image into the spatial domain without rounding the pixel values to integers. The obtained decompressed JPEG image is denoted as
.\mathbf {X} \in \mathbb {R}^{n_{1} \times n_{2}} Generate the linear residual
by convolving the decompressed JPEG image\mathbf {Z} with one of seven linear high-pass filters\mathbf {X} .\mathbf {K} where the symbol ‘\begin{equation*} \mathbf {Z} = \mathbf {X} \star \mathbf {K},\tag{1}\end{equation*} View Source\begin{equation*} \mathbf {Z} = \mathbf {X} \star \mathbf {K},\tag{1}\end{equation*}
’ denotes convolution.\star Generate the projections
by convolving the residual\mathbf {P} with\mathbf {Z} random matrices\nu .{\Pi } where\begin{equation*} \mathbf {P}^{(i)} = \mathbf {Z} \star {\Pi }^{(i)}, ~i \in \{1, \ldots, \nu \},\tag{2}\end{equation*} View Source\begin{equation*} \mathbf {P}^{(i)} = \mathbf {Z} \star {\Pi }^{(i)}, ~i \in \{1, \ldots, \nu \},\tag{2}\end{equation*}
,{\Pi }^{(i)} \in \mathbb {R}^{r \times s} andr are uniformly randomly selected froms , the elements of\{1, \ldots, S\} are independent realizations of a standard normal random variable{\Pi }^{(i)} normalized to a unit Frobenius norm\mathcal {N}(0,1) .\| {\Pi } \|_{2} = 1 For each
, selecti \in \{1, \ldots, \nu \} ,u uniformly randomly and subsample the projection valuesv \in \{0, \ldots, 7\} to\mathbf {P}^{(i)} ,\mathbf {P}^{(i;u,v)} \triangleq \left ({p_{u + 8 \cdot k, v + 8 \cdot l}^{(i)}}\right) ,0\leq k \leq n_{1}/8-1 .0\leq l \leq n_{2}/8-1 Compute the histogram
of the quantized values\mathbf {h}^{(i;u,v)} ,\mathbf {P}^{(i;u,v)} where\begin{align*} \mathbf {h}^{(i;u,v)}_{t}=&\sum _{a,b}\left [{Q_{T}\left ({\left |{\left.{p_{a,b}^{(i; u,v)}}\right |}\right /q}\right)=t+1/2}\right],\qquad \\&\qquad \qquad \qquad \quad t \in \{0, \ldots, T-1\},\tag{3}\end{align*} View Source\begin{align*} \mathbf {h}^{(i;u,v)}_{t}=&\sum _{a,b}\left [{Q_{T}\left ({\left |{\left.{p_{a,b}^{(i; u,v)}}\right |}\right /q}\right)=t+1/2}\right],\qquad \\&\qquad \qquad \qquad \quad t \in \{0, \ldots, T-1\},\tag{3}\end{align*}
is a quantizer withQ_{T} centroidsT ,\{1/2, \ldots, T-1/2\} is the truncation threshold,T is the quantization step, and [q ] is the Iverson bracket equal to 0 when the statementP is false and 1 whenP is true.P Enlarge the set
by adding to it projections obtained by convolving with three transformed versions of the matrix\mathbf {P}^{(i)} , namely its horizontal flipping{\Pi }^{(i)} , vertical flipping{\Pi }^{(i)}_{\mathrm {flplr}} , and rotation by{\Pi }^{(i)}_{\mathrm {flpud}} .180^{\circ }~{\Pi }^{(i)}_{\mathrm {rot180}} Merge the histograms computed from the subsets of the projections obtained with
,{\Pi }^{(i)} ,{\Pi }^{(i)}_{\mathrm {flplr}} , and{\Pi }^{(i)}_{\mathrm {flpud}} . The merging method is based on the symmetries between{\Pi }^{(i)}_{\mathrm {rot180}} and its geometrically transformed versions.{\Pi }^{(i)} Concatenate all the symmetrized histograms to form the final feature set PHARM.
The parameter setup for the original PHARM is illustrated as follows. The maximum projection matrix size
Improved PHARM Features
The PHARM is effective for the detection of adaptive JPEG steganographic algorithms but has expensive computational cost. In this section, we elaborately modify its calculation process to reduce the computation time and further improve the detection accuracy.
A. Reducing Maximum Projection Matrix Size
In Step 3 of the PHARM calculation described in Section II, residuals are convolved with a large number of matrices of random sizes for diverse projections. The parameter
On the one hand, the reduction of the maximum projection matrix size
Based on the above two reasons, we determine to reduce the maximum size of projection matrices
In the experiments of this subsection, the dataset used for performance evaluation is BOSSbase 1.01 [16] which contains 10000 grayscale images of size \begin{equation*} P_{\mathrm {E}} = \mathrm {min}_{P_{\mathrm {FA}}}\frac {1}{2}(P_{\mathrm {FA}}+P_{\mathrm {MD}}),\tag{4}\end{equation*}
In Table 1, the detection accuracy and computation time for PHARM are given when the maximum projection matrix size
B. Selecting Multiple Phase Pairs Per Projection
In Step 4 of the PHARM calculation described in Section II, for each projection, only one phase pair is randomly selected to compute a corresponding phase-aware histogram. That is, the original PHARM only computes one two-dimensional histogram for each projection. To construct the 12600-dimensional PHARM (concatenated histograms), each residual has to be convolved with 900 random matrices to generate enough projections. This is the main reason why PHARM is so demanding to compute.
The original PHARM obtains phase-aware histograms in such a wasteful manner. Being different from PHARM, DCTR and GFR both exploit all 64 phase pairs to compute phase-aware histograms. In DCTR and GFR, a residual is first subsampled according to 64 phase pairs, and then 64 phase-aware histograms are computed from the corresponding sub-residuals. Inspired by the calculation of DCTR and GFR, we believe that selecting multiple phase pairs does not have a serious adverse effect on steganography detection. With multiple phase-aware histograms per projection, the number of projections per residual correspondingly decreases to maintain the same dimension (
Hence, we decide to utilize more than one phase pair per projection to compute phase-aware histograms. Next, we experimentally decide how many phase pairs are selected for each projection and correspondingly determine the number of projections per residual.
In the experiments of Table 2, the training and testing sets are the same as those of Table 1. The detection accuracy and computation time are also measured in the same manner as in the experiments of Table 1.
Table 2 shows the detection accuracy and computation time for PHARM with the different number of selected phase pairs per projection when the parameter
C. Employing Transposition Symmetry
In Step 7 of the PHARM calculation described in Section II, the histograms of different projections are merged based on the symmetries between projection matrix
The symmetrization is very useful for steganography detection. And the transposition symmetry is widely employed for symmetrization in various steganalysis features, such as PSRM [18], GFR-GSM [19], DCTRD, and GFRD.
In this paper, we attempt to employ the transposition symmetry for PHARM to further improve the detection accuracy. We first combine equations (1) and (2), and the projection \begin{equation*} \mathbf {P}^{(i)} = \mathbf {X} \star \mathbf {K} \star {\Pi }^{(i)} = \mathbf {X} \star \left ({\mathbf {K} \star {\Pi }^{(i)}}\right) = \mathbf {X} \star \mathbf {C}^{(i)}.\tag{5}\end{equation*}
\begin{align*} \mathbf {P}'^{(i)}=&\mathbf {X} \star \left ({\mathbf {C}^{(i)}}\right)^{T} = \mathbf {X} \star \left ({\mathbf {K} \star {\Pi }^{(i)}}\right)^{T} \\=&\mathbf {X} \star \left ({{\Pi }^{(i)}}\right)^{T} \star \mathbf {K}^{T} = \mathbf {X} \star \mathbf {K}^{T} \star \left ({{\Pi }^{(i)}}\right)^{T}.\tag{6}\end{align*}
Symmetries that can be utilized for a given
Next, we conduct some experiments to show that the proposed symmetrization strategy can further improve the detection accuracy of PHARM. In the experiments, 10000 images from BOSSbase 1.01 are converted into JPEG format with quality factors 75 and 95 as cover images and then stego images are generated by J-UNIWARD with a payload of 0.2 bpnzac. The
D. Final Feature Design
We now summarize the computation procedures of our PHARM as follows (see Fig. 3).
Decompress a JPEG image into the spatial domain without rounding to obtain the decompressed JPEG image
.\mathbf {X} \in \mathbb {R}^{n_{1} \times n_{2}} Generate the linear residuals
and\mathbf {Z} by convolving the decompressed JPEG image\mathbf {Z}' with the linear high-pass filter\mathbf {X} and its transposed version\mathbf {K} , respectively.\mathbf {K}^{T} Generate the projections
by convolving the residual\mathbf {P} with\mathbf {Z} random matrices\nu .{\Pi } For each
, select three different phase pairsi \in \{1, \ldots, \nu \} ,(u_{1}, v_{1}) ,(u_{2}, v_{2}) uniformly randomly and subsample the projection values(u_{3}, v_{3}) to\mathbf {P}^{(i)} ,\mathbf {P}^{(i;u_{j},v_{j})} \triangleq \left ({p_{u_{j} + 8 \cdot k, v_{j} + 8 \cdot l}^{(i)}}\right) ,u_{j}, v_{j} \in \{0,1, \ldots, 7\} ,j \in \{1, 2, 3\} ,0\leq k \leq n_{1}/8-1 .0\leq l \leq n_{2}/8-1 Compute the histogram
of the quantized values\mathbf {h}^{(i;u_{j},v_{j})} by (3).\mathbf {P}^{(i;u_{j},v_{j})} Enlarge the set
by adding to it projections obtained not only by convolving the residual\mathbf {P}^{(i)} with\mathbf {Z} ,{\Pi }^{(i)}_{\mathrm {flplr}} ,{\Pi }^{(i)}_{\mathrm {flpud}} but also by convolving the residual{\Pi }^{(i)}_{\mathrm {rot180}} with\mathbf {Z}' ,({\Pi }^{(i)})^{T} ,({\Pi }_{\mathrm {flplr}}^{(i)})^{T} ,({\Pi }_{\mathrm {flpud}}^{(i)})^{T} .({\Pi }_{\mathrm {rot180}}^{(i)})^{T} Merge the histograms computed from the subsets of the projections obtained with
,{\Pi }^{(i)} ,{\Pi }^{(i)}_{\mathrm {flplr}} ,{\Pi }^{(i)}_{\mathrm {flpud}} and their transposed versions. The merging method shown in Fig. 2 is based on the symmetries between{\Pi }^{(i)}_{\mathrm {rot180}} and its flipped, rotated, and transposed versions.{\Pi }^{(i)} Concatenate all the symmetrized histograms to form our improved PHARM.
All parameters in our PHARM except
Experiments
In the previous section, we introduce our improved PHARM in detail. Unlike the original PHARM, the maximum size of projection matrices is reduced from eight to four (
From Table 4, it can be seen that compared to the original PHARM, our new PHARM of the same dimension provides better detection performance regardless of quality factors, steganographic algorithms and embedding rates. Compared to GFR of larger dimensions, surprisingly, our improved PHARM also has higher detection accuracy for different conditions. For example, for quality factor 95 and J-UNIWARD at 0.4 bpnzac, compared to DCTR, PHARM and GFR, the detection accuracy of our improved PHARM can rise by 6.62%, 5.31%, and 1.44%, respectively. For quality factor 75 and UED-JC at 0.2 bpnzac, with respect to DCTR, PHARM and GFR, the improvements of our PHARM are 7.86%, 3.88%, and 2.41%, respectively. We also find that although the parameters of the improved PHARM are determined according to the detection performance for J-UNIWARD at 0.2 bpnzac and quality factor 75, they are effective for different conditions.
The efficiency of feature extraction is also important to the practical steganalysis application. For a
To sum up, in contrast to the original PHARM, our improved PHARM not only offers better detection accuracy but also needs less calculation time.
Conclusion
In this paper, we modify the original PHARM feature set for less computation time and higher detection accuracy. There are three improvements in our proposed PHARM. First, the maximum projection matrix size is reduced from eight to four, which not only lowers the computational complexity but also better captures the slight changes in the adaptive steganograghy. Second, instead of only one phase pair, three different ones per projection are randomly selected to compute phase-aware histograms and the number of projections per residual correspondingly decreases from 900 to 300, which reduces the computation time dramatically. Third, the transposition symmetry is taken into consideration, which further improves the detection accuracy while preserving the feature dimensionality. With these three improvements, our improved PHARM takes roughly two thirds of the computation time of the original PHARM but achieves higher detection accuracy than DCTR, original PHARM, and even GFR of larger dimensions.
The future work will focus on the following several aspects. First, the knowledge of the selection channel can be incorporated within the improved PHARM to further boost the detection of adaptive steganography. Second, a novel weighted histogram scheme proposed in [19] can also be applied to our PHARM to make it more powerful. Third, random projections can be used for preprocessing in CNN based steganalysis. Fourth, we can try to use the linear filters introduced in [20]–[22] for residual generation in PHARM. Fifth, in order to make our improved PHARM more usable in practice, we can implement it on a GPU device, similarly to GPU-PSRM [23], GPU-SRM and GPU-DCTR [24]. Sixth, as an efficient and effective scheme in JPEG image steganalysis, our improved PHARM can also inspire the design of novel steganalytic algorithms for HEVC and 3D-HEVC videos [25]–[27].