Journals & Magazines >IEEE Access >Volume: 9

Sparse Representation Based Hyperspectral Anomaly Detection via Adaptive Background Sub-Dictionaries

The overview of the proposed sparse representation based hyperspectral anomaly detection via adaptive background sub-dictionaries.

Abstract:

Hyperspectral anomaly detection has drawn much attention in recent years. In this paper, in order to effectively extract anomalies in hyperspectral images, a novel sparse...Show More

Metadata

Abstract:

Hyperspectral anomaly detection has drawn much attention in recent years. In this paper, in order to effectively extract anomalies in hyperspectral images, a novel sparse-representation based hyperspectral anomaly detection method via adaptive background sub-dictionaries is proposed. Firstly, a background estimation strategy is proposed to provide representative background information. Based on the estimated background, a global dictionary is constructed by utilizing K-means clustering algorithm. Next, Several active atoms are selected from the global dictionary to form a sub-dictionary to adaptively approximate the local region in each dual-window. This sub-dictionary construction strategy can remove potential anomaly contamination in local regions. Finally, a re-weighting strategy is proposed to enhance the performance of sparse-representation-based anomaly detector. Experimental results demonstrate that our method can effectively extract anomalies and suppress background simultaneously.

The overview of the proposed sparse representation based hyperspectral anomaly detection via adaptive background sub-dictionaries.

Published in: IEEE Access ( Volume: 9)

Page(s): 14735 - 14751

Date of Publication: 29 October 2020

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2020.3034796

Contents

CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.

SECTION I.

Introduction

Hyperspectral images (HSIs) are capable to contain abundant spectral characteristics of ground materials [1]. They consist of hundreds or even thousands of continuous and narrow spectral bands ranged from $0.4-2.5~\mu m$ and each band is approximately $0.01~\mu m$ wide [2]. Owing to the high resolution in spectral dimension, different objects can be recognized and distinguished according to their spectral signatures via hyperspectral images. Upon this basis, HSIs have been employed in various tasks that require identification of objects such as target detection. In terms of whether prior information is required, hyperspectral target detection can be categorized into two types: supervised and unsupervised. Due to small spatial size of targets and unpredictable atmosphere factors, it is usually hard to obtain the spectral information of targets. Therefore, unsupervised target detection, known as anomaly detection, is more commonly researched in practical and has drawn much attention with state-of-the-art techniques, such as compressive sensing [3] and deep learning [4].

Anomalies in HSIs refer to objects that occupy a few pixels (even subpixels in some situations). They have significantly distinct spectral characteristics from neighboring regions. Over the last few decades, a quantity of anomaly detection methods have been proposed. The well-known Reed-Xiaoli (RX) algorithm [5] exploits the assumption that the background follows a multivariate normal distribution. It measures the Mahalanobis distance between the spectrum vectors of the test pixel and the background pixels as detection results. Local RX and Global RX are studied respectively according to different means of background estimation. However, the performance of RX algorithm is unstable as it essentially depends on the estimated background covariance matrix. Moreover, the assumption of background distribution is not in accordance with the fact that the background in real-world HSI is much more complicated. To address these issues, quite an amount of RX-based algorithms have been developed. The regularized RX algorithm [6] aims to attenuate the ill conditioning of the matrix inversion by regularizing the background covariance matrix. Aiming at decreasing anomaly contamination in background statistics, the weighted-RX algorithm [7] estimates the Gaussian probability as weight vectors. In order to effectively separate anomaly pixels from the background, the kernel RX algorithm [8] projects the HSI dataset into a higher dimensional feature space. The subspace-based RX is introduced in [9], which explores the background features via the representative eigenvectors of the covariance matrix.

In recent years, with the development of compressed sensing theory, representation based techniques have emerged as a hot topic in many application fields, such as anomaly detection [3], face recognition [10], image classification [11], image denoising [12], and so on. Sparse representation (SR) based HSI anomaly detectors assume that a background pixel can be linearly represented with only a few coefficients over a background dictionary while an anomaly pixel can not. Li et al. [13] select the most representative background elements to adaptively approximate local regions, thus false alarm rate can be effectively reduced. Aiming at reducing anomaly contamination in the background dictionary, Zhu et al. [14] construct a background dictionary via extracted background endmembers. A sparsity score estimation framework is proposed in [15] to provide a novel view for HSI anomaly detection. The atom usage probability (AUP) score is used to assess reconstruction energy of dictionary atoms, which helps enhancing the discriminative power of the background dictionary. Low-rank representation (LRR) based methods also play a vital role in HSI anomaly detection. In LRR model, HSI data is assumed drawn from multiple subspaces. Based on this assumption, the background part and the anomaly part are able to be separated by a background dictionary. The anomaly detector based on low-rank and sparse representation introduced in [16] employs LRR model to obtain the sparse anomaly component. The $l_{2}$ -norm is then applied to columns in the sparse matrix to locate anomaly pixels. Wang et al. [17] form a background dictionary with the material signature matrix for the LRR model to extract the background information to identify the anomaly components. As one of the significant characteristic of HSI, Tan et al. [18] analyze the spatial similarity among pixels in local regions and impose a spatial constraint to improve the detection performance with LRR model. Collaborative representation (CR) technique also has an outstanding performance in detecting anomalies in HSIs. The collaborative representation based detectors (CRD) adopt a sliding dual-window strategy and consider that the central test pixel lies in the subspace spanned by neighboring pixels in the outer window. The detection criterion is the reconstruction error of the test pixel. Li et al. [19] introduce a distance-weighted Tikhonov regularization to the CRD optimization procedure and then project the detector into a higher dimension by the kernel trick. For the aim to eliminate the influence by potential anomalies in the outer window, Li et al. [20] develop a principal-component-analysis (PCA) based method to remove the outliers in neighboring regions. Wu et al. [21] combine LRR model with CRD to achieve a more effective separation between background component and sparse anomaly component. The aforementioned methods focus on obtaining a promising background estimation, which is further used to extract anomalies. Although they design various strategies to extract pure background information, the detection results yet suffer from serious false alarms. This is attributed to anomaly contamination and lack of complete background information. Therefore, estimating a pure background without anomaly contamination still remains a challenge. Especially, the accuracy of the representation-based detectors essentially relies on the quality of estimated background, i.e. the quality of the constructed background dictionary. Generally, a desirable background dictionary is expected to be immune from anomaly contamination and to contain as abundant background information as possible.

In this paper, inspired by the work of Zhu et al. [14], and from the perspective of dictionary construction for SR, we propose a novel hyperspectral anomaly detection method based on adaptive background sub-dictionaries. The main contributions of this paper can be summarized as follows:

An SMACC endmember extraction model based background estimation strategy is proposed so that representative and pure background information can be extracted.
Based on the estimated background, a global dictionary is constructed by utilizing K-means clustering algorithm. Several active atoms are selected from this global dictionary to form a sub-dictionary. The local region in each dual-window can be adaptively approximated by this sub-dictionary.
With the sub-dictionaries, a re-weighting strategy based on spectral angle distance is proposed to enhance the performance of SR based anomaly detector.

The remainder of this paper is organized as follows. In Section 2, the basic theories of SR based anomaly detector and SMACC endmember extraction model are briefly reviewed. In Section 3, the proposed hyperspectral anomaly detection method is demonstrated in detail. In Section 4, with the experiments on real HSI datasets, the effectiveness of the proposed method is evaluated and the proposed strategies and parameters are further discussed. In Section 5, we draw the conclusions.

SECTION II.

Related Works

A. Sparse Representation for Anomaly Dectection

The basic idea of SR based anomaly detection is to represent the test pixel with the linear combination of the background dictionary atoms. It assumes that if a pixel belongs to the background class, it lies in the subspace spanned by the background dictionary atoms. Given a reshaped HSI dataset denoted as $\mathbf {X}=[\mathbf {x}_{1}, \mathbf {x}_{2},\ldots,\mathbf {x}_{N}]\,\,\in \,\,\mathbf {R}^{B\times N}$ where $B$ is the number of spectral bands and $N$ is the number of pixels. The SR model for each pixel $\mathbf {x}_{i}~(1\leq i \leq N)$ can be expressed as $\begin{equation*} \mathbf {x}_{i} = \mathbf {D}\boldsymbol{\alpha }_{i} = \alpha _{i1}\mathbf {d}_{1} + \alpha _{i2}\mathbf {d}_{2} + \cdots + \alpha _{ik}\mathbf {d}_{k} \tag{1}\end{equation*}$ View Source Here $\mathbf {D}=[\mathbf {d}_{1}, \mathbf {d}_{2},\ldots,\mathbf {d}_{K}]\,\,\in \,\,\mathbf {R}^{B\times K} \,\,(B\ll K)$ is the overcomplete background dictionary with $K$ atoms, $\mathbf {d}_{i}~(1\leq i \leq K)$ denotes the $i$ th atom, and $\boldsymbol{\alpha }=[\alpha _{1}, \alpha _{2},\ldots,\alpha _{n}]^{T}$ is the sparse coefficient vector with only a few nonzero entries. This implies that $\mathbf {x}_{i}$ can be represented with the linear combination of $K_{0}$ atoms in $\mathbf {D}$ in which $K_{0}$ is far less than $K$ . The sparse vector can be acquired via solving the following optimization problem $\begin{equation*} \mathop {\text {min}}_{\boldsymbol{\alpha }_{i}} \Vert \mathbf {x}_{i}-\mathbf {D}\boldsymbol{\alpha }_{i} \Vert _{2}^{2} \quad \text {s.t.}~\Vert \boldsymbol{\alpha }_{i}\Vert _{0} \leq K_{0} \quad \forall i \tag{2}\end{equation*}$ View Source where $\Vert \cdot \Vert _{0}$ denotes the $l_{0}$ -norm that counts the number of nonzero entries in the vector, and $K_{0}$ is the upper bound of the sparsity level for $\boldsymbol{\alpha }_{i}$ . This optimization problem can be solved by the orthogonal matching pursuit algorithm (OMP) [22]. Once the estimated coefficient vector $\mathop{\mathbf {\alpha }}\limits^{\wedge }_{i}$ is obtained, the detection response of the $i$ th pixel can be obtained by computing the reconstruction residual $\begin{equation*} r_{i} = \Vert \mathbf {x}_{i}-\mathbf {D}{\mathop {\boldsymbol{\alpha }}^{\mathbf {\wedge }}}_{i}\Vert _{2} \tag{3}\end{equation*}$ View Source Here $r_{i}$ is the reconstruction residual of the pixel $\mathbf {x}_{i}$ . if the residual $r_{i}$ is larger than a given threshold, then the test pixel $\mathbf {x}_{i}$ is considered to be an anomalous pixel.

B. SMACC Endmember Extraction

In an HSI, an endmember refers to the spectral characteristics of certain one type pure component. In order to extract endmember spectra and abundance maps simultaneously, Gruninger et al. proposed the sequential maximum angle convex cone (SMACC) endmember extraction model. Given an HSI dataset ${\mathbf {X}} \in {\mathbf {R}^{B\times N}}$ , where $B$ is the number of spectral bands and $N$ denotes the number of pixels, the linear spectral mixture model can be written as follows $\begin{equation*} \mathbf {X}_{i,j}=\sum _{h=1}^{H}\mathbf {M}_{i,h}\mathbf {A}_{h,j}+\mathbf {R}_{i,j} \tag{4}\end{equation*}$ View Source Here $\mathbf {X}_{i,j}$ denotes the $i$ th band of the $j$ th pixel in $\mathbf {X}$ , $H$ is the expansion length. $\mathbf {M}=[\mathbf {m}_{1}, \mathbf {m}_{2}, \ldots, \mathbf {m}_{H}] \in \mathbf {R}^{B\times H}$ is the endmember spectral matrix, where each column indicates an endmember spectrum vector. $\mathbf {A} = [\mathbf {a}_{1}, \mathbf {a}_{2}, \ldots, \mathbf {a}_{H}]^{T} \in \mathbf {R}^{H\times N}$ is the abundance matrix, where each row contains the abundance map of the corresponding endmember for each pixel. The matrix $\mathbf {R \in R^{B\times N}}$ is the residuals. The SMACC model recognizes the endmember spectra via a convex cone model, and a positive constraint is imposed since the spectrum vector represents reflectance. The convex cone is determined by using the extreme points and thus the first endmember is defined. The residuals denote the elements distributed outside the convex cone. The rest endmember is successively derived by implementing a constrained oblique projection on the previous convex cone. Adding new endmembers alternates with updating the convex cone. This process is terminated until a certain error is satisfied. The final result of SMACC contains the endmember spectra set and the abundance images. Additionally, the abundance images demonstrate the contribution of endmembers for each pixel. This extraction process can be performed via the ENVI remote sensing image processing platform [23].

SECTION III.

Proposed Method

In this section, the detailed introduction of the proposed method is illustrated. This section includes four parts. In the first part, the background estimation strategy via the SMACC model is introduced. In the second part, the adaptive background sub-dictionary construction method based upon the atom usage probability (AUP) is described. In the third part, the spectral angle distance (SAD) based adaptive re-weighted SR based anomaly detection method is demonstrated. Finally, the overview of the proposed method is summarized.

A. Background Estimation Strategy

The performance of representation based anomaly detectors highly relies on the background dictionary. By constructing discriminative dictionary to improve detection performance has been a hot topic. Qu et al. [24] construct a background dictionary based on the estimated background from the main shift clustering algorithm instead of raw data, which will enhance the separation between anomalies and background. Ma et al. [25] divide background into several categories and select a series of representative samples from each categories to build multiple background dictionaries, so that the differences between anomalies and background are enhanced. Yang el al. [26] establish a pure background dictionary that excludes possible anomalies and thus providing more reliable detection results based on LRR model.

For the SR based detectors, the quality of background dictionary evidently influences the detection probability. Generally, two options of background dictionaries for unsupervised SR based detectors are available: the global dictionary and the local dictionary. The global one is usually constructed by randomly selecting some pixels from the HSI [27]. As for the local one, a dual-window strategy (shown in Fig. 1) is adopted and the pixels in the outer window are collected to form the dictionary. The local dictionary based SR model is referred as joint sparsity model. In the work by Zhu et al. [14], a new global background dictionary is constructed to eliminate the anomalies embedded in the background. The dictionary atoms are randomly selected from the estimated background by using the SMACC model, and the global dictionary is used directly for detection. Different from Zhu’s work, we implement K-means clustering algorithm to the estimated background and choose several samples from each cluster to ensure that all types of background information can be revealed in this global dictionary. Moreover, we use this global dictionary to eliminate the anomaly contamination in the local regions in the dual-window.

FIGURE 1.

Dual-window strategy for anomaly detection.

Sparse Representation Based Hyperspectral Anomaly Detection via Adaptive Background Sub-Dictionaries

Alerts

Abstract:

Metadata

Abstract:

Introduction

Related Works

A. Sparse Representation for Anomaly Dectection

B. SMACC Endmember Extraction

Proposed Method

A. Background Estimation Strategy

B. Adaptive Background Sub-Dictionary Construction Method

C. Re-Weighted SR Based Anomaly Detection

D. Overview of the Proposed Method

Experiments and Analysis

Sparse Representation Based Hyperspectral Anomaly Detection via Adaptively Estimated Background Sub-Dictionaries

A. Data Description

B. Detection Performance

C. Discussion

D. Parameter Analysis

Conclusion

ACKNOWLEDGMENT

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?