Journals & Magazines >IEEE Journal of Selected Topi... >Volume: 17

Local Peak Savitzky–Golay for Spatio-Temporal Reconstruction of Landsat NDVI Time Series: A Case Study Over the Qinghai–Tibet Plateau

Abstract:

The incompleteness of the normalized difference vegetation index (NDVI) time series (TS) restricts its expanded applications in key domains. Although spatio-temporal hybr...Show More

Topic: Efficient Fusion of Multi-Source Remote Sensing Data

Metadata

Abstract:

The incompleteness of the normalized difference vegetation index (NDVI) time series (TS) restricts its expanded applications in key domains. Although spatio-temporal hybrid methods show promise in TS reconstruction, reliance on auxiliary data in most existing approaches introduces errors and increases workload. Furthermore, NDVI values marked as contaminated in the quality assessment (QA) data are underutilized. Ultimately, when utilizing spatial information, most methods are ineffective for the representation of land-use changes. Considering these issues, we propose a local peak Savitzky–Golay (LPSG) method for spatio-temporal reconstruction of Landsat NDVI TS. First, we construct a local peak neighborhood weighted interpolation (LPNWI) method that fully utilizes all original values to fill gaps. Second, we design a slope change decision tree (SC-DT) method for identifying residual noise, thereby mitigating the impact of QA errors on reconstruction results. Third, multidimensional calibration with weighted spatial reference (MDC-WSR) method is proposed to enhance utilization of spatial information by improving traditional correlation coefficient calculations and generating a multiyear spatial reference, which effectively reflects land-use changes. Experiments on Landsat NDVI TS data in the Qinghai–Tibet Plateau (2013–2022) show that: 1) LPSG outperforms other methods in mitigating the impact of QA errors, preserving TS peaks and details, and maintaining spatial continuity; 2) LPSG exhibits superior performance, with average RMSE reductions ranging from 0.00018 to 0.00750 compared to other methods under both correct and incorrect QA; and 3) LPSG demonstrates good robustness under various gap conditions and effectively restores TS of pixels affected by land-use changes.

Topic: Efficient Fusion of Multi-Source Remote Sensing Data

Published in: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ( Volume: 17)

Page(s): 13439 - 13455

Date of Publication: 23 July 2024

ISSN Information:

DOI: 10.1109/JSTARS.2024.3432797

Funding Agency:

Contents

SECTION I.

Introduction

Normalized difference vegetation index (NDVI) time series (TS) derived from remote sensing images has been widely used in vegetation phenology detection [1], land-cover change monitoring [2], [3], [4]; environmental dynamic simulation [5], [6]; vegetation classification [7], etc. However, satellite remote sensing TS images are frequently interrupted due to low time resolution or pollution by bad atmosphere (such as aerosols and dust), clouds, and snow [8], [9]. Consequently, it becomes imperative to employ effective methods for reconstructing NDVI TS to meet the subsequent extended application. According to the principle of missing information restoration, three categories of NDVI TS reconstruction methods have been widely used in the past few decades, namely, temporal-based method, frequency-based method, and hybrid method.

Temporal-based methods: This category of methods can be further classified into three types. Temporal interpolation replacement methods, like iterative interpolation for data reconstruction (IDR) [10], the best index slope extraction (BISE) [11], and modified BISE (M-BISE) [12]. Temporal filter, including the Savitzky–Golay (SG) filter [13], Whittaker filtering method [14], and changing-weight (CW) filter [15]. Temporal function fitting methods, such as double logistic (DL) function fitting [16], and Fourier [17].
The mentioned methods heavily rely on local temporal neighborhood information and have limited reconstruction capabilities. Consequently, researchers have extended temporal-based reconstruction methods. For instance, Liang et al. [18] utilized MODIS, Landsat8, and Sentinel-2 imagery at different spatio-temporal resolutions, employing gap-filling and Whittaker smooth filtering to recover NDVI TS. Yang et al. [19] enhanced the DCT-PLS method for reconstructing unevenly spaced data, generating high-quality and cloud-free Sentinel-2 NDVI TS.
Frequency-based methods: Frequency-based methods recover TS by transforming contaminated data from the time domain to the frequency domain. Notable methods in this category include the harmonic analysis of TS (HANTS) method [20], and the wavelet transform (WT) method [21]. However, these methods may unintentionally diminish reasonable high values and struggle to effectively preserve vegetation phenology. To address these limitations, improved frequency-based methods, such as the spatio-temporal prefill method with harmonic analysis of TS (ST-HANTS) [22], have been proposed.
Hybrid methods: The two aforementioned categories of methods have demonstrated success in specific scenarios [23]. However, their extended application is limited due to the absence of consideration for spatial dimensions. In recent years, the hybrid methods that integrate both time and space information have garnered attention and research interest from scholars. Examples include the search and fill algorithm with moving offset method (SFA-MOM) [24], spatio-temporal Savitzky–Golay (STSG) method [25], and spatio-temporal tensor completion (ST-Tensor) method [26].

The current NDVI TS reconstruction methods encounter three primary challenges. First, existing methods are typically designed for MODIS NDVI products, characterized by coarse spatial resolution and high temporal resolution [27]. This raises concerns about the applicability of the algorithms to middle- and low-resolution images, posing challenges for TS reconstruction. Second, most methods are nonideal in scenarios involving long-term continuous data gaps. Third, a significant drawback is the heavy reliance on the pixel reliability index (RI) dataset in many existing methods, leading to substantial noise in reconstruction results when RI contains errors [26], [28].

For the first issue, Landsat and Sentinel NDVI TS data are suitable for more detailed applications. However, due to their higher spatial resolution and infrequent revisit frequency, reconstructing Landsat and Sentinel NDVI TS data is more challenging compared to coarser resolution data. Several studies have proposed novel reconstruction algorithms tailored to the intricate nature of these data. For instance, Yu et al. [29] proposed a climate incorporated gap-filling (CGF) method to generate Landsat NDVI TS at 8-day intervals. Chen et al. [30] obtained NDVI TS data by integrating MODIS NDVI TS data with cloud-free Landsat observations. Additionally, Yang et al. [31] proposed a method to synthetically generate gap-free NDVI TS from raw contamination observations for reconstructing Sentinel-2 NDVI TS. Landsat and Sentinel TS data have also been used for national- or local-scale mangrove species mapping [32], [33], mapping mangrove functional traits [34], sustainable mangrove management [35], coastal salt marsh mapping [36], which greatly benefits blue carbon research and precise management.

For the second issue, there are mainly two approaches to resolve it.

The first common approach involves using spatial neighboring pixels to reconstruct the NDVI value of the target pixel. Methods like STSG [25], ST-Tensor [26], wWHd [37], etc., have gained prominence in this context. Typically, these methods generate a 1-year reference NDVI TS to capture the seasonal growth pattern. This is accomplished by computing the average of all uncontaminated NDVI values for the corresponding day of the year (DOY) across all years in the TS. Subsequently, the correlation between the reference TS of neighboring pixels and the target pixel is calculated to identify similar pixels. Finally, the NDVI value from the generated spatial reference TS is directly utilized to replace the NDVI value labeled as a pollution point.
Another commonly employed approach is MODIS-Landsat spatio-temporal fusion for reconstructing Landsat NDVI TS data. Examples include the highly scalable temporal adaptive reflectance fusion model (HIST-ARFM) algorithm [38], GF-SG [30], enhanced gap-filling and Whittaker smoothing (EGF-WS) [18], and CGF [29]. Such methodologies typically require obtaining cloud-free Landsat and MODIS images simultaneously on the base date, as well as MODIS images for the prediction date, thus presenting certain limitations in their applicability.

For the third issue, a limited number of scholars have introduced new methods to mitigate the dependence on quality assessment (QA). For instance, Zhu et al. [39] proposed a reconstruction method based on self-weighting function fitting from curve features (SWCF) that does not require ancillary data about quality. Additionally, Yang et al. [40] proposed an enhanced STSG method (cuSTSG) that alleviates the impact of inaccurate quality marks on the final results.

While the aforementioned improved methods have addressed most of the problems to some extent, three shortcomings persist.

Most existing methods rely on other data sources as supplementary data of the same type (with strict fusion requirements) or on prior knowledge data, resulting in the introduction of additional errors and a substantial increase in both data volume and workload.
Few methods adequately utilize the NDVI points marked as contaminated points in QA, which may include both valid and invalid points. This inadequacy results in further scarcity of usable data and exacerbates the challenges associated with reconstruction.
In terms of using spatial information, the current methods mainly generate a 1-year reference TS using the same DOY, which results in the inability to represent land-use changes and may also cause the correlation coefficient between the neighborhood pixel and the target pixel to be falsely high, affecting the judgment of similar pixels.

To address the aforementioned issues, this study proposes a local peak Savitzky–Golay (LPSG) method for spatio-temporal reconstruction of Landsat NDVI TS based on two characteristics: TS variation of NDVI should be smooth and continuous, and the NDVI values are always subject to negative bias [10]. First, we construct a local peak neighborhood weighted interpolation (LPNWI) method to fill gaps, eliminating the need for auxiliary data and maximizing the utilization of all original values. Second, we design a slope change decision tree (SC-DT) method to detect residual noise and mitigate it using LPNWI, thereby minimizing the error impact of QA. Third, a multidimensional calibration with weighted spatial reference (MDC-WSR) method is proposed. To be specific, we design a new method to compute the weighted correlation coefficient between the target pixel and its neighborhood pixels, generating a 10-year weighted spatial reference (WSR), which effectively represents the land-use changes. Subsequently, positive and negative bias anomalies are detected and calibrated. Finally, the SG filter is applied to obtain a smooth and high-quality TS. The main contributions of this article are as follows.

We construct a new LPNWI method for gap filling that fully utilizes the dynamic change law of gradual local NDVI values and the characteristic that contaminated NDVI values tend to be negatively biased noise [10].
The SC-DT method designed to remove the noise present in TS after gap filling is well resistant to the uncertainty of QA quality.
The traditional method of calculating correlation coefficients between a target pixel and its neighborhood pixels is improved for generating WSR over multiple years, which is more robust in dealing with land-use changes and large long time gaps.

SECTION II.

Study Area and Data

The Qinghai–Tibet Plateau (QTP) is situated in the southwest region of China, spanning from 25 $^\circ$ –40 $^\circ$ N and 74 $^\circ$ –104 $^\circ$ E, covering an area of roughly 2.5 million square kilometers [41]. With an average elevation surpassing 4500 m, this alpine ecoregion features various predominant vegetation types, including grassland, shrubland, and wetlands, as well as broadleaf and coniferous forests (refer to Fig. 1 for spatial distribution details). We collect surface reflectance data products of Landsat7 EMT+ and Landsat8-9 OLI from Google Earth Engine (GEE) and calculate NDVI using their red edge and near-infrared bands to test the reconstruction performance.

Fig. 1.

Study area of Qinghai–Tibet Plateau with GLC_FCS30-2020 [42] as the base map covered by eight typical land-use types.

Local Peak Savitzky–Golay for Spatio-Temporal Reconstruction of Landsat NDVI Time Series: A Case Study Over the Qinghai–Tibet Plateau

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Introduction

Study Area and Data

Proposed Methodology

A. LPNWI for Filling Gaps

B. SC-DT for Residual Noise Removing

C. MDC-WSR for TS Calibrating

D. SG Filter for TS Smoothing

Algorithm 1: Local Peak Savitzky–Golay (LPSG).

E. Quantitative Evaluation Indices

Experimental Results

A. Parameter Sensitive Analysis

B. Visual Evaluation of Reconstruction Results

1) Temporal Analysis

2) Spatial Analysis

C. Index Evaluation of Reconstruction Results

Discussions

A. Robust Analysis

B. Adaptability With Hybrid Land-Use

C. Validity Analysis of Improved Correlation Coefficient and Spatial Reference

Conclusion

ACKNOWLEDGMENT

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?