I. Introduction
Kernel methods [1], [2] have been successfully applied to many machine learning tasks such as classification [3], [4], regression [5], and clustering [6]. In these algorithms, a kernel function is employed to evaluate the similarity between two data points and . Herein, a positive definite (PD) kernel results in a positive semidefinite (PSD) kernel matrix to satisfy Mercer’s condition. Consequently, the above-mentioned approaches with PD kernels can be theoretically analyzed in the reproducing kernel Hilbert spaces (RKHSs) [7].