Wavelet packet analysis (WPA) and gray model (GM) are investigated for nonlinear unsupervised feature extraction of hyperspectral remote sensing data in this letter. Treated as derivative series, a hyperspectral response curve of each pixel is decomposed into an approximation and various detailed compositions by WPA, and then, GM is continuously applied to find the relationship among those detailed compositions. Cluster-space representation is used for determining the optimal wavelet. New extracted features can reveal the intrinsic identities of hyperspectral data. Experimental results show the feasibility and reliability of our proposed method in terms of classification accuracy.