Loading [a11y]/accessibility-menu.js
Scene Classification With Recurrent Attention of VHR Remote Sensing Images | IEEE Journals & Magazine | IEEE Xplore

Scene Classification With Recurrent Attention of VHR Remote Sensing Images


Abstract:

Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS...Show More

Abstract:

Scene classification of remote sensing images has drawn great attention because of its wide applications. In this paper, with the guidance of the human visual system (HVS), we explore the attention mechanism and propose a novel end-to-end attention recurrent convolutional network (ARCNet) for scene classification. It can learn to focus selectively on some key regions or locations and just process them at high-level features, thereby discarding the noncritical information and promoting the classification performance. The contributions of this paper are threefold. First, we design a novel recurrent attention structure to squeeze high-level semantic and spatial features into several simplex vectors for the reduction of learning parameters. Second, an end-to-end network named ARCNet is proposed to adaptively select a series of attention regions and then to generate powerful predictions by learning to process them sequentially. Third, we construct a new data set named OPTIMAL-31, which contains more categories than popular data sets and gives researchers an extra platform to validate their algorithms. The experimental results demonstrate that our model makes great promotion in comparison with the state-of-the-art approaches.
Published in: IEEE Transactions on Geoscience and Remote Sensing ( Volume: 57, Issue: 2, February 2019)
Page(s): 1155 - 1167
Date of Publication: 05 September 2018

ISSN Information:

Funding Agency:

References is not available for this document.

I. Introduction

With the rapid development of remote sensing instruments over recent years [1], [2], very high-resolution (VHR) remote sensing images are becoming increasingly available and bringing us the opportunity to try more research in military and civilian applications, such as natural disaster detection [3], [4], land-cover/land-use classification [5], [6] geographic space object detection [7], [8], geographic image retrieval [9], [10], urban planning, and environment monitoring. As we all know, VHR remote sensing images recognition based on the knowledge of domain experts has a high labor cost. Therefore, intelligent scene classification of remote sensing images [11]–[14], which categorizes scene images into different classes based on its semantic information, has drawn great attention in remote sensing field. Nevertheless, because of various classes of scenes and complex spatial information of VHR remote sensing images, how to effectively describe and classify the scenes is a pivotal and challenging task.

Select All
1.
L. Gómez-Chova, D. Tuia, G. Moser, and G. Camps-Valls, “Multimodal classification of remote sensing images: A review and future directions,” Proc. IEEE, vol. 103, no. 9, pp. 1560–1584, Sep. 2015.
2.
P. Gamba, “Human settlements: A global challenge for EO data processing and interpretation,” Proc. IEEE, vol. 101, no. 3, pp. 570–581, Mar. 2013.
3.
G. Cheng, L. Guo, T. Zhao, J. Han, H. Li, and J. Fang, “Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA,” Int. J. Remote Sens., vol. 34, no. 1, pp. 45–59, Jan. 2013.
4.
T. R. Martha, N. Kerle, C. J. V. Westen, V. Jetten, and K. V. Kumar, “Segment optimization and data-driven thresholding for knowledge-based landslide detection by object-based image analysis,” IEEE Trans. Geosci. Remote Sens., vol. 49, no. 12, pp. 4928–4943, Dec. 2011.
5.
X. Yao, J. Han, G. Cheng, X. Qian, and L. Guo, “Semantic annotation of high-resolution satellite images via weakly supervised learning,” IEEE Trans. Geosci. Remote Sens., vol. 54, no. 6, pp. 3660–3671, Jun. 2016.
6.
S. Cui and M. Datcu, “Comparison of approximation methods to Kullback–Leibler divergence between Gaussian mixture models for satellite image retrieval,” Remote Sens. Lett., vol. 7, no. 7, pp. 651–660, 2016.
7.
G. Cheng, P. Zhou, and J. Han, “Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images,” IEEE Trans. Geosci. Remote Sens., vol. 54, no. 12, pp. 7405–7415, Dec. 2016.
8.
S. Bhagavathy and B. S. Manjunath, “Modeling and detection of geospatial objects using texture motifs,” IEEE Trans. Geosci. Remote Sens., vol. 44, no. 12, pp. 3706–3715, Dec. 2006.
9.
Y. Wang, “A three-layered graph-based learning approach for remote sensing image retrieval,” IEEE Trans. Geosci. Remote Sens., vol. 54, no. 10, pp. 6020–6034, Oct. 2016.
10.
Y. Yang and S. Newsam, “Geographic image retrieval using local invariant features,” IEEE Trans. Geosci. Remote Sens., vol. 51, no. 2, pp. 818–832, Feb. 2013.
11.
J. Muñoz-Marí, F. Bovolo, L. Gómez-Chova, L. Bruzzone, and G. Camp-Valls, “Semisupervised one-class support vector machines for classification of remote sensing data,” IEEE Trans. Geosci. Remote Sens., vol. 48, no. 8, pp. 3188–3197, Aug. 2010.
12.
D. Tuia, F. Pacifici, M. Kanevski, and W. J. Emery, “Classification of very high spatial resolution imagery using mathematical morphology and support vector machines,” IEEE Trans. Geosci. Remote Sens., vol. 47, no. 11, pp. 3866–3879, Nov. 2009.
13.
A. M. Cheriyadat, “Unsupervised feature learning for aerial scene classification,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 1, pp. 439–451, Jan. 2014.
14.
G. Cheng, J. Han, and X. Lu, “Remote sensing image scene classification: Benchmark and state of the art,” Proc. IEEE, vol. 105, no. 10, pp. 1865–1883, Oct. 2017.
15.
M. Castelluccio, G. Poggi, C. Sansone, and L. Verdoliva. ( 2015 ). “Land use classification in remote sensing images by convolutional neural networks.” [Online]. Available: https://arxiv.org/abs/1508.00092
16.
F. Hu, G.-S. Xia, J. Hu, and L. Zhang, “Transferring deep convolutional neural networks for the scene classification of high-resolution remote sensing imagery,” Remote Sens., vol. 7, no. 11, pp. 14680–14707, 2015.
17.
G.-S. Xia, “AID: A benchmark data set for performance evaluation of aerial scene classification,” IEEE Trans. Geosci. Remote Sens., vol. 55, no. 7, pp. 3965–3981, Jul. 2017.
18.
X. Bian, C. Chen, L. Tian, and Q. Du, “Fusing local and global features for high-resolution scene classification,” IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 10, no. 6, pp. 2889–2901, Jun. 2017.
19.
R. A. Rensink, “The dynamic representation of scenes,” Vis. Cognit., vol. 7, nos. 1–3, pp. 17–42, 2000.
20.
M. Corbetta and G. L. Shulman, “Control of goal-directed and stimulus-driven attention in the brain,” Nature Rev. Neurosci., vol. 3, no. 3, pp. 201–215, 2002.
21.
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Proc. Adv. Neural Inf. Process. Syst., 2012, pp. 1097–1105.
22.
K. Simonyan and A. Zisserman. ( 2014 ). “Very deep convolutional networks for large-scale image recognition.” [Online]. Available: https://arxiv.org/abs/1409.1556
23.
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., Jun. 2016, pp. 770–778.
24.
Q. Wang, J. Gao, and Y. Yuan, “Embedding structured contour and location prior in siamesed fully convolutional networks for road detection,” IEEE Trans. Intell. Transp. Syst., vol. 19, no. 1, pp. 230–241, Jan. 2018.
25.
S. Chaib, H. Liu, Y. Gu, and H. Yao, “Deep feature fusion for VHR remote sensing scene classification,” IEEE Trans. Geosci. Remote Sens., vol. 55, no. 8, pp. 4775–4784, Aug. 2017.
26.
S. Chaib, H. Yao, Y. Gu, and M. Amrani, “Deep feature extraction and combination for remote sensing image classification based on pre-trained CNN models,” Proc. SPIE, vol. 10420, pp. 10420-1–10420-5, Jul. 2017.
27.
J. A. dos Santos, O. A. B. Penatti, and R. da Silva Torres, “Evaluating the potential of texture and color descriptors for remote sensing image retrieval and classification,” in Proc. VISAPP, vol. 2, 2010, pp. 203–208.
28.
V. Risojević and Z. Babić, “Fusion of global and local descriptors for remote sensing image classification,” IEEE Geosci. Remote Sens. Lett., vol. 10, no. 4, pp. 836–840, Jul. 2013.
29.
G. Cheng, J. Han, P. Zhou, and L. Guo, “Multi-class geospatial object detection and geographic image classification based on collection of part detectors,” ISPRS J. Photogramm. Remote Sens., vol. 98, pp. 119–132, Dec. 2014.
30.
G. Cheng, P. Zhou, J. Han, L. Guo, and J. Han, “Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images,” IET Comput. Vis., vol. 9, no. 5, pp. 639–647, Oct. 2015.

Contact IEEE to Subscribe

References

References is not available for this document.