I. Introduction
Object detection in aerial images aims at identifying the locations and categories of objects of interest (e.g., planes, ships, vehicles). With the framework of deep convolutional neural networks, object detection in aerial images (ODAI) has made significant progress in recent years [1]–[7], where most of existing methods are devoted to cope with the challenges raised by the large-scale variations and arbitrary orientations of crowded objects in aerial images.