I. Introduction
With the development of deep learning techniques, the performance of object detection has rapidly improved in a short period of time. In contrast to small, medium, and large objects defined in the MS COCO benchmark [1], the AI-TOD benchmark [2] defines tiny objects as those with dimensions smaller than pixels. However, compared with the detection of larger objects, small objects themselves have fewer pixels in their extremely small size, weaker feature rendering ability, and their appearance information is extremely limited, resulting in less spatial information, and tiny objects are easily confused with the background, increasing background interference [3], [4]. Due to the limitation of spatial resolution, it is difficult for conventional object detectors to effectively capture the features required for accurate classification and localization of tiny objects, resulting in a large number of failures in detecting tiny objects [2], [5], [6], [7].