Loading [MathJax]/extensions/MathMenu.js
Data Augmentation for Environment Perception with Unmanned Aerial Vehicles | IEEE Journals & Magazine | IEEE Xplore
Scheduled Maintenance: On Tuesday, 8 April, IEEE Xplore will undergo scheduled maintenance from 1:00-5:00 PM ET (1800-2200 UTC). During this time, there may be intermittent impact on performance. We apologize for any inconvenience.

Data Augmentation for Environment Perception with Unmanned Aerial Vehicles


Abstract:

Large and high-quality training datasets are of critical importance for deep learning. In the context of the semantic segmentation challenge for UAV aerial images, we pro...Show More

Abstract:

Large and high-quality training datasets are of critical importance for deep learning. In the context of the semantic segmentation challenge for UAV aerial images, we propose a strategy for data augmentation that can significantly reduce the effort of manually annotating a large number of images. The result is a set of semantic, depth and RGB images that can be used to improve the performance of neural networks. The main focus of the method is the generation of semantic images, with depth and texture images also being generated through the process. The proposed method for semantic image generation relies on a 3D semantic mesh representation of the real-world environment. First, we propagate the existing semantic information from a reduced set of manually labeled images into the mesh representation. To deal with errors in the manually labeled images, we propose a specific weighted voting mechanism for the propagation process. Second, we use the semantic mesh to create new images. Both steps use the perspective projection mechanism and the Depth Buffer algorithm. The images can be generated using different camera orientations, allowing novel view perspectives. Our approach is conceptually general and can be used to improve various existing datasets. Experiments with existing datasets (UAVid and WildUAV), augmented with the proposed method, are performed on HRNet. An overall performance improvement of the inference results by up to 5.5% (mIoU) is obtained. The augmented datasets are publicly available on GitHub.
Published in: IEEE Transactions on Intelligent Vehicles ( Early Access )
Page(s): 1 - 15
Date of Publication: 06 March 2024

ISSN Information:


Contact IEEE to Subscribe