Conferences >NAECON 2023 - IEEE National A...

Object Detection Using Vision Transformed EfficientDet

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Computer vision, a subdivision of computer science and artificial intelligence focuses on enabling computers to interpret and analyze visual data from the world, such as ...Show More

Metadata

Abstract:

Computer vision, a subdivision of computer science and artificial intelligence focuses on enabling computers to interpret and analyze visual data from the world, such as images and videos. Recent advances in convolutional neural networks (CNNs), have improved the performance of computer vision systems remarkably, making them more accurate and efficient than ever before. Object detection using CNNs is a popular application of deep learning in computer vision. There are several popular frameworks for object detection that are widely used in the industry with many using the primary concept of convolution e.g., RetinaNet, EfficientNet, and EfficientDet etc. In this paper, we propose a novel hybrid approach for object detection by combining the power of Vision Transformers (ViT) with state-of-the-art EfficientDet architecture, resulting in a powerful object detection framework. The ViT backbone, known for its success in image classification and natural language processing (NLP) tasks, captures global dependencies in the input image using self-attention mechanisms. By incorporating ViT into the EfficientDet architecture, we enhance its ability to capture fine-grained details and context information, leading to improved object detection accuracy which leverages the strengths of both among other improvements to achieve highly accurate and efficient performance. The training of the model was done using PASCAL VOC 2007 and 2012 datasets and testing was executed on PASCAL VOC 2007 to achieve a mAP of 86.27%.

Published in: NAECON 2023 - IEEE National Aerospace and Electronics Conference

Date of Conference: 28-31 August 2023

Date Added to IEEE Xplore: 26 December 2023

ISBN Information:

ISSN Information:

DOI: 10.1109/NAECON58068.2023.10365957

Conference Location: Dayton, OH, USA

Contents

References is not available for this document.

Object Detection Using Vision Transformed EfficientDet

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Object Detection Using Vision Transformed EfficientDet

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?