Abstract:
Applying computer vision techniques to rail surface defect detection (RSDD) is crucial for preventing catastrophic accidents. However, challenges such as complex backgrou...Show MoreMetadata
Abstract:
Applying computer vision techniques to rail surface defect detection (RSDD) is crucial for preventing catastrophic accidents. However, challenges such as complex backgrounds and irregular defect shapes persist. Previous methods have focused on extracting salient object information from a pixel perspective, thereby neglecting valuable high- and low-frequency image information, which can better capture global structural information. In this study, we design a pixel-aware frequency conversion network (PFCNet) to explore RSDD from a frequency domain perspective. We use different attention mechanisms and frequency enhancement for high-level and shallow features to explore local details and global structures comprehensively. In addition, we design a dual-control reorganization module to refine the features across levels. We conducted extensive experiments on an industrial RGB-D dataset (NEU RSDDS-AUG), and PFCNet achieved superior performance.
Published in: IEEE Signal Processing Letters ( Volume: 32)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Surface Defect Detection ,
- Rail Surface Defect Detection ,
- Attention Mechanism ,
- Global Information ,
- Global Structure ,
- Local Details ,
- Shallow Features ,
- Salient Object ,
- Shape Defects ,
- Global Structural Information ,
- Transformer ,
- Convolutional Neural Network ,
- Decoding ,
- Deeper Layers ,
- Random Noise ,
- Intersection Over Union ,
- Semantic Information ,
- Convolution Operation ,
- Similarity Matrix ,
- Learnable Parameters ,
- Discrete Cosine Transform ,
- Global Representation ,
- Convolution Method ,
- Deep Features ,
- Frequency-domain Method ,
- Representation Of Information ,
- Node Features ,
- Shallow Layers ,
- Detailed Representation ,
- RGB Features
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Surface Defect Detection ,
- Rail Surface Defect Detection ,
- Attention Mechanism ,
- Global Information ,
- Global Structure ,
- Local Details ,
- Shallow Features ,
- Salient Object ,
- Shape Defects ,
- Global Structural Information ,
- Transformer ,
- Convolutional Neural Network ,
- Decoding ,
- Deeper Layers ,
- Random Noise ,
- Intersection Over Union ,
- Semantic Information ,
- Convolution Operation ,
- Similarity Matrix ,
- Learnable Parameters ,
- Discrete Cosine Transform ,
- Global Representation ,
- Convolution Method ,
- Deep Features ,
- Frequency-domain Method ,
- Representation Of Information ,
- Node Features ,
- Shallow Layers ,
- Detailed Representation ,
- RGB Features
- Author Keywords