Abstract:
AudioSet is one of the most used and largest datasets in audio tagging, containing about 2 million audio samples that are manually labeled with 527 event categories organ...Show MoreMetadata
Abstract:
AudioSet is one of the most used and largest datasets in audio tagging, containing about 2 million audio samples that are manually labeled with 527 event categories organized into an ontology. However, the annotations contain inconsistencies, particularly where categories that should be labeled as positive according to the ontology are frequently mislabeled as negative. To address this issue, we apply Hierarchical Label Propagation (HLP), which propagates labels up the ontology hierarchy, resulting in a mean increase in positive labels per audio clip from 1.98 to 2.39 and affecting 109 out of the 527 classes. Our results demonstrate that HLP provides performance benefits across various model architectures, including convolutional neural networks (PANN’s CNN6 and ConvNeXT) and transformers (PaSST), with smaller models showing more improvements. Finally, on FSD50K, another widely used dataset, models trained on AudioSet with HLP consistently outperformed those trained without HLP. Our source code will be made available on GitHub.
Published in: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 06-11 April 2025
Date Added to IEEE Xplore: 07 March 2025
ISBN Information:
ISSN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Label Propagation ,
- Hierarchical Label ,
- Convolutional Neural Network ,
- Model Architecture ,
- Small Model ,
- Positive Labels ,
- Audio Clips ,
- Model Performance ,
- Percentage Points ,
- Model Size ,
- Large Model ,
- Root Node ,
- Single-parent ,
- Original Authors ,
- Graph Convolutional Network ,
- Multi-task Learning ,
- Mean Average Precision ,
- Number Of Labels ,
- Version Of Dataset ,
- Evaluation Scenarios ,
- Ontology Structure ,
- Label Noise ,
- Label Space ,
- Noisy Labels
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Label Propagation ,
- Hierarchical Label ,
- Convolutional Neural Network ,
- Model Architecture ,
- Small Model ,
- Positive Labels ,
- Audio Clips ,
- Model Performance ,
- Percentage Points ,
- Model Size ,
- Large Model ,
- Root Node ,
- Single-parent ,
- Original Authors ,
- Graph Convolutional Network ,
- Multi-task Learning ,
- Mean Average Precision ,
- Number Of Labels ,
- Version Of Dataset ,
- Evaluation Scenarios ,
- Ontology Structure ,
- Label Noise ,
- Label Space ,
- Noisy Labels
- Author Keywords