Loading [MathJax]/extensions/MathMenu.js
Exploring the Effectiveness of Deep Learning in Audio Compression and Restoration | IEEE Conference Publication | IEEE Xplore

Exploring the Effectiveness of Deep Learning in Audio Compression and Restoration


Abstract:

Modern audio processing requires compression and restoration for streaming services and data storage and transmission. This study examines how deep learning can address t...Show More

Abstract:

Modern audio processing requires compression and restoration for streaming services and data storage and transmission. This study examines how deep learning can address these difficulties and includes an architecture flow diagram. The suggested complex design has numerous integral stages optimized for audio compression and restoration. The input layer contains raw audio data. The pre-processing module normalizes and extracts features from raw data, frequently using Mel-frequency cepstral coefficients (MFCC) to capture audio characteristics. This module improves data quality by reducing noise and filtering. The architecture's heart is the compression module, where deep learning is used. This module compresses audio data using neural networks, maybe CNNs or RNNs. This compression is quantified by compression ratio and bitrate, with hypothetical values representing audio quality trade-offs. The compression module compacts audio data for storage or transmission. To minimize data size, quantization and encoding are common here. To accurately decompress audio, the decompression module mimics the compression module's architecture. Depending on needs, error correction or quality enhancement layers may be added. Filtering, equalization, and dynamic range compression may be used in the post-processing module to improve audio quality. If needed, a restoration module uses deep learning models to minimize noise and artifacts. This study presents four hypothetical scenarios to demonstrate different results. The scenarios emphasize high compression, balanced approaches, quality prioritization, and low resource utilization. Deep learning approaches can adapt to varied audio processing needs by balancing compression ratio, bitrate, audio quality (measured by PSNR), and restoration accuracy in each case.
Date of Conference: 22-23 March 2024
Date Added to IEEE Xplore: 05 September 2024
ISBN Information:
Conference Location: Pune, India

Contact IEEE to Subscribe

References

References is not available for this document.