CHALLENGES AND OPPORTUNITIES FOR GENERATIVE METHODS IN THE CYBER DOMAIN | IEEE Conference Publication | IEEE Xplore

Abstract:

Large, high-quality data sets are essential for training machine learning models to perform their tasks accurately. The lack of such training data has constrained machine learning research in the cyber domain. This work explores how Markov Chain Monte Carlo (MCMC) methods can be used to generate realistic synthetic data and compares them with several existing generative machine learning techniques. Specifically, MCMC is compared with generative adversarial network (GAN) and variational autoencoder (VAE) methods at estimating the joint probability distribution of network intrusion detection system data. A statistical analysis of the synthetically generated cyber data measures its goodness of fit to the true distribution, with the aim of improving cyber threat detection. The experimental results suggest that data generated by MCMC fits the true distribution approximately as well as data generated by the GAN and VAE; however, MCMC requires a significantly longer training period and is unproven for higher-dimensional cyber data.
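The abstract does not say which MCMC algorithm or goodness-of-fit statistic the authors used. As a rough illustration of the general idea only, the sketch below assumes a random-walk Metropolis sampler targeting a Gaussian kernel density estimate of a single numeric feature, and scores the synthetic sample with a two-sample Kolmogorov-Smirnov statistic. The toy "real" data, the bandwidth, the step size, and all function names are hypothetical choices made for this example, not details from the paper.

```python
import math
import random


def kde_log_density(x, data, bandwidth):
    """Log of an unnormalized Gaussian kernel density estimate at x."""
    total = sum(math.exp(-0.5 * ((x - d) / bandwidth) ** 2) for d in data)
    # Floor the sum so the log is defined far away from every observation.
    return math.log(max(total, 1e-300))


def metropolis_sample(log_density, start, n_samples, step, burn_in, rng):
    """Random-walk Metropolis: draw n_samples points from exp(log_density)."""
    x = start
    log_p = log_density(x)
    samples = []
    for i in range(burn_in + n_samples):
        proposal = x + rng.gauss(0.0, step)
        log_p_new = log_density(proposal)
        # Accept with probability min(1, p(proposal) / p(x)).
        if rng.random() < math.exp(min(0.0, log_p_new - log_p)):
            x, log_p = proposal, log_p_new
        if i >= burn_in:
            samples.append(x)
    return samples


def ks_statistic(a, b):
    """Two-sample KS statistic: largest gap between the empirical CDFs."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    gap = 0.0
    while i < len(a) and j < len(b):
        v = min(a[i], b[j])
        while i < len(a) and a[i] <= v:
            i += 1
        while j < len(b) and b[j] <= v:
            j += 1
        gap = max(gap, abs(i / len(a) - j / len(b)))
    return gap


rng = random.Random(0)
# Hypothetical stand-in for one numeric feature of intrusion detection data.
real = [rng.gauss(2.0, 0.5) for _ in range(300)]
real += [rng.gauss(6.0, 1.0) for _ in range(200)]

synthetic = metropolis_sample(
    lambda x: kde_log_density(x, real, bandwidth=0.3),  # assumed bandwidth
    start=real[0],
    n_samples=1000,
    step=1.5,      # assumed proposal scale
    burn_in=200,   # assumed burn-in length
    rng=rng,
)

ks = ks_statistic(real, synthetic)
print(f"KS statistic between real and synthetic samples: {ks:.3f}")
```

A smaller KS statistic indicates a closer match between the synthetic and real marginal distributions. A one-dimensional toy like this sidesteps the difficulty the abstract points to: sampling a joint distribution over many features, where random-walk proposals mix far more slowly.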
Date of Conference: 12-15 December 2021
Date Added to IEEE Xplore: 23 February 2022
Conference Location: Phoenix, AZ, USA
