Abstract:
We investigate the extent to which statistical predictive models leak information about their training data. More specifically, based on the use case of household (electr...Show MoreMetadata
Abstract:
We investigate the extent to which statistical predictive models leak information about their training data. More specifically, based on the use case of household (electrical) energy consumption, we evaluate whether white-box access to auto-regressive (AR) models trained on such data together with background information, such as household energy data aggregates (e.g., monthly billing information) and publicly-available weather data, can lead to inferring fine-grained energy data of any particular household. We construct two adversarial models aiming to infer fine-grained energy consumption patterns. Both threat models use monthly billing information of target households. The second adversary has access to the AR model for a cluster of households containing the target household. Using two real-world energy datasets, we demonstrate that this adversary can apply maximum a posteriori estimation to reconstruct daily consumption of target households with significantly lower error than the first adversary, which serves as a baseline. Such fine-grained data can essentially expose private information, such as occupancy levels. Finally, we use differential privacy (DP) to alleviate the privacy concerns of the adversary in dis-aggregating energy data. Our evaluations show that differentially private model parameters offer strong privacy protection against the adversary with moderate utility, captured in terms of model fitness to the cluster.
Published in: IEEE Transactions on Services Computing ( Volume: 15, Issue: 6, 01 Nov.-Dec. 2022)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Autoregressive Model ,
- Prediction Model ,
- Model Parameters ,
- Model Fit ,
- Energy Consumption ,
- Statistical Models ,
- Aggregate Data ,
- Background Information ,
- Daily Consumption ,
- Real-world Datasets ,
- Household Consumption ,
- Privacy Protection ,
- Occupational Level ,
- Posterior Mode ,
- Maximum A Posteriori ,
- Threat Model ,
- Differential Privacy ,
- Household Clusters ,
- Adversary Model ,
- Fine-grained Data ,
- Daily Energy Expenditure ,
- Machine Learning Models ,
- Aggregate Consumption ,
- Exogenous Input ,
- Over The Horizon ,
- Time Series ,
- Smart Meters ,
- Clusters In Dataset ,
- Average Time Series ,
- Simple Statistical Model
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Autoregressive Model ,
- Prediction Model ,
- Model Parameters ,
- Model Fit ,
- Energy Consumption ,
- Statistical Models ,
- Aggregate Data ,
- Background Information ,
- Daily Consumption ,
- Real-world Datasets ,
- Household Consumption ,
- Privacy Protection ,
- Occupational Level ,
- Posterior Mode ,
- Maximum A Posteriori ,
- Threat Model ,
- Differential Privacy ,
- Household Clusters ,
- Adversary Model ,
- Fine-grained Data ,
- Daily Energy Expenditure ,
- Machine Learning Models ,
- Aggregate Consumption ,
- Exogenous Input ,
- Over The Horizon ,
- Time Series ,
- Smart Meters ,
- Clusters In Dataset ,
- Average Time Series ,
- Simple Statistical Model
- Author Keywords