Scene-Dependent Prediction in Latent Space for Video Anomaly Detection and Anticipation

Abstract:

Video anomaly detection (VAD) plays a crucial role in intelligent surveillance. However, an essential type of anomaly, the scene-dependent anomaly, is overlooked. Moreover, the task of video anomaly anticipation (VAA) also deserves attention. To fill these gaps, we build a comprehensive dataset named NWPU Campus, which is the largest semi-supervised VAD dataset and the first dataset for scene-dependent VAD and VAA. Meanwhile, we introduce a novel forward-backward framework for scene-dependent VAD and VAA, in which the forward network solves VAD on its own and solves VAA jointly with the backward network. In particular, we propose a scene-dependent generative model in latent space for the forward and backward networks. First, we propose a hierarchical variational auto-encoder to extract scene-generic features. Next, we design a score-based diffusion model in latent space that refines these features to be more compact for the task and, together with a scene information auto-encoder, generates scene-dependent features, modeling the relationships between video events and scenes. Finally, we develop a temporal loss from key frames to constrain the motion consistency of video clips. Extensive experiments demonstrate that our method handles both scene-dependent anomaly detection and anticipation well, achieving state-of-the-art performance on ShanghaiTech, CUHK Avenue, and the proposed NWPU Campus datasets.

Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 47, Issue: 1, January 2025)

Page(s): 224 - 239

Date of Publication: 16 September 2024

PubMed ID: 39283792

DOI: 10.1109/TPAMI.2024.3461718

I. Introduction

Video anomaly detection (VAD) is a critical task in video surveillance. Because anomalies are unbounded and rare, VAD is typically formulated as a semi-supervised task in which only normal events, without specific labels, are available in the training data [1], [2]. Semi-supervised VAD has been studied for years, with the long-standing goal of training a one-class classifier that faithfully learns the normal data distribution while avoiding undesired generalization to anomalies. To this end, reconstruction-based [3], [4], [5] and prediction-based [6], [7], [8] deep learning methods have emerged in recent years and made great strides.
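
As a rough illustration of the prediction-based idea mentioned above, the following Python sketch turns per-frame prediction error into an anomaly score. It assumes that a predictor trained only on normal events has already produced the predicted frames. The function names `frame_error` and `anomaly_scores`, the flat-list frame representation, and the min-max normalization are illustrative assumptions, not the method of this paper.

```python
def frame_error(predicted, actual):
    """Mean squared error between a predicted frame and the observed frame.

    Frames are flat lists of pixel intensities (an assumption made to keep
    the sketch dependency-free).
    """
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


def anomaly_scores(predicted_frames, actual_frames):
    """Min-max normalise per-frame prediction errors within one video.

    A predictor trained only on normal events is expected to predict normal
    frames well, so frames with a large error receive scores close to 1.
    """
    errors = [frame_error(p, a) for p, a in zip(predicted_frames, actual_frames)]
    lowest, highest = min(errors), max(errors)
    if highest == lowest:
        # All frames are predicted equally well; no frame stands out.
        return [0.0 for _ in errors]
    return [(e - lowest) / (highest - lowest) for e in errors]


# Toy video with three frames of three pixels each (hypothetical values).
actual = [[0.1, 0.2, 0.3], [0.1, 0.2, 0.3], [0.9, 0.8, 0.7]]
predicted = [[0.1, 0.2, 0.3], [0.1, 0.25, 0.3], [0.1, 0.2, 0.3]]
scores = anomaly_scores(predicted, actual)
```

In this toy example the third frame deviates strongly from its prediction and therefore receives the highest score. A reconstruction-based variant would follow the same pattern, with the reconstructed frame taking the place of the predicted one.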
