A Spatial-Temporal based Next Frame Prediction and Unsupervised Classification of Video Anomalies in Real Time Estimation
Swapna Kumari Sahu1, M. Jayanthi Rao2
1Swapna Kumari Sahu*, PG Research Scholars, Department of Computer Science and Engineering Sri Sivani College of Chilakapalem, (Andhra Pradesh) India.
2Dr. M. Jayanthi Rao, M. Tech, Ph. D Associate Professor, Department of Computer Science and Engineering, Sri Sivani College of Chilakapalem, (Andhra Pradesh) India.
Manuscript received on September 22, 2021. | Revised Manuscript received on September 29, 2021. | Manuscript published on October 30, 2021. | PP: 120-124 | Volume-11 Issue-1, October 2021. | Retrieval Number: 100.1/ijeat.A31611011121 | DOI: 10.35940/ijeat.A3161.1011121
Open Access | Ethics and  Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Anomaly detection is an area of video analysis has a great importance in automated surveillance. Although it has been extensively studied, there has been little work started using CNN networks. Hence, in this thesis we presented a novel approach for learning motion features and modeling normal Spatio-temporal dynamics for anomaly detection. In our technique, we capture variations in scale of the patterns of motion in an image object by using optical flow dense estimation technique and train our auto encoder model using convolution long short term memories (ConvLSTM2D) as we are processing video frames and we predict the anomaly in real time using Euclidean distance between the generated and the ground truth frame and we achieved a real time accuracy of nearly 98% for the youtube videos which are not used for either testing or training. Error between the network’s output and the target output is used to classify a video volume as normal or abnormal. In addition to the use of reconstruction error, we also use prediction error for anomaly detection. The prediction models show comparable performance with state of the art methods. In comparison with the proposed method, performance is improved in one dataset. Moreover, running time is significantly faster.
Keywords: Spatio-Temporal video features, ConvLSTM2D