Intelligent monitoring of indoor surveillance video based on deep learning

Submitted by aekwall on Mon, 08/12/2019 - 9:46am

Title	Intelligent monitoring of indoor surveillance video based on deep learning
Publication Type	Conference Paper
Year of Publication	2019
Authors	Liu, Y., Yang, Y., Shi, A., Jigang, P., Haowei, L.
Conference Name	2019 21st International Conference on Advanced Communication Technology (ICACT)
Keywords	Cameras, convolutional neural nets, Deep Learning, deep learning methods, deep video, fine-tuning network, high-quality segmentation mask, image retrieval, image segmentation, indoor surveillance video, information technology, Instance Segmentation, intelligent monitoring, intelligent video analytics technology, learning (artificial intelligence), Mask R-CNN, Metrics, object detection, pose estimation, protection system, pubcrawl, recurrent neural nets, Resiliency, Scalability, Semantics, smart monitoring system, storage management, Streaming media, surveillance, surveillance cameras, Surveillance video, Training, video image, video signal processing, video surveillance, video surveillance system
Abstract	With the rapid development of information technology, video surveillance systems have become a key part of the security and protection systems of modern cities. In prisons especially, surveillance cameras can be found almost everywhere. However, as surveillance networks continue to expand, the cameras not only bring convenience but also produce a massive amount of monitoring data, which poses huge challenges for storage, analytics, and retrieval. A smart monitoring system equipped with intelligent video analytics technology can monitor abnormal events or behaviours and raise early alarms for them, which makes it an active research direction in the field of surveillance. This paper applies deep learning methods, using Mask R-CNN, a state-of-the-art framework for instance segmentation, to fine-tune a network on our datasets; the network efficiently detects objects in a video image while simultaneously generating a high-quality segmentation mask for each instance. The experiments show that our network is simple to train and easy to generalize to other datasets, and that the mask average precision reaches nearly 98.5% on our own datasets.
URL	https://ieeexplore.ieee.org/document/8701964
DOI	10.23919/ICACT.2019.8701964
Citation Key	liu_intelligent_2019

Groups:

Science of Security VO