||Improved DeepLab v3+ with Metadata Extraction for Small Object Detection in Intelligent Visual Surveillance Systems
||Heungmin Oh; Minjung Lee; Hyungtae Kim; Joonki Paik*
|| Metadata; Object segmentation; Surveillance system
||A surveillance system deploys multiple cameras to monitor a wide area in real time and to detect abnormal situations such as crimes, traffic accidents, and natural disasters. As the number of cameras increases, so does the number of monitors, which makes both human and automatic decision-making difficult. To solve this problem, smart surveillance schemes have recently been proposed, in which the system automatically detects an object and alerts the operator. In this paper, we present a metadata extraction method for object-based video summarization. The proposed method combines deep learning-based object detection with background elimination to accurately estimate the object region. Metadata are then extracted from the estimated object information. The proposed metadata consist of the representative color, size, aspect ratio, and patch of an object. The proposed method can extract reliable metadata from both static and dynamic cameras without relying on motion features. It can be applied to various object detection applications that use complex metadata.
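To make the four metadata fields concrete, the following is a minimal sketch of how they might be computed once an object mask is available. It is not the authors' implementation: the function name `extract_metadata`, the dictionary keys, and the use of the mean masked pixel value as the "representative color" are all assumptions made for illustration. The mask is assumed to come from a detection and background-elimination stage that is not shown here.

```python
import numpy as np


def extract_metadata(frame, mask):
    """Compute illustrative object metadata from a frame and an object mask.

    frame: H x W x 3 uint8 image.
    mask:  H x W array, nonzero where the object is (assumed to be produced
           by an upstream detection/background-elimination step).
    Returns None when the mask is empty.
    """
    mask = np.asarray(mask).astype(bool)
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return None

    # Tight bounding box around the masked pixels (end indices exclusive).
    top, bottom = int(ys.min()), int(ys.max()) + 1
    left, right = int(xs.min()), int(xs.max()) + 1
    width, height = right - left, bottom - top

    # Assumption: the representative color is the per-channel mean of the
    # object pixels only, so background inside the box does not bias it.
    mean_color = frame[mask].mean(axis=0)
    color = tuple(int(c) for c in np.round(mean_color))

    return {
        "color": color,
        "size": (width, height),
        "aspect_ratio": width / height,
        "patch": frame[top:bottom, left:right].copy(),
    }
```

Because the color is averaged over mask pixels rather than over the whole bounding box, the result depends on how well the background has been removed, which is consistent with the abstract's emphasis on estimating the object region accurately before extracting metadata.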