详细信息
Online Violent Behavior Recognition Based on Lightweight Temporal Shift Convolutional Neural Network ( EI收录)
文献类型:会议论文
英文题名:Online Violent Behavior Recognition Based on Lightweight Temporal Shift Convolutional Neural Network
作者:Zhang, Lujia[1]; Le, Haifeng[1]; Fang, Jianjun[2]; Liu, Yanxia[2]
第一作者:Zhang, Lujia
机构:[1] Beijing Union University, Beijing Key Laboratory of Information Service Engineering, Beijing, China; [2] College of Urban Rail Transit and Logistics, Beijing Union University, Beijing, China
第一机构:北京联合大学北京市信息服务工程重点实验室
通讯机构:[2]College of Urban Rail Transit and Logistics, Beijing Union University, Beijing, China|[1141751]北京联合大学城市轨道交通与物流学院;[11417]北京联合大学;
会议论文集:2023 7th International Conference on Electrical, Mechanical and Computer Engineering, ICEMCE 2023
会议日期:October 20, 2023 - October 22, 2023
会议地点:Hybrid, Xi'an, China
语种:英文
外文关键词:Behavioral research - Convolutional neural networks - Deep learning - Edge computing - Image enhancement - Inference engines - Security systems - Semantics
摘要:As a challenging task in the intelligent security monitoring system, violent behavior recognition plays a vital role in ensuring public safety. However, due to the complex semantic information about violent behaviors, there is still a lack of fast and reliable detection methods. To improve the accuracy and inference speed of violent behavior recognition, a lightweight convolutional neural network recognition algorithm named M-TSM based on the combination of temporal shift and spatial attention is proposed to solve the problems of insufficient motion information and slow inference speed in surveillance video images. First, random and equally spaced frames are drawn from the surveillance video to obtain the sequence of original video frames and normalized, then the depthwise separable convolution residual module is used to extract image features; the time shift module is used to fuse multiple frames of images to obtain video features containing multi-level time series information; finally, the spatial attention module focuses on the local features of the human body, so that the model can better distinguish between regular and violent videos, and output prediction results. The improved model is translated and compiled and then transplanted to the embedded device Jetson NX for deployment, which can realize edge computing, and the inference speed can meet the real-time requirements. The experimental results show that the recognition accuracy of the improved M-TSM algorithm in the violent behavior dataset is 84.1%, which is 1.4% higher than the original TSM algorithm, and the inference time in edge computing equipment is reduced by 25%, it has better real-time detection performance. ? 2023 IEEE.
参考文献:
正在载入数据...