
Details

Bi-GRU-Attention Enhanced Unsupervised Network for Skeleton-Based Action Recognition (EI indexed)

Document type: Conference paper

English title: Bi-GRU-Attention Enhanced Unsupervised Network for Skeleton-Based Action Recognition

Authors: Chen, Li[1,2]; Ma, Nan[1,2]; Zhang, Guoping[1,2]

Affiliations: [1] Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing, 100101, China; [2] College of Robotics, Beijing Union University, Beijing, 100101, China

First affiliation: Beijing Key Laboratory of Information Service Engineering, Beijing Union University

Conference proceedings: Proceedings of 2021 International Conference on Autonomous Unmanned Systems, ICAUS 2021

Conference dates: September 24, 2021 - September 26, 2021

Conference location: Changsha, China

Language: English

Keywords: Decoding - Human computer interaction - Large dataset - Machine learning - Musculoskeletal system - Nearest neighbor search - Signal encoding

Abstract: Action recognition is widely used in human-computer interaction, intelligent monitoring, and other applications, and it has attracted growing research attention. Skeleton-based action recognition in particular is increasingly popular because of its low cost and robustness. Most current high-performing skeleton-based methods are supervised and need large labeled datasets for training. For some special tasks, however, such as pedestrian action recognition for unmanned driving in real traffic scenes, labeled action data is difficult to obtain because of the large cost in time, money, and energy. We therefore propose a Bi-GRU-Attention Enhanced Unsupervised Network (BGAEUN) for action recognition based on skeleton sequences. BGAEUN adopts an encoder-decoder network and adds an attention mechanism in the encoder that learns the weights of the hidden states and obtains the weights of different skeleton nodes, so that it can better characterize actions and provide better skeleton features. The decoder is weakened using fixed-weight and fixed-state strategies, and the reconstruction loss between the decoder's output and the input is minimized during training so that the output resembles the input. BGAEUN classifies actions by applying the k-nearest neighbors algorithm to the features from the last layer of the encoder. Experiments on the large-scale action dataset NTU-RGB+D show that BGAEUN achieves higher recognition accuracy than most current unsupervised skeleton-based action recognition methods. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
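
Illustrative note: the two inference-time ideas named in the abstract, attention-weighted pooling of encoder hidden states and k-nearest-neighbor classification on the resulting feature, can be sketched in a few lines of plain Python. This is a minimal sketch under stated assumptions and not the authors' implementation: the function names, the dot-product scoring, the fixed `query` vector (which would be learned in a real network), and the Euclidean distance are all hypothetical choices made for illustration. The Bi-GRU encoder and the weakened decoder are not modeled here.

```python
import math
from collections import Counter


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Pool per-timestep hidden states into one feature vector.

    Assumption: each state is scored by its dot product with `query`,
    a stand-in for attention parameters a trained encoder would learn.
    """
    scores = [sum(h * q for h, q in zip(state, query)) for state in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    return [
        sum(w * state[d] for w, state in zip(weights, hidden_states))
        for d in range(dim)
    ]


def knn_predict(train_features, train_labels, feature, k=1):
    """Majority vote among the k nearest training features.

    Assumption: Euclidean distance; the abstract does not name a metric.
    """
    ranked = sorted(
        (math.dist(f, feature), label)
        for f, label in zip(train_features, train_labels)
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]
```

In use, `attention_pool` would be applied to the hidden states an encoder produces for one skeleton sequence, and `knn_predict` would compare the pooled feature against pooled features of sequences whose labels are known. Labels are needed only at this final lookup step, which matches the abstract's description of training the network by reconstruction alone.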

