Joint Learning on the Hierarchy Representation for Fine-Grained Human Action Recognition

Page view(s)

Checked on Aug 10, 2025

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/17921

Title:

Joint Learning on the Hierarchy Representation for Fine-Grained Human Action Recognition

Journal Title:

2021 IEEE International Conference on Image Processing (ICIP)

DOI:

10.1109/ICIP42928.2021.9506157

Publication URL:

http://dx.doi.org/10.1109/icip42928.2021.9506157

Authors:

Mei Chee Leong, Hui Li Tan, Haosong Zhang, Liyuan Li, Feng Lin, Joo Hwee Lim

Keywords:

action recognition, Multi-task learning, fine-grained action recognition, joint representation

Publication Date:

23 August 2021

Citation:

Leong, M. C., Tan, H. L., Zhang, H., Li, L., Lin, F., Lim, J. H. (2021). Joint Learning on the Hierarchy Representation for Fine-Grained Human Action Recognition. 2021 IEEE International Conference on Image Processing (ICIP). doi:10.1109/icip42928.2021.9506157

Abstract:

Fine-grained human action recognition is a core research topic in computer vision. Inspired by the recently proposed hierarchy representation of fine-grained actions in FineGym and SlowFast network for action recognition, we propose a novel multi-task network which exploits the FineGym hierarchy representation to achieve effective joint learning and prediction for fine-grained human action recognition. The multi-task network consists of three pathways of SlowOnly networks with gradually increased frame rates for events, sets and elements of fine-grained actions, followed by our proposed integration layers for joint learning and prediction. It is a two-stage approach, where it first learns deep feature representation at each hierarchical level, and is followed by feature encoding and fusion for multi-task learning. Our empirical results on the FineGym dataset achieve a new state-of-the-art performance, with 91.80% Top-1 accuracy and 88.46% mean accuracy for element actions, which are 3.40% and 7.26% higher than the previous best results.

License type:

Publisher Copyright

Funding Info:

This research / project is supported by the NA - AME Programmatic Funding Scheme
Grant Reference no. : A18A2b0046

Description:

© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

URI:

https://oar.a-star.edu.sg/communities-collections/articles/17921

ISSN:

2381-8549

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
joint-learning-on-the-hierarchy-representation-for-fine-grained-human-action-recognition.pdf	382.87 KB	PDF	Open