Long-term Action Forecasting Using Multi-headed Attention-based Variational Recurrent Neural Networks

Page view(s)
41
Checked on Aug 10, 2025
Long-term Action Forecasting Using Multi-headed Attention-based Variational Recurrent Neural Networks
Title:
Long-term Action Forecasting Using Multi-headed Attention-based Variational Recurrent Neural Networks
Journal Title:
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Keywords:
Publication Date:
23 August 2022
Citation:
Loh, S. B., Roy, D., & Fernando, B. (2022). Long-term Action Forecasting Using Multi-headed Attention-based Variational Recurrent Neural Networks. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). https://doi.org/10.1109/cvprw56347.2022.00270
Abstract:
Systems developed for predicting both the action and the amount of time someone might take to perform that action need to be aware of the inherent uncertainty in what humans do. Here, we present a novel hybrid generative model for action anticipation that attempts to capture the uncertainty in human actions. Our model uses a multi-headed attention-based variational generative model for action prediction (MAVAP), and Gaussian log-likelihood maximization to predict the corresponding action's duration. During training, we optimise three losses: a variational loss, a negative log-likelihood loss, and a discriminative cross-entropy loss. We evaluate our model on benchmark datasets (i.e., Breakfast and 50Salads) for action forecasting tasks and demonstrate improvements over prior methods using both ground truth observations and predicted features from an action segmentation network (i.e., MS-TCN++). We also show that factorizing the latent space across multiple Gaussian heads predicts better plausible future action sequences compared to a single Gaussian.
License type:
Publisher Copyright
Funding Info:
This research / project is supported by the National Research Foundation - AI Singapore Program
Grant Reference no. : AISG-RP-2019-010
Description:
© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
ISBN:
NONE
Files uploaded:

File Size Format Action
cvpr-abaw-laha.pdf 538.12 KB PDF Open