Understanding the Nature of First-Person Videos: Characterization and Classification using Low-Level Features

Understanding the Nature of First-Person Videos: Characterization and Classification using Low-Level Features
Title:
Understanding the Nature of First-Person Videos: Characterization and Classification using Low-Level Features
Other Titles:
DOI:
Keywords:
Publication Date:
07 May 2021
Citation:
Abstract:
First-person view (FPV) video data is set to proliferate rapidly, due to many consumer wearable-camera devices coming onto the market. Research into FPV (or "egocentric") vision is also becoming more common in the computer vision community. However, it is still unclear what the fundamental characteristics of such data are. How is it really different from third-person view (TPV) data? Can all FPV data be treated the same? In this first attempt to approach these questions in a quantitative and empirical manner, we analyzed a meta-collection of 21 FPV and TPV datasets totaling more than 165 hours of video. We performed the first quantitative characterization of FPV videos over multiple datasets, encompassing virtually all available FPV datasets. Validating this characterization, linear classifiers trained on low-level features to perform FPV-versus-TPV classification achieved good baseline performance. Accuracy peaked at 81% for 2-minute clips, but 67% accuracy was achieved even with 1-second clips. Our low-level features are fast to compute and do not require annotation. Overall, our work uncovered insights regarding the basic nature and characteristics of FPV data.
License type:
PublisherCopyrights
Funding Info:
A*STAR JCO VIP grant
Description:
ISBN:

Files uploaded:

File Size Format Action
workshop-cam-ready-verified.pdf 459.32 KB PDF Open