approved
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

Human comprehension of a video stream is naturally broad: in a few instants, we are able to understand what is happening, the relevance and relationship of objects, and forecast what will follow in the near future, everything all at once. We believe that - to effectively transfer such an holistic perception to intelligent machines - an important role is played by learning to correlate concepts and to abstract knowledge coming from different tasks, to synergistically exploit them when learning novel skills. To accomplish this, we seek for a unified approach to video understanding which combines shared temporal modelling of human actions with minimal overhead, to support multiple downstream tasks and enable cooperation when learning novel skills. We then propose EgoPack, a solution that creates a collection of task perspectives that can be carried across downstream tasks and used as a potential source of additional insights, as a backpack of skills that a robot can carry around and use when needed. We demonstrate the effectiveness and efficiency of our approach on four Ego4d benchmarks, outperforming current state-of-the-art methods.

Tags
Data and Resources
To access the resources you must log in
  • EgoPack

    The resource: 'EgoPack' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Associate Project FAIR
Availability On-Line
Basic rights Download
CreationDate 2024-07-11 18:05
Creator pistilli, francesca, [email protected], orcid.org/0000-0001-9372-032X
Field/Scope of use Research only
Group Others
License term 2024-07-11 18:05/2034-07-11 18:05
Owner pistilli, francesca, [email protected], orcid.org/0000-0001-9372-032X
SoBigData Node SoBigData EU
Sublicense rights No
Territory of use World Wide
Thematic Cluster Other
system:type Method
Management Info
Field Value
Author PISTILLI FRANCESCA
Maintainer PISTILLI FRANCESCA
Version 1
Last Updated 23 November 2024, 16:00 (CET)
Created 23 November 2024, 16:00 (CET)