CaRe-Ego: Contact-aware relationship modeling for egocentric interactive hand-object segmentation

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Egocentric Interactive hand-object segmentation (EgoIHOS) requires segmenting hands and interacting objects in egocentric images, which is crucial for understanding human behaviors in assistive systems. Current methods often overlook the essential interactive relationships between hands and objects, or merely establish coarse hand-object associations to recognize targets, leading to suboptimal accuracy. To address this issue, we propose a novel CaRe-Ego method that achieves state-of-the-art performance by emphasizing contact between hands and objects from two aspects. First, to explicitly model hand-object interactive relationships, we introduce a Hand-guided Object Feature Enhancer (HOFE), which utilizes hand features as prior knowledge to extract more contact-relevant and distinguishing object features. Second, to promote the network concentrating on hand-object interactions, we design a Contact-Centric Object Decoupling Strategy (CODS) to reduce interference during training by disentangling the overlapping attributes of the segmentation targets, allowing the model to capture specific contact-aware features associated with each hand. Experiments on various in-domain and out-of-domain test sets show that Care-Ego significantly outperforms existing methods while exhibiting robust generalization capability. Codes are publicly available at https://github.com/yuggiehk/CaRe-Ego/.

Original languageEnglish
Article number129148
JournalExpert Systems with Applications
Volume296
DOIs
Publication statusPublished - Jul 2025

Keywords

  • Attention mechanism
  • Egocentric
  • Hand-object interaction
  • Relationship modeling

ASJC Scopus subject areas

  • General Engineering
  • Computer Science Applications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'CaRe-Ego: Contact-aware relationship modeling for egocentric interactive hand-object segmentation'. Together they form a unique fingerprint.

Cite this