Abstract: Human–Object Interaction Detection (HOID) has benefited greatly from advances in modern detection architectures and vision-language foundation models. In this paper, we present two ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results