As the intermediate-level representations bridging the two levels, struc...
Given input images, scene graph generation (SGG) aims to produce
compreh...
Detecting human-object interactions (HOI) is an important step toward a
...
Visual relationship detection aims to reason over relationships among sa...
Indoor localization is a fundamental problem in location-based applicati...