Abstract: Single object tracking aims to localize target object with specific reference modalities (bounding box, natural language or both) in a sequence of specific video modalities (RGB, RGB+Depth, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results