Negative frames matter in egocentric visual query 2d localization

Publication
arXiv preprint arXiv:2208.01949