Train new object detector without bounding box annotations using captioned images

Train new object detector without bounding box annotations using captioned images

Towards Open Vocabulary Object Detection without Human-provided Bounding Boxes
arXiv paper abstract https://arxiv.org/abs/2111.09452
arXiv PDF paper https://arxiv.org/pdf/2111.09452.pdf

… in object detection, most existing methods are limited to a small set of object categories, due to the tremendous human effort needed for…