Segment objects in videos using only bounding boxes by predicting targets in the training set with PM-VIS
PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
arXiv paper abstract https://arxiv.org/abs/2404.13863
arXiv PDF paper https://arxiv.org/pdf/2404.13863.pdf
… Box-supervised Video Instance Segmentation (VIS) methods have emerged as a viable solution to mitigate the labor-intensive annotation process.
… Inspired by … Segment Anything Model (SAM), … introduce a novel approach that aims at harnessing instance box annotations from multiple perspectives to generate high-quality instance pseudo masks
… leverage ground-truth boxes to create three types of pseudo masks using the HQ-SAM model, the box-supervised VIS model (IDOL-BoxInst), and the VOS model (DeAOT) separately, along with three corresponding optimization mechanisms.
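To make the idea of checking pseudo masks against ground-truth boxes concrete, here is a minimal sketch in plain Python. It is not the paper's optimization mechanism, which the excerpt above does not describe. It only illustrates one simple consistency check, under the assumption that a good pseudo mask should have a tight bounding box that closely matches the annotated box. The names mask_to_box, box_iou, and pick_pseudo_mask are hypothetical, and the candidate masks are toy data standing in for outputs of models such as HQ-SAM, IDOL-BoxInst, and DeAOT.

```python
from typing import Dict, List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), inclusive pixel coordinates
Mask = List[List[int]]           # rows of 0/1 values


def mask_to_box(mask: Mask) -> Optional[Box]:
    """Return the tight bounding box of the foreground pixels, or None for an empty mask."""
    xs = [x for row in mask for x, v in enumerate(row) if v]
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes with inclusive pixel coordinates."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0]) + 1
    inter_h = min(a[3], b[3]) - max(a[1], b[1]) + 1
    if inter_w <= 0 or inter_h <= 0:
        return 0.0
    inter = inter_w * inter_h
    area_a = (a[2] - a[0] + 1) * (a[3] - a[1] + 1)
    area_b = (b[2] - b[0] + 1) * (b[3] - b[1] + 1)
    return inter / (area_a + area_b - inter)


def pick_pseudo_mask(candidates: Dict[str, Mask], gt_box: Box) -> Tuple[Optional[str], float]:
    """Pick the candidate whose tight box best matches the ground-truth box (assumed criterion)."""
    best_name, best_score = None, 0.0
    for name, mask in candidates.items():
        box = mask_to_box(mask)
        score = box_iou(box, gt_box) if box is not None else 0.0
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


# Toy 4x4 masks; the source names are placeholders, not real model outputs.
candidates = {
    "source_a": [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]],
    "source_b": [[1, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]],
}
best, score = pick_pseudo_mask(candidates, gt_box=(1, 1, 2, 2))
```

A box-level check like this cannot tell two masks apart when both fill the box equally well, so a real pipeline would likely need extra signals; this sketch only shows how the annotated box can act as a cheap sanity test.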
… introduce two ground-truth data filtering methods, assisted by … pseudo masks, to … enhance the training dataset quality and improve the performance of fully supervised VIS
… To … capitalize on the … Pseudo Masks, … introduce a novel algorithm, PM-VIS, to integrate mask losses into IDOL-BoxInst.
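As a rough illustration of what adding a mask loss to a box-supervised objective can look like, the sketch below computes a Dice loss between a predicted mask and a pseudo mask and adds it to an existing loss value. This is an assumption for illustration only: the excerpt does not say which mask losses PM-VIS uses or how they are weighted. The names dice_loss and total_loss, the mask_weight parameter, and the numeric values are all hypothetical.

```python
from typing import Sequence


def dice_loss(pred: Sequence[float], target: Sequence[float], eps: float = 1e-6) -> float:
    """Dice loss on flattened masks: 0 for a perfect match, close to 1 for no overlap."""
    inter = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 1.0 - (2.0 * inter + eps) / (total + eps)


def total_loss(box_supervised_loss: float, mask_loss: float, mask_weight: float = 1.0) -> float:
    """Add a weighted pseudo-mask loss to an existing box-supervised loss (assumed form)."""
    return box_supervised_loss + mask_weight * mask_loss


# Flattened toy masks: predicted probabilities versus a binary pseudo mask.
pred = [0.9, 0.1, 0.8, 0.0]
pseudo = [1.0, 0.0, 1.0, 0.0]
loss = total_loss(box_supervised_loss=0.5, mask_loss=dice_loss(pred, pseudo), mask_weight=2.0)
```

Dice loss is a common choice for segmentation because it is insensitive to the large number of background pixels, which is why it is used here as a stand-in.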
… PM-VIS model … demonstrates strong ability in instance mask prediction, achieving state-of-the-art performance …
Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website