Segment objects in videos using only bounding boxes by predicting targets in the training set with PM-VIS

PM-VIS: High-Performance Box-Supervised Video Instance Segmentation
arXiv paper abstract https://arxiv.org/abs/2404.13863
arXiv PDF paper https://arxiv.org/pdf/2404.13863.pdf

… Box-supervised Video Instance Segmentation (VIS) methods have emerged as a viable solution to mitigate the labor-intensive annotation process.

… Inspired by … Segment Anything Model (SAM), … introduce a novel approach that aims at harnessing instance box annotations from multiple perspectives to generate high-quality instance pseudo masks

… leverage ground-truth boxes to create three types of pseudo masks using the HQ-SAM model, the box-supervised VIS model (IDOL-BoxInst), and the VOS model (DeAOT) separately, along with three corresponding optimization mechanisms.
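The excerpt does not spell out the three optimization mechanisms, so the following is only a minimal sketch of one plausible way to compare pseudo masks from several sources against a ground-truth box: take the tight box around each candidate mask and score it by IoU with the annotated box. The helper names (`mask_to_box`, `box_iou`, `pick_best_pseudo_mask`) and the selection rule are assumptions for illustration, not the paper's procedure.

```python
from typing import Dict, List, Optional, Tuple

Mask = List[List[int]]            # binary mask as rows of 0/1
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max), inclusive pixels


def mask_to_box(mask: Mask) -> Optional[Box]:
    """Return the tight box around all foreground pixels, or None if empty."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, v in enumerate(row) if v]
    if not ys:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two inclusive pixel boxes."""
    iw = min(a[2], b[2]) - max(a[0], b[0]) + 1
    ih = min(a[3], b[3]) - max(a[1], b[1]) + 1
    inter = max(iw, 0) * max(ih, 0)
    area_a = (a[2] - a[0] + 1) * (a[3] - a[1] + 1)
    area_b = (b[2] - b[0] + 1) * (b[3] - b[1] + 1)
    return inter / (area_a + area_b - inter)


def pick_best_pseudo_mask(candidates: Dict[str, Mask], gt_box: Box) -> str:
    """Hypothetical rule: keep the candidate whose tight box best fits gt_box."""
    def score(name: str) -> float:
        box = mask_to_box(candidates[name])
        return box_iou(box, gt_box) if box is not None else 0.0
    return max(candidates, key=score)
```

In this sketch `candidates` would map a source name (for example "HQ-SAM", "IDOL-BoxInst", "DeAOT") to the mask that source produced for one instance in one frame; how the masks themselves are generated is outside the sketch.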

… introduce two ground-truth data filtering methods, assisted by … pseudo masks, to … enhance the training dataset quality and improve the performance of fully supervised VIS
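The abstract does not describe the two filtering methods, so here is only a rough sketch of the general idea of using a pseudo mask to vet a ground-truth annotation: compute the mask IoU between the two and drop samples that disagree too much. The field names (`gt_mask`, `pseudo_mask`), the function names, and the 0.5 threshold are assumptions made for illustration.

```python
from typing import Dict, List

Mask = List[List[int]]  # binary mask as rows of 0/1


def mask_iou(a: Mask, b: Mask) -> float:
    """Pixel-wise IoU of two binary masks of the same shape."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for p, q in zip(row_a, row_b):
            inter += 1 if (p and q) else 0
            union += 1 if (p or q) else 0
    return inter / union if union else 0.0


def filter_annotations(samples: List[Dict[str, Mask]],
                       threshold: float = 0.5) -> List[Dict[str, Mask]]:
    """Keep samples whose annotated mask agrees with the pseudo mask.

    Hypothetical rule: agreement means mask IoU >= threshold.
    """
    return [
        s for s in samples
        if mask_iou(s["gt_mask"], s["pseudo_mask"]) >= threshold
    ]
```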

… To … capitalize on the … Pseudo Masks, … introduce a novel algorithm, PM-VIS, to integrate mask losses into IDOL-BoxInst.
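The excerpt does not say which mask losses PM-VIS adds. As a hedged illustration of what supervising a box-supervised model with pseudo masks can look like, the sketch below uses a Dice loss, a common choice for mask supervision, and adds it to an existing box-supervised loss value. The names `dice_loss`, `total_loss`, and `mask_weight` are hypothetical, and the code is plain Python rather than the tensor code a real training loop would use.

```python
from typing import List

Grid = List[List[float]]  # predicted probabilities or 0/1 pseudo mask values


def dice_loss(pred: Grid, target: Grid, eps: float = 1e-6) -> float:
    """Dice loss: 0 when pred and target overlap fully, near 1 when disjoint."""
    inter = 0.0
    total = 0.0
    for row_p, row_t in zip(pred, target):
        for p, t in zip(row_p, row_t):
            inter += p * t
            total += p + t
    return 1.0 - (2.0 * inter + eps) / (total + eps)


def total_loss(box_loss: float, pred: Grid, pseudo_mask: Grid,
               mask_weight: float = 1.0) -> float:
    """Assumed combination: existing box-supervised loss plus a weighted
    mask loss computed against the pseudo mask."""
    return box_loss + mask_weight * dice_loss(pred, pseudo_mask)
```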

… PM-VIS model … demonstrates strong ability in instance mask prediction, achieving state-of-the-art performance …

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

Photo by Laurenz Notter on Unsplash

AI News Clips by Morris Lee: News to help your R&D

A computer vision consultant in artificial intelligence and related high-tech fields for 37+ years. An innovator with 66+ patents, ready to help a firm's R&D.