Weakly supervised segmentation using a multi-class token transformer with MCTformer+

MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
arXiv paper abstract https://arxiv.org/abs/2308.03005
arXiv PDF paper https://arxiv.org/pdf/2308.03005.pdf

This paper proposes a novel transformer-based framework that aims to enhance weakly supervised semantic segmentation (WSSS) by generating accurate class-specific object localization maps as pseudo labels.

… explore the potential of the transformer model to capture class-specific attention for class-discriminative object localization by learning multiple class tokens.

… introduce a Multi-Class Token transformer, which incorporates multiple class tokens to enable class-aware interactions with the patch tokens.

… Contrastive-Class-Token (CCT) module is proposed to enhance the learning of discriminative class tokens, enabling the model to better capture the unique characteristics and properties of each class.
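To make the idea of "discriminative class tokens" concrete, here is a minimal sketch of one way to penalize class tokens that look alike. It is an assumption for illustration, not the paper's CCT loss: the function name `class_token_separation_penalty` is hypothetical, and mean pairwise cosine similarity is a swapped-in stand-in for a contrastive objective.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)  # epsilon avoids division by zero


def class_token_separation_penalty(class_tokens):
    """Mean cosine similarity over all pairs of distinct class tokens.

    Hypothetical stand-in for a contrastive objective (not the paper's
    formulation): a lower value means the tokens point in more
    different directions.
    """
    n = len(class_tokens)
    if n < 2:
        return 0.0
    total = 0.0
    pairs = 0
    for i in range(n):
        for j in range(i + 1, n):
            total += cosine(class_tokens[i], class_tokens[j])
            pairs += 1
    return total / pairs


# Toy class tokens: three near-identical vectors versus three spread-out ones.
similar = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.05]]
distinct = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
print(class_token_separation_penalty(similar))
print(class_token_separation_penalty(distinct))
```

Minimizing a term like this during training would push the class tokens apart, which is the intuition behind learning one distinguishable token per class.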

As a result, class-discriminative object localization maps can be effectively generated by leveraging the class-to-patch attentions associated with different class tokens.
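The step from class-to-patch attention to localization maps can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it uses a single attention step with raw tokens as queries and keys (no learned projections, heads, or layers), and the function name `class_to_patch_maps` and the toy values are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]


def class_to_patch_maps(class_tokens, patch_tokens, grid_h, grid_w):
    """Return one grid_h x grid_w attention map per class token.

    Assumption: queries and keys are the raw tokens themselves; a real
    transformer would apply learned projections and multiple heads.
    """
    assert len(patch_tokens) == grid_h * grid_w
    dim = len(class_tokens[0])
    n_cls = len(class_tokens)
    tokens = class_tokens + patch_tokens  # class tokens first, then patches
    maps = []
    for query in class_tokens:
        # Scaled dot-product scores of this class token against every token.
        scores = [
            sum(a * b for a, b in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        attn = softmax(scores)
        patch_attn = attn[n_cls:]  # keep only the class-to-patch part
        # Reshape the flat patch attention back into the image grid.
        maps.append(
            [patch_attn[r * grid_w:(r + 1) * grid_w] for r in range(grid_h)]
        )
    return maps


# Toy input: 2 classes, a 2x2 patch grid, 2-dimensional tokens.
class_tokens = [[1.0, 0.0], [0.0, 1.0]]
patch_tokens = [[3.0, 0.0], [0.0, 3.0], [0.0, 0.0], [0.0, 0.0]]
maps = class_to_patch_maps(class_tokens, patch_tokens, grid_h=2, grid_w=2)
for class_index, class_map in enumerate(maps):
    print(class_index, class_map)
```

In this toy input each class token attends most strongly to the patch whose features align with it, so the two maps peak at different grid cells. Thresholding or normalizing such maps is one common way to turn them into pseudo labels.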

… resulting in significantly improved WSSS performance …

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

Photo by Mae Mu on Unsplash

AI News Clips by Morris Lee: News to help your R&D

Written by AI News Clips by Morris Lee: News to help your R&D

A computer vision consultant in artificial intelligence and related high-tech fields for 37+ years. An innovator with 66+ patents, ready to help a firm's R&D.
