Explain a fine-grained image classification result by searching image for class with INTR

--

Explain a fine-grained image classification result by searching image for class with INTR

A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis
arXiv paper abstract https://arxiv.org/abs/2311.04157
arXiv PDF paper https://arxiv.org/pdf/2311.04157.pdf
GitHub https://github.com/Imageomics/INTR

… present a novel usage of Transformers to make image classification interpretable.

Unlike mainstream classifiers that wait until the last fully-connected layer to incorporate class information to make predictions, … investigate a proactive approach, asking each class to search for itself in an image.

… realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR).

… learn ``class-specific’’ queries (one for each class) as input to the decoder, enabling each class to localize its patterns in an image via cross-attention.

… show that INTR intrinsically encourages each class to attend distinctively; the cross-attention weights thus provide a faithful interpretation of the prediction.

… INTR could identify different ``attributes’’ of a class, making it particularly suitable for fine-grained classification and analysis, which …

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

Photo by Product School on Unsplash

--

--

AI News Clips by Morris Lee: News to help your R&D
AI News Clips by Morris Lee: News to help your R&D

Written by AI News Clips by Morris Lee: News to help your R&D

A computer vision consultant in artificial intelligence and related hitech technologies 37+ years. Am innovator with 66+ patents and ready to help a firm's R&D.

No responses yet