MobileViT: an accurate, light-weight, mobile-friendly vision transformer

MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
arXiv paper abstract https://arxiv.org/abs/2110.02178
arXiv PDF paper https://arxiv.org/pdf/2110.02178.pdf

Light-weight convolutional neural networks (CNNs) are the de-facto for mobile vision tasks. … However, these networks are spatially local.

To learn global representations, self-attention-based vision transformers (ViTs) have been adopted. Unlike CNNs, ViTs are heavy-weight.
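The local-versus-global contrast above can be seen in a toy sketch. This is only an illustration, not MobileViT: it assumes scalar tokens, a fixed 3-tap kernel, and attention with identity projections, and the helper names `conv1d` and `self_attention` are hypothetical. Changing one input token alters only the neighbouring convolution outputs, while every attention output changes.

```python
import math

def conv1d(x, kernel):
    """Zero-padded 1D convolution: each output sees only len(kernel) neighbours."""
    k = len(kernel) // 2
    padded = [0.0] * k + list(x) + [0.0] * k
    return [sum(w * padded[i + j] for j, w in enumerate(kernel))
            for i in range(len(x))]

def self_attention(x):
    """Toy single-head attention over scalar tokens (identity projections, an assumption)."""
    out = []
    for q in x:
        scores = [q * key for key in x]
        m = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append(sum(w / total * v for w, v in zip(weights, x)))
    return out

kernel = [0.25, 0.5, 0.25]
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x_changed = x[:-1] + [60.0]  # perturb only the last token

# Which output positions react to the change in the last input token?
conv_diff = [abs(a - b) > 1e-9
             for a, b in zip(conv1d(x, kernel), conv1d(x_changed, kernel))]
attn_diff = [abs(a - b) > 1e-9
             for a, b in zip(self_attention(x), self_attention(x_changed))]

print("conv outputs changed:     ", conv_diff)
print("attention outputs changed:", attn_diff)
```

Only the last two convolution outputs move, because the 3-tap kernel never reaches further than one neighbour, whereas attention mixes every token into every output.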

… introduce MobileViT, a light-weight and general-purpose vision transformer for mobile devices. … presents … transformers as convolutions.

… MobileViT significantly outperforms CNN- and ViT-based networks across different tasks and datasets.

On the ImageNet-1k dataset, MobileViT achieves top-1 accuracy of 78.4% with about 6 million parameters, which is 3.2% and 6.2% more accurate than MobileNetv3 (CNN-based) and DeIT (ViT-based), respectively, for a similar number of parameters.
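As a quick check of what those gaps imply, the arithmetic can be worked out as below. This assumes the 3.2% and 6.2% figures are absolute percentage-point differences in top-1 accuracy; the resulting baseline numbers are derived from the figures quoted above rather than quoted themselves.

```python
# Figures quoted in the abstract.
mobilevit_top1 = 78.4  # top-1 accuracy in percent
gains = {"MobileNetv3": 3.2, "DeIT": 6.2}  # assumed to be percentage points

# Implied baseline accuracy = MobileViT accuracy minus the stated gap.
implied = {name: round(mobilevit_top1 - gain, 1) for name, gain in gains.items()}

for name, acc in implied.items():
    print(f"{name}: about {acc}% top-1 (implied)")
```

Under that assumption, the comparison models sit at roughly 75.2% (MobileNetv3) and 72.2% (DeIT) top-1.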

On the MS-COCO object detection task, MobileViT is 5.7% more accurate than MobileNetv3 for a similar number of parameters.

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

Photo by David Švihovec on Unsplash

A computer vision consultant in artificial intelligence and related high-tech fields for 37+ years. An innovator with 66+ patents, ready to help a firm's R&D.
