3D semantic segmentation using RGB and depth with PDCNet

--

3D semantic segmentation using RGB and depth with PDCNet

Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
arXiv paper abstract https://arxiv.org/abs/2302.11951
arXiv PDF paper https://arxiv.org/pdf/2302.11951.pdf

RGB-D semantic segmentation can be advanced with convolutional neural networks due to the availability of Depth data.

Although objects cannot be easily discriminated by just the 2D appearance, with the local pixel difference and geometric patterns in Depth, they can be well separated in some cases.

… propose a Pixel Difference Convolutional Network (PDCNet) to capture detailed intrinsic patterns by aggregating both intensity and gradient information in the local range for Depth data and global range for RGB data, respectively.

… For … Depth … propose a Pixel Difference Convolution (PDC) to consider local and detailed geometric information in Depth data via aggregating both intensity and gradient information.

For … RGB … contribute a lightweight Cascade Large Kernel (CLK) to extend PDC, namely CPDC, to enjoy global contexts for RGB data and further boost performance.

… PDCNet achieves state-of-the-art performance for the semantic segmentation task.

Stay up to date. Subscribe to my posts https://morrislee1234.wixsite.com/website/contact
Web site with my other posts by category https://morrislee1234.wixsite.com/website

LinkedIn https://www.linkedin.com/in/morris-lee-47877b7b

Photo by Annie Spratt on Unsplash

--

--

AI News Clips by Morris Lee: News to help your R&D
AI News Clips by Morris Lee: News to help your R&D

Written by AI News Clips by Morris Lee: News to help your R&D

A computer vision consultant in artificial intelligence and related hitech technologies 37+ years. Am innovator with 66+ patents and ready to help a firm's R&D.

No responses yet