
DMPNet: Distributed Multi-Scale Pyramid Network for Real-Time Semantic Segmentation


The overall architecture of the proposed DMPNet along with its building blocks, i.e., EMCA module and PPM based DMPP strategy. The three PPMs operate at three different s...

Abstract:

In semantic segmentation, an input image is partitioned into multiple meaningful segments, each corresponding to a specific object or region. Multi-scale context plays a vital role in the accurate recognition of objects of different sizes and hence is key to improving overall accuracy. To capture this context, we introduce a novel strategy called Distributed Multi-scale Pyramid Pooling (DMPP), which extracts multi-scale context at multiple levels of the feature hierarchy. More specifically, we employ Pyramid Pooling Modules (PPMs) in a distributed fashion after each of the three stages of the encoding phase. This enhances the feature representation capability of the network and leads to better performance. To extract context at a more granular level, we propose an Efficient Multi-scale Context Aggregation (EMCA) module, which uses a combination of small and large kernels with large and small dilation rates, respectively. This alleviates the problem of sparse sampling and leads to consistent recognition of different regions. Apart from model accuracy, small model size and efficient execution are critically important for real-time mobile applications. To this end, we employ a resource-friendly combination of depthwise and factorized convolutions in the EMCA module to drastically reduce the number of parameters without significantly compromising accuracy. Based on the EMCA module and DMPP, we propose a lightweight, real-time Distributed Multi-scale Pyramid Network (DMPNet) that achieves an excellent accuracy-efficiency trade-off. We also conducted extensive experiments on two driving datasets (Cityscapes and CamVid) and a general-purpose dataset (ADE20K) to show the effectiveness of the proposed method.
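The two ideas behind the EMCA module as described in the abstract, pairing kernel size with dilation rate and replacing dense convolutions with depthwise and factorized ones, can be illustrated with a little arithmetic. The following is a minimal sketch, not the authors' implementation: the helper names, the channel width of 64, and the kernel/dilation pairs are hypothetical values chosen for illustration, and biases and normalization layers are ignored.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Extent along one axis covered by a dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a dense k x k convolution (no bias)."""
    return c_in * c_out * k * k


def depthwise_factorized_params(channels: int, k: int) -> int:
    """Weights in a depthwise k x 1 convolution followed by a depthwise 1 x k one."""
    return 2 * channels * k


def pointwise_params(c_in: int, c_out: int) -> int:
    """Weights in a 1 x 1 convolution that mixes channels (no bias)."""
    return c_in * c_out


# Hypothetical settings: a small kernel with a large dilation and a large
# kernel with a small dilation can cover the same extent, but the larger
# kernel samples that extent more densely.
small_sparse = effective_kernel_size(kernel_size=3, dilation=4)
large_dense = effective_kernel_size(kernel_size=5, dilation=2)

# Hypothetical channel width of 64 with a 3 x 3 kernel.
dense = standard_conv_params(64, 64, 3)
light = depthwise_factorized_params(64, 3) + pointwise_params(64, 64)

print(small_sparse, large_dense)  # 9 9
print(dense, light)               # 36864 4480
```

Under these assumed settings the depthwise factorized variant plus a pointwise convolution needs roughly an eighth of the weights of the dense convolution, which is the kind of saving the abstract refers to. The actual reduction in DMPNet depends on the layer configuration given in the paper.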
Published in: IEEE Access (Volume: 12)
Page(s): 16573 - 16585
Date of Publication: 29 January 2024
Electronic ISSN: 2169-3536
