Abstract
Semantic segmentation is a class of methods to classify each pixel in an image into semantic classes, which is critical for autonomous vehicles and surgery systems. Crossentropy (CE) loss-based deep neural networks (DNN) achieved great success w.r.t. the accuracy-based metrics, e.g., mean Intersection-over Union. However, the CE loss has a limitation in that it ignores varying degrees of severity of pair-wise misclassified results. For instance, classifying
a car into the road is much more terrible than recognizing it as a bus. To sidestep this, in this work, we propose to incorporate the severity-aware inter-class correlation into our Wasserstein training framework by configuring its ground distance matrix. In addition, our method can adaptively learn the ground metric in a high-fidelity simulator, following a reinforcement alternative optimization scheme. We evaluate our method using the CARLA simulator with the Deeplab backbone, demonstraing that our method significantly improves the survival time in the CARLA simulator. In addition, our method can be readily applied to existing DNN architectures and algorithms while yielding superior performance. We report results from experiments carried out with the CamVid and Cityscapes datasets.
a car into the road is much more terrible than recognizing it as a bus. To sidestep this, in this work, we propose to incorporate the severity-aware inter-class correlation into our Wasserstein training framework by configuring its ground distance matrix. In addition, our method can adaptively learn the ground metric in a high-fidelity simulator, following a reinforcement alternative optimization scheme. We evaluate our method using the CARLA simulator with the Deeplab backbone, demonstraing that our method significantly improves the survival time in the CARLA simulator. In addition, our method can be readily applied to existing DNN architectures and algorithms while yielding superior performance. We report results from experiments carried out with the CamVid and Cityscapes datasets.
Original language | English |
---|---|
Title of host publication | Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR’2020), Virtual, 14-19 June, 2020 |
Publisher | IEEE Computer Society |
Pages | 12563-12572 |
Number of pages | 10 |
DOIs | |
Publication status | Published - Aug 2020 |
Event | IEEE Conference on Computer Vision and Pattern Recognition 2020 - Duration: 14 Jun 2020 → 19 Jun 2020 http://cvpr2020.thecvf.com/ |
Publication series
Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
Publisher | IEEE Computer Society |
ISSN (Print) | 1063-6919 |
Conference
Conference | IEEE Conference on Computer Vision and Pattern Recognition 2020 |
---|---|
Abbreviated title | IEEE CVPR 2020 |
Period | 14/06/20 → 19/06/20 |
Internet address |
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition