Method Details
Details for method 'Pixel-level Encoding for Instance Segmentation'
Method overview
name | Pixel-level Encoding for Instance Segmentation |
challenge | pixel-level semantic labeling |
details | We predict three encoding channels from a single image using an FCN: semantic labels, depth classes, and an instance-aware representation based on directions towards instance centers. Using low-level computer vision techniques, we obtain pixel-level and instance-level semantic labeling paired with a depth estimate of the instances. |
publication | Pixel-level Encoding and Depth Layering for Instance-level Semantic Labeling J. Uhrig, M. Cordts, U. Franke, and T. Brox GCPR 2016 http://arxiv.org/abs/1604.05096 |
project page / code | |
used Cityscapes data | fine annotations, stereo |
used external data | ImageNet |
runtime | n/a |
subsampling | no |
submission date | April, 2016 |
previous submissions |
Average results
Metric | Value |
---|---|
IoU Classes | 64.3053 |
iIoU Classes | 41.5847 |
IoU Categories | 85.8862 |
iIoU Categories | 73.8804 |
Class results
Class | IoU | iIoU |
---|---|---|
road | 97.3514 | - |
sidewalk | 77.7158 | - |
building | 88.764 | - |
wall | 27.7383 | - |
fence | 40.1327 | - |
pole | 51.4714 | - |
traffic light | 60.0827 | - |
traffic sign | 64.6672 | - |
vegetation | 91.1441 | - |
terrain | 67.6334 | - |
sky | 93.5146 | - |
person | 77.737 | 60.5757 |
rider | 54.1634 | 33.4172 |
car | 92.4148 | 86.7287 |
truck | 33.681 | 19.518 |
bus | 41.9876 | 25.6202 |
train | 42.5086 | 25.7525 |
motorcycle | 52.5489 | 30.5481 |
bicycle | 66.5438 | 50.5171 |
Category results
Category | IoU | iIoU |
---|---|---|
flat | 98.1834 | - |
nature | 90.7538 | - |
object | 59.3101 | - |
sky | 93.5146 | - |
construction | 89.1592 | - |
human | 79.1525 | 62.5841 |
vehicle | 91.1298 | 85.1767 |