Method Details


Details for method 'Pixel-level Encoding for Instance Segmentation'

 

Method overview

name Pixel-level Encoding for Instance Segmentation
challenge pixel-level semantic labeling
details We predict three encoding channels from a single image using an FCN: semantic labels, depth classes, and an instance-aware representation based on directions towards instance centers. Using low-level computer vision techniques, we obtain pixel-level and instance-level semantic labeling paired with a depth estimate of the instances.
publication Pixel-level Encoding and Depth Layering for Instance-level Semantic Labeling
J. Uhrig, M. Cordts, U. Franke, and T. Brox
GCPR 2016
http://arxiv.org/abs/1604.05096
project page / code
used Cityscapes data fine annotations, stereo
used external data ImageNet
runtime n/a
subsampling no
submission date April, 2016
previous submissions

 

Average results

Metric Value
IoU Classes 64.3053
iIoU Classes 41.5847
IoU Categories 85.8862
iIoU Categories 73.8804

 

Class results

Class IoU iIoU
road 97.3514 -
sidewalk 77.7158 -
building 88.764 -
wall 27.7383 -
fence 40.1327 -
pole 51.4714 -
traffic light 60.0827 -
traffic sign 64.6672 -
vegetation 91.1441 -
terrain 67.6334 -
sky 93.5146 -
person 77.737 60.5757
rider 54.1634 33.4172
car 92.4148 86.7287
truck 33.681 19.518
bus 41.9876 25.6202
train 42.5086 25.7525
motorcycle 52.5489 30.5481
bicycle 66.5438 50.5171

 

Category results

Category IoU iIoU
flat 98.1834 -
nature 90.7538 -
object 59.3101 -
sky 93.5146 -
construction 89.1592 -
human 79.1525 62.5841
vehicle 91.1298 85.1767

 

Links

Download results as .csv file

Benchmark page