Method Details


Details for method 'LRR-4x'

 

Method overview

name LRR-4x
challenge pixel-level semantic labeling
details We introduce a CNN architecture that reconstructs high-resolution class label predictions from low-resolution feature maps using class-specific basis functions. Our multi-resolution architecture also uses skip connections from higher resolution feature maps to successively refine segment boundaries reconstructed from lower resolution maps. The model used for this submission is based on VGG-16 and it was trained on the training set (2975 images). The segmentation predictions were not post-processed using CRF. (This is a revision of a previous submission in which we didn't use the correct basis functions; the method name changed from 'LLR-4x' to 'LRR-4x')
publication Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation
Golnaz Ghiasi, Charless C. Fowlkes
ECCV 2016
http://arxiv.org/abs/1605.02264
project page / code https://github.com/golnazghiasi/LRR
used Cityscapes data fine annotations
used external data ImageNet
runtime n/a
subsampling no
submission date July, 2016
previous submissions 1

 

Average results

Metric Value
IoU Classes 69.701
iIoU Classes 47.9965
IoU Categories 88.2083
iIoU Categories 74.7472

 

Class results

Class IoU iIoU
road 97.7154 -
sidewalk 79.9187 -
building 90.74 -
wall 44.3599 -
fence 48.6144 -
pole 58.6358 -
traffic light 68.224 -
traffic sign 71.9998 -
vegetation 92.5444 -
terrain 69.2898 -
sky 94.6755 -
person 81.5746 61.4647
rider 59.9579 40.096
car 93.9684 87.3178
truck 43.6349 31.2077
bus 56.7565 41.8669
train 47.1649 28.3949
motorcycle 54.8321 36.3693
bicycle 69.712 57.2548

 

Category results

Category IoU iIoU
flat 98.4245 -
nature 92.1574 -
object 66.2263 -
sky 94.6755 -
construction 91.1257 -
human 82.3712 63.2768
vehicle 92.4776 86.2175

 

Links

Download results as .csv file

Benchmark page