Solving structured segmentation of aerial images as puzzles

Diego Marcos, Michele Volpi, Devis Tuia

Research output: Chapter in Book/Report/Conference proceedingConference paperAcademicpeer-review

Abstract

Traditional approaches to structured semantic segmentation employ appearance-based classifiers to provide a class-likelihood at each spatial location and then post-process it with Markov Random Fields (MRF) to enforce label smoothness and structure in the output space. The spatial support for such techniques is usually a patch of pixels, which makes the prediction over-smoothed because the borders of objects are not explicitly taken into account. This is further exacerbated by MRF post-processing employing the standard Potts model, which tends to further over-smooth predictions at boundaries. In this paper, we propose a different but related approach: we optimize an energy function finding the optimal combination of small ground truth (GT) tiles from training data over predictions at test time, effectively solving a puzzle. We optimize over a first configuration given by a Convolutional Neural Network (CNN) output.

Original languageEnglish
Title of host publication2016 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2016 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3346-3349
Number of pages4
ISBN (Electronic)9781509033324
ISBN (Print)9781509033331
DOIs
Publication statusPublished - Nov 2016
Externally publishedYes
Event36th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2016 - Beijing, China
Duration: 10 Jul 201615 Jul 2016

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)
Volume2016-November

Conference

Conference36th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2016
Country/TerritoryChina
CityBeijing
Period10/07/1615/07/16

Keywords

  • Land cover
  • Markov Random Fields
  • Semantic segmentation
  • Structured prediction

Fingerprint

Dive into the research topics of 'Solving structured segmentation of aerial images as puzzles'. Together they form a unique fingerprint.

Cite this