Scene Understanding

Project Overview

We are working on models and algorithms for visual perception. Our work has produced widely used convolutional network architectures and the dense random field model. We have also created large-scale datasets and benchmarks for training and evaluating broad-competence visual perception systems.


Tangent Convolutions for Dense Prediction in 3D

Interactive Image Segmentation with Latent Diversity

Playing for Benchmarks

Dilated Residual Networks

Playing for Data: Ground Truth from Computer Games

Feature Space Optimization for Semantic Video Segmentation

Dense Monocular Depth Estimation in Complex Dynamic Scenes

Multi-Scale Context Aggregation by Dilated Convolutions

Parameter Learning and Convergent Inference for Dense Random Fields

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials