New paper: 3D building reconstruction from single street view images using deep learning

A new approach to generate 3D models while tackling the common issue of obstructed objects in street-level imagery.

We are glad to share our new paper:

Pang HE, Biljecki F (2022): 3D building reconstruction from single street view images using deep learning. International Journal of Applied Earth Observation and Geoinformation 112: 102859. 10.1016/j.jag.2022.102859 PDF

This research was led by Pang Hui En as part of her MSc. Congratulations to her on the graduation, and we wish Hui En all the best in her PhD at NTU. 🙌 👏


  • Street-level photographs are now an omnipresent urban dataset.
  • Buildings are often imaged multiple times, but most photos are obstructed.
  • An approach to generate 3D models of buildings from their single views.
  • The method can be aided by building footprints.
  • The resulting models are usable for a variety of use cases.


The abstract follows.

3D building models are an established instance of geospatial information in the built environment, but their acquisition remains complex and topical. Approaches to reconstruct 3D building models often require existing building information (e.g. their footprints) and data such as point clouds, which are scarce and laborious to acquire, limiting their expansion. In parallel, street view imagery (SVI) has been gaining currency, driven by the rapid expansion in coverage and advances in computer vision (CV), but it has not been used much for generating 3D city models. Traditional approaches that can use SVI for reconstruction require multiple images, while in practice, often only few street-level images provide an unobstructed view of a building. We develop the reconstruction of 3D building models from a single street view image using image-to-mesh reconstruction techniques modified from the CV domain. We regard three scenarios: (1) standalone single-view reconstruction; (2) reconstruction aided by a top view delineating the footprint; and (3) refinement of existing 3D models, i.e. we examine the use of SVI to enhance the level of detail of block (LoD1) models, which are common. The results suggest that trained models supporting (2) and (3) are able to reconstruct the overall geometry of a building, while the first scenario may derive the approximate mass of the building, useful to infer the urban form of cities. We evaluate the results by demonstrating their usefulness for volume estimation, with mean errors of less than 10% for the last two scenarios. As SVI is now available in most countries worldwide, including many regions that do not have existing footprint and/or 3D building data, our method can derive rapidly and cost-effectively the 3D urban form from SVI without requiring any existing building information. Obtaining 3D building models in regions that hitherto did not have any, may enable a number of 3D geospatial analyses locally for the first time.


For more information, please see the paper, published open access.

BibTeX citation:

  author = {Pang, Hui En and Biljecki, Filip},
  journal = {International Journal of Applied Earth Observation and Geoinformation},
  title = {{3D building reconstruction from single street view images using deep learning}},
  volume = {112},
  pages = {102859},
  doi = {10.1016/j.jag.2022.102859},
  year = {2022}