Data and code

Open resources from our group

We believe in open science, so we publish most of our code and data openly. We do that mostly through our Github account.

Please see below for a list of resources and projects, especially lab-grown datasets, which we are happy to make public. On this page, we also include open outputs led by collaborators, in which we were involved.

On a related note, you may be interested in our comprehensive survey on open-source software, led by our Winston Yap, published as a review paper in Computers, Environment and Urban Systems.

In general, all resources are released under a liberal licence, enabling unrestricted use as long as you properly attribute them. If you use the code and/or the data for presentations and publications, we kindly ask you to cite the related paper(s) and credit our work. We have provided guidelines to do so in each resource.

The usual disclaimer: while a great deal of effort and care has been put into each project, they are not free of errors, and we cannot be held responsible for the use of code and/or data and any issues that may arise. All code and data are provided as is.

Feel free to contact us for more information, report bugs and errors, or simply to inform us what are you using the data for. We would be pleased to learn how others are using our work. If you are interested in collaborating with us, please get in touch with the lead developer of each resource listed below.

Nighttime Street View Imagery

Short description:A large set of images collected during nighttime to support our explorations on this latent kind of urban visual data.
Lead developer:Zicheng Fan
Further reading:Please read the paper published in Sustainable Cities and Society
Download: Github repo (also uploaded on Mapillary)
Main data source(s):Own data collection
Coverage:Singapore
Citation:
Click to view the BibTeX entry

@article{2024_scs_nightSVI, author = {Fan, Zicheng and Biljecki, Filip}, doi = {10.1016/j.scs.2024.105862}, journal = {Sustainable Cities and Society}, pages = {105862}, title = {Nighttime Street View Imagery: A new perspective for sensing urban lighting landscape}, volume = {116}, year = {2024} }

TreeShadeMapper

Short description:A panorama-based technique to estimate sky view factor and solar irradiance considering transmittance of tree canopies
Lead developer:Kunihiko Fujiwara
Further reading:Please read the paper published in BAE
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2024_bae_svf, author = {Fujiwara, Kunihiko and Ito, Koichi and Ignatius, Marcel and Biljecki, Filip}, doi = {10.1016/j.buildenv.2024.112071}, journal = {Building and Environment}, pages = {112071}, title = {A panorama-based technique to estimate sky view factor and solar irradiance considering transmittance of tree canopies}, volume = {266}, year = {2024} }

Microclimate Vision

Short description:Multimodal prediction of climatic parameters using street-level and satellite imagery
Lead developer:Kunihiko Fujiwara
Further reading:Please read the paper published in SCS
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2024_scs_microclimate_vision, author = {Fujiwara, Kunihiko and Khomiakov, Maxim and Yap, Winston and Ignatius, Marcel and Biljecki, Filip}, doi = {10.1016/j.scs.2024.105733}, journal = {Sustainable Cities and Society}, pages = {105733}, title = {Microclimate Vision: Multimodal prediction of climatic parameters using street-level and satellite imagery}, volume = {114}, year = {2024} }

Global Streetscapes

Short description:A comprehensive dataset of 10 million street-level images across 688 cities for urban science and analytics.
Lead developer:Hou Yujun
Further reading:Please read the paper published in the ISPRS Journal of Photogrammetry and Remote Sensing
Website: Website
Download: Github repo
Main data source(s):Mapillary, KartaView, OpenStreetMap, GADM, … and manual labelling
Coverage:688 cities around the world
Citation:
Click to view the BibTeX entry

@article{2024_global_streetscapes, author = {Hou, Yujun and Quintana, Matias and Khomiakov, Maxim and Yap, Winston and Ouyang, Jiani and Ito, Koichi and Wang, Zeyu and Zhao, Tianhong and Biljecki, Filip}, doi = {10.1016/j.isprsjprs.2024.06.023}, journal = {ISPRS Journal of Photogrammetry and Remote Sensing}, pages = {216-238}, title = {Global Streetscapes – A comprehensive dataset of 10 million street-level images across 688 cities for urban science and analytics}, volume = {215}, year = {2024} }

Urbanity

Short description:A network-based python package to understand and model urban complexity
Lead developer:Winston Yap
Further reading:Please read the paper published in npj Urban Sustainability
Code: Github repo
Data: Figshare
Main data source(s):OpenStreetMap, Mapillary, etc.
Citation:
Click to view the BibTeX entry

@article{2023_npjus_urbanity, author = {Yap, Winston and Stouffs, Rudi and Biljecki, Filip}, doi = {10.1038/s42949-023-00125-w}, journal = {npj Urban Sustainability}, title = {{Urbanity: automated modelling and analysis of multidimensional networks in cities}}, volume = {3}, issue = {45}, year = {2023} }

Predicting building characteristics using graph neural networks and street-level contexts

Short description:This repo is for our paper “Predicting building characteristics using graph neural networks and street-level contexts”
Lead developer:Binyu Lei
Further reading:Please read the paper published in CEUS
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2024_ceus_gnn_building, author = {Lei, Binyu and Liu, Pengyuan and Milojevic-Dupont, Nikola and Biljecki, Filip}, doi = {10.1016/j.compenvurbsys.2024.102129}, journal = {Computers, Environment and Urban Systems}, pages = {102129}, title = {Predicting building characteristics at urban scale using graph neural networks and street-level context}, volume = {111}, year = {2024} }

Explainable spatially explicit geospatial artificial intelligence in urban analytics

Short description:This repo is for our paper “Explainable spatially explicit geospatial artificial intelligence in urban analytics”
Lead developer:Pengyuan Liu
Further reading:Please read the paper published in EPB
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2024_epb_xai, author = {Liu, Pengyuan and Yan, Zhang and Biljecki, Filip}, doi = {10.1177/23998083231204689}, journal = {Environment and Planning B: Urban Analytics and City Science}, pages = {1104–1123}, title = {{Explainable spatially explicit geospatial artificial intelligence in urban analytics}}, volume = {51}, issue = {5}, year = {2024} }

Computer Vision and Graph Models to Predict Outdoor Comfort

Short description:This repo is for our paper “Towards Human-centric Digital Twins: Leveraging Computer Vision and Graph Models to Predict Outdoor Comfort”
Lead developer:Pengyuan Liu
Further reading:Please read the paper published in SCS
Code: Github repo
Main data source(s):Own field survey
Citation:
Click to view the BibTeX entry

@article{2023_scs_human_dt, author = {Liu, Pengyuan and Zhao, Tianhong and Luo, Junjie and Lei, Binyu and Frei, Mario and Miller, Clayton and Biljecki, Filip}, doi = {10.1016/j.scs.2023.104480}, journal = {Sustainable Cities and Society}, pages = {104480}, title = {{Towards Human-centric Digital Twins: Leveraging Computer Vision and Graph Models to Predict Outdoor Comfort}}, volume = {93}, year = {2023} }

InstantCITY - Synthesising morphologically accurate geospatial data for urban form analysis, transfer, and quality control

Short description:Generating vectorised building footprint data from street networks using Generative Adversarial Networks
Lead developer:Abraham Noah Wu
Further reading:Please read the paper published in IJPRS
Code: Github repo
Main data source(s):OpenStreetMap
Citation:
Click to view the BibTeX entry

@article{2023_ijprs_instantcity, author = {Wu, Abraham Noah and Biljecki, Filip}, doi = {10.1016/j.isprsjprs.2022.11.005}, journal = {ISPRS Journal of Photogrammetry and Remote Sensing}, pages = {90-104}, title = {InstantCITY: Synthesising morphologically accurate geospatial data for urban form analysis, transfer, and quality control}, volume = {195}, year = {2023} }

Water View Imagery

Short description:On-water perspective imagery dataset for semantic segmentation of waterscapes.
Lead developer:Junjie Luo
Further reading:Please read the paper published in Ecological Indicators
Download: Github repo
Main data source(s):Mapillary and manual labelling
Coverage:Eight cities: Amsterdam, Bangkok, Chicago, Istanbul, Japan, London, Paris, and Venice
Citation:
Click to view the BibTeX entry

@article{2022_ei_water_view_imagery, author = {Luo, Junjie and Zhao, Tianhong and Cao, Lei and Biljecki, Filip}, doi = {10.1016/j.ecolind.2022.109615}, journal = {Ecological Indicators}, pages = {109615}, title = {Water View Imagery: Perception and evaluation of urban waterscapes worldwide}, volume = {145}, year = {2022} }

Global Urban Road Network Patterns

Short description:Deep learning-based analysis of the urban morphology around the world
Lead developer:Chen Wangyang
Further reading:Please read the paper published in Landscape and Urban Planning
Code: Github repo
Main data source(s):OpenStreetMap
Coverage:144 cities
Citation:
Click to view the BibTeX entry

@article{2024_land_urn, author = {Chen, Wangyang and Huang, Huiming and Liao, Shunyi and Gao, Feng and Biljecki, Filip}, doi = {10.1016/j.landurbplan.2023.104901}, journal = {Landscape and Urban Planning}, title = {{Global urban road network patterns: Unveiling multiscale planning paradigms of 144 cities with a novel deep learning approach}}, volume = {241}, pages = {104901}, year = {2024} }

SVIQC – Street View Imagery Quality Checker

Short description:A toolkit to examine the quality of street view imagery
Lead developer:Hou Yujun
Further reading:Please read the paper published in JAG
Code: Github repo
Main data source(s):Mapillary
Citation:
Click to view the BibTeX entry

@article{2022_jag_svi_quality, year = {2022}, title = {{A comprehensive framework for evaluating the quality of street view imagery}}, author = {Hou, Yujun and Biljecki, Filip}, journal = {International Journal of Applied Earth Observation and Geoinformation}, doi = {10.1016/j.jag.2022.103094}, pages = {103094}, volume = {115} }

Visual soundscapes

Short description:Predicting the urban soundscape from street view imagery
Lead developer:Tianhong Zhao
Further reading:Please read the paper published in CEUS
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2023_ceus_soundscapes, author = {Zhao, Tianhong and Liang, Xiucheng and Tu, Wei and Huang, Zhengdong and Biljecki, Filip}, doi = {10.1016/j.compenvurbsys.2022.101915}, journal = {Computers, Environment and Urban Systems}, pages = {101915}, title = {Sensing urban soundscapes from street view imagery}, volume = {99}, year = {2023} }

GBMI – Global Building Morphology Indicators

GBMI London

Short description:Billions of built form metrics of selected urban areas around the world, together with a database solution to compute them
Lead developer:Yoong Shin Chow
FormatsCSV, Shapefile, Geopackage, GeoTIFF
Further reading:Please read more at our website or in the paper published in Computers, Environment and Urban Systems
Download:Project website
Code:The code used to generate the dataset is available in the Github repo
Main data source(s):OpenStreetMap, GADM
Coverage:Dozens of urban areas around the world
Citation:
Click to view the BibTeX entry

@article{2022_ceus_gbmi, author = {Biljecki, Filip and Chow, Yoong Shin}, doi = {10.1016/j.compenvurbsys.2022.101809}, journal = {Computers, Environment and Urban Systems}, pages = {101809}, title = {Global Building Morphology Indicators}, volume = {95}, year = {2022} }

Roofpedia – solar and green roofs around the world

Density of solar panels in Singapore

Short description:Locations of buildings that have installed photovoltaics or greenery on their rooftops
Lead developer:Abraham Noah Wu
FormatsGeoJSON
Further reading:Please read more at our website or in the paper published in Landscape and Urban Planning
Download: Github repo
Code:The code used to generate the dataset is available in the same Github repo
Main data source(s):Various satellite imagery, OpenStreetMap. All open data
Coverage:17 cities around the world
Citation:
Click to view the BibTeX entry

@article{roofpedia, author = {Abraham Noah Wu and Filip Biljecki}, doi = {10.1016/j.landurbplan.2021.104167}, journal = {Landscape and Urban Planning}, pages = {104167}, title = {Roofpedia: Automatic mapping of green and solar roofs for an open roofscape registry and evaluation of urban sustainability}, url = {https://doi.org/10.1016/j.landurbplan.2021.104167}, volume = {214}, year = 2021 }

Semantic Riverscapes

A section of the Grand Canal in Tianjin

Short description:A semantically annotated UAV oblique image dataset covering an urban river landscape
Lead developer:Junjie Luo
Further reading:Please read the paper published in Landscape and Urban Planning
Download: Github repo
Main data source(s):Own data collection (UAV) and manual labelling
Coverage:Tianjin (China)
Citation:
Click to view the BibTeX entry

@article{2022_land_semantic_riverscapes, year = {2022}, title = {{Semantic Riverscapes: Perception and evaluation of linear landscapes from oblique imagery using computer vision}}, author = {Luo, Junjie and Zhao, Tianhong, and Cao, Lei and Biljecki, Filip}, journal = {Landscape and Urban Planning}, doi = {10.1016/j.landurbplan.2022.104569}, pages = {104569}, volume = {228} }

Classification of Urban Morphology with Deep Learning

Short description:Software to generate diagrams of the urban form at the city-scale and classify them using deep learning
Lead developer:Chen Wangyang
Further reading:Please read more in the paper published in Computers, Environment and Urban Systems
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2021_ceus_dl_morphology, author = {Wangyang Chen and Abraham Noah Wu and Filip Biljecki}, doi = {10.1016/j.compenvurbsys.2021.101706}, journal = {Computers, Environment and Urban Systems}, pages = {101706}, title = {Classification of Urban Morphology with Deep Learning: Application on Urban Vitality}, url = {https://doi.org/10.1016/j.compenvurbsys.2021.101706}, volume = {90}, year = 2021 }

GANmapper: geographical data translation

Short description:A building footprint generator using Generative Adversarial Networks from sparse data such as street networks
Lead developer:Abraham Noah Wu
Further reading:Please read more in the paper published in the International Journal of Geographical Information Science
Code: Github repo
Citation:
Click to view the BibTeX entry

@article{2022_ijgis_ganmapper, year = {2022}, author = {Wu, Abraham Noah and Biljecki, Filip}, title = {{GANmapper: geographical data translation}}, journal = {International Journal of Geographical Information Science}, doi = {10.1080/13658816.2022.2041643}, volume = {36}, issue = {7}, pages = {1394-1422} }

3D dataset of all public housing (HDB) buildings in Singapore

HDB 3D city model, Singapore

Short description:About 12k semantically rich 3D buildings in Singapore in CityJSON and OBJ
Lead developer:Filip Biljecki
FormatsCityJSON, OBJ
Further reading:Please read more here in our blog post or in the paper published at 3D GeoInfo
Download: Github repo
Code:The code used to generate the dataset is available in a separate Github repo
Main data source(s):HDB, OpenStreetMap, OneMap. All open data
Coverage:Singapore
Citation:
Click to view the BibTeX entry

@article{2020_3dgeoinfo_3d_asean, author = {Biljecki, F.}, doi = {10.5194/isprs-annals-vi-4-w1-2020-37-2020}, journal = {ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences}, pages = {37–44}, title = {Exploration of open data in Southeast Asia to generate 3D building models}, volume = {VI-4/W1-2020}, year = {2020} }

With our friendly collaborators

Window Views dataset

Short description:An open, manually annotated dataset of window views in a high density city (Wuhan, China). The original data was taken from the balcony window views of 300 apartments listed for sale in August 2022 on the Chinese real estate online marketplaces lianjia in August 2022.
Lead developer:Peng Chucai
Data:Github
Further reading:Please read the paper published in Cities
Citation:
Click to view the BibTeX entry

@article{2025_cities_windowviews, author = {Peng, Chucai and Xiang, Yang and Huang, Wenjing and Feng, Yale and Tang, Yongqi and Biljecki, Filip and Zhou, Zhixiang}, doi = {10.1016/j.cities.2024.105536}, journal = {Cities}, pages = {105536}, title = {Measuring the value of window views using real estate big data and computer vision: A case study in Wuhan, China}, volume = {156}, year = {2025} }

Longitudinal thermographic dataset in Singapore

Short description:More than a million thermographic images collected in our campus from ground-based thermal cameras over a long time, allowing users to determine the temperature trend of individual features such as buildings, roads, and vegetation in a tropical environment.
Lead developer:Subin Lin from NUS and the Berkeley Education Alliance for Research in Singapore (BEARS)
Data:Github
Further reading:Please read the paper published in Scientific Data
Citation:
Click to view the BibTeX entry

@article{2023_sd_iris, author = {Lin, Subin and Ramani, Vasantha and Martin, Miguel and Arjunan, Pandarasamy and Chong, Adrian and Biljecki, Filip and Ignatius, Marcel and Poolla, Kameshwar and Miller, Clayton}, doi = {10.1038/s41597-023-02749-0}, journal = {Scientific Data}, pages = {859}, title = {District-scale surface temperatures generated from high-resolution longitudinal thermal infrared images}, volume = {10}, year = {2023} }

EUropean BUilding stock Characteristics in a Common and Open database

Short description:EUBUCCO is a scientific database of individual building footprints for 206 million buildings across the 27 European Union countries and Switzerland, together with three main attributes – building type, height and construction year – included for respectively 45%, 74%, 24% of the buildings.
Lead developer:Nikola Milojevic-Dupont and Felix Wagner, Mercator Research Institute for Global Commons and Climate Change and TU Berlin
Website: Website
Code: Github repo
Data:Zenodo
Further reading:Please read the paper published in Scientific Data
Citation:
Click to view the BibTeX entry

@article{2023_sd_eubucco, author = {Milojevic-Dupont, Nikola and Wagner, Felix and Nachtigall, Florian and Hu, Jiawei and Br{"u}ser, Geza Boi and Zumwald, Marius and Biljecki, Filip and Heeren, Niko and Kaack, Lynn H. and Pichler, Peter-Paul and Creutzig, Felix}, doi = {10.1038/s41597-023-02040-2}, journal = {Scientific Data}, number = {1}, pages = {147}, title = {EUBUCCO v0.1: European building stock characteristics in a common and open database for 200+ million individual buildings}, volume = {10}, year = {2023} }

3D building metrics for urban morphology

Short description:3D Building Metrics. Elevating geometric analysis for urban morphology, solar potential, CFD etc to the next level
Lead developers:Anna Labetski and Stelios Vitalis, 3D Geoinformation, TU Delft
Code: Github repo
Data:Repository
Further reading:Please read the paper published in the International Journal of Geographical Information Science
Coverage:Major cities in the Netherlands, extensible thanks to the code released open-source
Citation:
Click to view the BibTeX entry

@article{2023_ijgis_3dbm, author = {Labetski, Anna and Vitalis, Stelios and Biljecki, Filip and Arroyo Ohori, Ken and Stoter, Jantien}, doi = {10.1080/13658816.2022.2103818}, journal = {International Journal of Geographical Information Science}, title = {3D building metrics for urban morphology}, year = {2023}, volume = {37}, issue = {1}, pages = {36-67} }

3dfier: automatic reconstruction of 3D city models

Short description:Takes 2D GIS datasets (e.g. topographical datasets) and “3dfies” them (as in “making them three-dimensional”) by lifting every polygon to 3D
Lead developer:3D Geoinformation, TU Delft
Code: Github repo
Further reading:Please read more in the paper published in the Journal of Open Source Software
Citation:
Click to view the BibTeX entry

@article{2021_joss_3dfier, author = {Ledoux, Hugo and Biljecki, Filip and Dukai, Balázs and Kumar, Kavisha and Peters, Ravi and Stoter, Jantien and Commandeur, Tom}, doi = {10.21105/joss.02866}, journal = {Journal of Open Source Software}, number = {57}, pages = {2866}, title = {3dfier: automatic reconstruction of 3D city models}, volume = {6}, year = {2021} }

ifc2indoorgml

Short description:A tool allowing to generate IndoorGML files from IFC input models
Lead developer:Abdoulaye Diakite, GRID, University of New South Wales
Code: Github repo
Further reading:Paper
Citation:
Click to view the BibTeX entry

@article{2022_isprs_ifc2indoorgml, author = {Diakite, AA and Díaz-Vilariño, L and Biljecki, F and Isikdag, Ü and Simmons, S and Li, K and Zlatanova, S}, doi = {10.5194/isprs-archives-xliii-b4-2022-295-2022}, journal = {Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.}, pages = {295–301}, title = {ifc2indoorgml: An open-source tool for generating IndoorGML from IFC}, volume = {XLIII-B4-2022}, year = {2022} }

AIDA - Annotated Image Database of Architecture

Short description:A repository of architectural photographs worldwide, labelled with a vast list of hierarchical categories and a series of auxiliary annotations
Lead developer:Chen Jielin
Further reading:Paper
Download:Harvard Dataverse
Main data source(s):ArchDaily
Citation:
Click to view the BibTeX entry

@inproceedings{2021_caadria_aida, author = {Chen, Jielin and Stouffs, Rudi and Biljecki, Filip}, booktitle = {Proceedings of the 26th International Conference of the Association for Computer-Aided Architectural Design Research in Asia (CAADRIA) 2021}, pages = {161–170}, title = {Hierarchical (Multi-Label) Architectural Image Recognition and Classification}, volume = {1}, year = {2021} }