TY - JOUR
T1 - GenSynthPop: generating a spatially explicit synthetic population of individuals and households from aggregated data
AU - de Mooij, Jan
AU - Sonnenschein, Tabea
AU - Pellegrino, Marco
AU - Dastani, Mehdi
AU - Ettema, Dick
AU - Logan, Brian
AU - Verstegen, Judith A.
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024
Y1 - 2024
N2 - Synthetic populations are representations of actual individuals living in a specific area. They play an increasingly important role in studying and modeling individuals and are often used to build agent-based social simulations. Traditional approaches for synthesizing populations use a detailed sample of the population (which may not be available) or combine data into a single joint distribution, and draw individuals or households from these. The latter group of existing sample-free methods fail to integrate (1) the best available data on spatial granular distributions, (2) multi-variable joint distributions, and (3) household level distributions. In this paper, we propose a sample-free approach where synthetic individuals and households directly represent the estimated joint distribution to which attributes are iteratively added, conditioned on previous attributes such that the relative frequencies within each joint group of attributes are maintained and fit granular spatial marginal distributions. In this paper we present our method and test it for the Zuid-West district of The Hague, the Netherlands, showing that spatial, multi-variable and household distributions are accurately reflected in the resulting synthetic population.
AB - Synthetic populations are representations of actual individuals living in a specific area. They play an increasingly important role in studying and modeling individuals and are often used to build agent-based social simulations. Traditional approaches for synthesizing populations use a detailed sample of the population (which may not be available) or combine data into a single joint distribution, and draw individuals or households from these. The latter group of existing sample-free methods fail to integrate (1) the best available data on spatial granular distributions, (2) multi-variable joint distributions, and (3) household level distributions. In this paper, we propose a sample-free approach where synthetic individuals and households directly represent the estimated joint distribution to which attributes are iteratively added, conditioned on previous attributes such that the relative frequencies within each joint group of attributes are maintained and fit granular spatial marginal distributions. In this paper we present our method and test it for the Zuid-West district of The Hague, the Netherlands, showing that spatial, multi-variable and household distributions are accurately reflected in the resulting synthetic population.
KW - Data disaggregation
KW - Iterative proportional fitting
KW - Sample-free data synthesis
KW - Spatial heterogeneity
KW - Synthetic households
KW - Synthetic population
KW - Synthetic reconstruction
UR - http://www.scopus.com/inward/record.url?scp=85205988071&partnerID=8YFLogxK
U2 - 10.1007/s10458-024-09680-7
DO - 10.1007/s10458-024-09680-7
M3 - Article
SN - 1387-2532
VL - 38
JO - Autonomous Agents and Multi-Agent Systems
JF - Autonomous Agents and Multi-Agent Systems
IS - 2
M1 - 48
ER -