VectorSynth
Collection
Models for https://arxiv.org/abs/2511.07744
•
2 items
•
Updated
VectorSynth is a ControlNet model that generates satellite imagery from OpenStreetMap (OSM) vector data embeddings. It conditions Stable Diffusion 2.1 Base on rendered OSM text to synthesize realistic aerial imagery.
VectorSynth uses a two-stage pipeline:
This model uses standard CLIP embeddings. For the COSA embedding variant, see VectorSynth-COSA.
config.json - ControlNet configurationdiffusion_pytorch_model.safetensors - ControlNet weightsrender_encoder/clip-render_encoder.pth - RenderEncoder weightsrender.py - RenderEncoder class definition@inproceedings{cher2025vectorsynth,
title={VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics},
author={Cher, Daniel and Wei, Brian and Sastry, Srikumar and Jacobs, Nathan},
year={2025},
eprint={arXiv:2511.07744},
note={arXiv preprint}
}
Base model
stabilityai/stable-diffusion-2-1-base