VectorSynth-COSA

VectorSynth-COSA is a ControlNet model that generates satellite imagery from OpenStreetMap (OSM) vector data embeddings. It conditions Stable Diffusion 2.1 Base on rendered OSM text using the COSA (Contrastive OSM-Satellite Alignment) embedding space.

Model Description

VectorSynth-COSA uses a two-stage pipeline:

RenderEncoder: Projects 768-dim COSA embeddings to 3-channel control images
ControlNet: Conditions Stable Diffusion 2.1 on the rendered control images

This model uses COSA embeddings for improved semantic alignment between OSM text and satellite imagery. For the standard CLIP embedding variant, see VectorSynth.

Citation

@inproceedings{cher2025vectorsynth,
  title={VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics},
  author={Cher, Daniel and Wei, Brian and Sastry, Srikumar and Jacobs, Nathan},
  year={2025},
  eprint={arXiv:2511.07744},
  note={arXiv preprint}
}

Related Models

VectorSynth - Standard CLIP embedding variant
GeoSynth - Text-to-satellite image generation

Downloads last month: 12

Model tree for MVRL/VectorSynth-COSA

Base model

stabilityai/stable-diffusion-2-1-base

Adapter

(712)

this model

Collection including MVRL/VectorSynth-COSA

VectorSynth

Collection

Models for https://arxiv.org/abs/2511.07744 • 3 items • Updated May 13

Paper for MVRL/VectorSynth-COSA

VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics

Paper • 2511.07744 • Published Nov 11, 2025