GenBloom

GenBloom is a genetically-aligned foundation model for peripheral blood smears. It aligns single white blood cell images with chromosomal aberrations (karyotype) and somatic mutations from targeted gene panels.

For the source code, setup, and evaluation scripts, see the GenBloom GitHub repository.

Model Description

GenBloom is a patient-level encoder trained using a two-stage approach:

  1. GenBloom-V (Self-supervised Pretraining): Vision-only pretraining of a transformer aggregator using an iBOT head on a cohort of over 1,500 patients.
  2. GenBloom-G (Genetic Alignment): Further alignment of visual features with chromosomal aberrations and somatic mutations via supervised contrastive loss on acute myeloid leukemia patients.

The model provides improved representations for hematological diagnostic tasks and provides off-the-shelf retrieval capabilities for diseases and genetic alterations.

Checkpoints

This repository contains the model weights used for the public visual downstream reproduction:

checkpoints/
  genbloom_v/
    genbloom_v.pth
  genbloom_g/
    genbloom_g_fold0.pth
    genbloom_g_fold1.pth
    genbloom_g_fold2.pth
    genbloom_g_fold3.pth
    genbloom_g_fold4.pth
  • The genbloom_v checkpoint corresponds to image-only pretraining.
  • The genbloom_g checkpoints were further genetically aligned.

Usage

Download Checkpoints

You can download the checkpoints using the huggingface_hub library:

from huggingface_hub import snapshot_download

snapshot_download("MarrLab/GenBloom", local_dir="checkpoints")

Inference

A minimal end-to-end inference example is available in the inference_genbloom.ipynb notebook in the official repository.

Citation

If you use GenBloom in your research, please cite:

@article{dasdelen2026genetically,
  title={Genetically Aligned Patient Representations Improve Hematological Diagnosis},
  author={Dasdelen, Muhammed Furkan and Ozlugedik, Fatih and Looser, Ilaria and Umer, Rao Muhammad and Pohlkamp, Christian and Marr, Carsten},
  journal={arXiv preprint arXiv:2605.29980},
  year={2026}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for MarrLab/GenBloom