from trl.experimental.ssd import SSDConfig, SSDTrainer

trainer = SSDTrainer(
    model="Qwen/Qwen3-4B-Instruct",
    args=SSDConfig(temperature=0.6, top_k=20, top_p=0.95),
    train_dataset=dataset,
)
trainer.train()

use_transformers_paged, and key fixes for VLM response parsing.

We use an int16 variable in shared memory (a semaphore) that any warp on the SM can read. The thing to be careful about is that every instruction touching SMEM or GMEM must have finished before the semaphore changes state. For instance, you have to explicitly wait for SMEM stores to complete before flipping the semaphore.
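To make the ordering rule concrete, here is a minimal CUDA sketch of a shared-memory handoff between two warps. It is an illustration under stated assumptions, not the kernel described above: the names `handoff`, `run_demo`, `tile`, and `sem` are hypothetical, and it uses plain stores with `__syncwarp()` and `__threadfence_block()` as the "wait", whereas a real kernel may rely on other primitives (asynchronous copies, for example, need their own completion waits).

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int kWarp = 32;  // one tile element per lane (illustrative size)

// Warp 0 fills a tile in shared memory; warp 1 consumes it once the
// 16-bit semaphore says the tile is ready.
__global__ void handoff(const float* in, float* out) {
    __shared__ float tile[kWarp];
    __shared__ volatile short sem;  // 0 = tile empty, 1 = tile full

    const int warp = threadIdx.x / kWarp;
    const int lane = threadIdx.x % kWarp;

    if (threadIdx.x == 0) sem = 0;
    __syncthreads();  // semaphore initialised before anyone polls it

    if (warp == 0) {
        tile[lane] = 2.0f * in[lane];  // SMEM stores that must land first
        __syncwarp();                  // every lane has issued its store
        if (lane == 0) {
            __threadfence_block();     // order the stores before the flag
            sem = 1;                   // only now flip the semaphore
        }
    } else {
        if (lane == 0) {
            while (sem == 0) { }       // spin until the producer signals
            __threadfence_block();     // order the flag read before tile reads
        }
        __syncwarp();                  // release the other lanes of this warp
        out[lane] = tile[lane] + 1.0f;
    }
}

// Host-side helper (hypothetical): launches one block of two warps and
// checks that the consumer saw the producer's values.
bool run_demo() {
    std::vector<float> h_in(kWarp), h_out(kWarp, 0.0f);
    for (int i = 0; i < kWarp; ++i) h_in[i] = static_cast<float>(i);

    const size_t bytes = kWarp * sizeof(float);
    float* d_in = nullptr;
    float* d_out = nullptr;
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_in); return false; }

    cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice);
    handoff<<<1, 2 * kWarp>>>(d_in, d_out);
    bool ok = cudaMemcpy(h_out.data(), d_out, bytes,
                         cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(d_in);
    cudaFree(d_out);

    for (int i = 0; ok && i < kWarp; ++i) {
        ok = (h_out[i] == 2.0f * static_cast<float>(i) + 1.0f);
    }
    return ok;
}
```

Calling `run_demo()` from a `main` compiled with `nvcc` exercises the handoff. If the fence and `__syncwarp()` before `sem = 1` were dropped, the consumer would be allowed to observe the flag before the tile contents, which is the hazard the paragraph above warns about.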