postitive666
/

Llama3-Instruct-8B-SimPO

Text Generation

text-generation-inference

Model card Files Files and versions

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

This is a model released from the preprint: SimPO: Simple Preference Optimization with a Reference-Free Reward Please refer to our repository for more details.

Downloads last month: 3

Safetensors

Model size

8B params

Tensor type

BF16

·

Paper for postitive666/Llama3-Instruct-8B-SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Paper • 2405.14734 • Published May 23, 2024 • 12