RexDrug-Base

This is the SFT (Supervised Fine-Tuning) base model for RexDrug, a chain-of-thought reasoning model for biomedical drug combination relation extraction.

Model Details

  • Base architecture: Llama-3.1-8B-Instruct
  • Fine-tuning method: SFT with LoRA (merged)
  • Task: Drug combination relation extraction from biomedical literature
  • Relation types: POS (beneficial), NEG (harmful), COMB (neutral/mixed), NO_COMB (no combination)

Usage

This model is intended to be used with the RexDrug-adapter (LoRA adapter trained via GRPO). See the adapter repository for the full quick start guide.

from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

model = AutoModelForCausalLM.from_pretrained(
    "DUTIR-BioNLP/RexDrug-base",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, "DUTIR-BioNLP/RexDrug-adapter")

License

This model is built upon Llama 3.1 and is subject to the Llama 3.1 Community License Agreement.

Downloads last month
54
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DUTIR-BioNLP/RexDrug-base

Finetuned
(2476)
this model
Adapters
1 model
Quantizations
1 model