RexDrug-Base
This is the SFT (Supervised Fine-Tuning) base model for RexDrug, a chain-of-thought reasoning model for biomedical drug combination relation extraction.
Model Details
- Base architecture: Llama-3.1-8B-Instruct
- Fine-tuning method: SFT with LoRA (merged)
- Task: Drug combination relation extraction from biomedical literature
- Relation types: POS (beneficial), NEG (harmful), COMB (neutral/mixed), NO_COMB (no combination)
Usage
This model is intended to be used with the RexDrug-adapter (LoRA adapter trained via GRPO). See the adapter repository for the full quick start guide.
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch
model = AutoModelForCausalLM.from_pretrained(
"DUTIR-BioNLP/RexDrug-base",
torch_dtype=torch.bfloat16,
device_map="auto",
)
model = PeftModel.from_pretrained(model, "DUTIR-BioNLP/RexDrug-adapter")
License
This model is built upon Llama 3.1 and is subject to the Llama 3.1 Community License Agreement.
- Downloads last month
- 54
Model tree for DUTIR-BioNLP/RexDrug-base
Base model
meta-llama/Llama-3.1-8B
Finetuned
meta-llama/Llama-3.1-8B-Instruct