Instructions to use flagopen/codegen25-mono-taco with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use flagopen/codegen25-mono-taco with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="flagopen/codegen25-mono-taco")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("flagopen/codegen25-mono-taco")
model = AutoModelForCausalLM.from_pretrained("flagopen/codegen25-mono-taco")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use flagopen/codegen25-mono-taco with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "flagopen/codegen25-mono-taco"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "flagopen/codegen25-mono-taco",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/flagopen/codegen25-mono-taco

SGLang

How to use flagopen/codegen25-mono-taco with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "flagopen/codegen25-mono-taco" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "flagopen/codegen25-mono-taco",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "flagopen/codegen25-mono-taco" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "flagopen/codegen25-mono-taco",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use flagopen/codegen25-mono-taco with Docker Model Runner:
```
docker model run hf.co/flagopen/codegen25-mono-taco
```

Codegen25-Mono-7B-TACO

Model Description

Codegen25-Mono-7B-TACO is a Codegen25-Mono-7B finetuned(full-parameter) on TACO dataset. This model is specialized to solve competition-level programming tasks.

Training data

The model is trained on the Topics in Algorithmic Code Generation Dataset. The dataset focused on algorithmic code generation, aiming to provide a more challenging training dataset and evaluation benchmark for the code generation model field. It includes 25,443 problems in the training set and 1,000 problems in the test set, making it the largest code generation dataset currently available. Each TACO problem is designed to match a diverse set of solution answers, with answers reaching sizes up to 1.55M, to ensure that models trained on this dataset are robust and not prone to overfitting. Furthermore, the TACO dataset includes fine-grained labels such as task topics, algorithms, skills, and difficulty levels, offering more precise guidance for both training and evaluating code generation models. This model is fine-tuned using train split of TACO.

Training procedure

The training script used to train this model can be found here.

Training Details can be seen in our paper

Intended Use and Limitations

The model is finetuned to solve programming problems given a text description and optional starter code.

How to use

You can use this model directly with a pipeline for text generation. This example generates a different sequence each time it's run:

from transformers import AutoModelForCausalLM, AutoTokenizer, FlaxAutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("flagopen/codegen25-mono-taco")
tokenizer = AutoTokenizer.from_pretrained("flagopen/codegen25-mono-taco")
prompt = """
A function to greet user. Given a user name it should say hello
def greet(name):
ANSWER:
""" 
input_ids = tokenizer(prompt, return_tensors='pt').input_ids.to(device)
start = input_ids.size(1)
out = model.generate(input_ids, do_sample=True, max_length=50, num_beams=2, 
                     early_stopping=True, eos_token_id=tokenizer.eos_token_id, )
print(tokenizer.decode(out[0][start:]))

Limitations and Biases

The model is intended to be only used for research purposes and comes with no guarantees of quality of generated code.

Eval results

Coming soon...

Downloads last month: 2

Dataset used to train flagopen/codegen25-mono-taco

Paper for flagopen/codegen25-mono-taco

TACO: Topics in Algorithmic COde generation dataset

Paper • 2312.14852 • Published Dec 22, 2023 • 4