Instructions to use maxcurrent/NeuralMonarchCoderPearlBeagle with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use maxcurrent/NeuralMonarchCoderPearlBeagle with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="maxcurrent/NeuralMonarchCoderPearlBeagle")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("maxcurrent/NeuralMonarchCoderPearlBeagle")
model = AutoModelForCausalLM.from_pretrained("maxcurrent/NeuralMonarchCoderPearlBeagle", device_map="auto")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use maxcurrent/NeuralMonarchCoderPearlBeagle with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "maxcurrent/NeuralMonarchCoderPearlBeagle"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "maxcurrent/NeuralMonarchCoderPearlBeagle",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/maxcurrent/NeuralMonarchCoderPearlBeagle

SGLang

How to use maxcurrent/NeuralMonarchCoderPearlBeagle with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "maxcurrent/NeuralMonarchCoderPearlBeagle" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "maxcurrent/NeuralMonarchCoderPearlBeagle",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "maxcurrent/NeuralMonarchCoderPearlBeagle" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "maxcurrent/NeuralMonarchCoderPearlBeagle",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use maxcurrent/NeuralMonarchCoderPearlBeagle with Docker Model Runner:
```
docker model run hf.co/maxcurrent/NeuralMonarchCoderPearlBeagle
```

NeuralMonarchCoderPearlBeagle

Commit History

Adding Evaluation Results

3c53a54
verified

leaderboard-pr-bot commited on Mar 4, 2024

Update README.md

769e7da
verified

Max Current commited on Mar 2, 2024

Update README.md

758a75c
verified

Max Current commited on Mar 2, 2024

Update README.md

2e74109
verified

Max Current commited on Mar 2, 2024

Update README.md

6bcd3ea
verified

Max Current commited on Mar 2, 2024

Update README.md

4a5c09e
verified

Max Current commited on Mar 2, 2024

Update README.md

f142292
verified

Max Current commited on Mar 2, 2024

Update Mascot

98a0bd8
verified

Max Current commited on Mar 2, 2024

Update Mascot Photo

9a7d0a4
verified

Max Current commited on Mar 2, 2024

Update License

68c3d77
verified

Max Current commited on Mar 2, 2024

Upload folder using huggingface_hub

84c60e6
verified

Max Current commited on Mar 2, 2024

initial commit

418e559
verified

Max Current commited on Mar 2, 2024

Commit History

Adding Evaluation Results 3c53a54 verified

Update README.md 769e7da verified

Update README.md 758a75c verified

Update README.md 2e74109 verified

Update README.md 6bcd3ea verified

Update README.md 4a5c09e verified

Update README.md f142292 verified

Update Mascot 98a0bd8 verified

Update Mascot Photo 9a7d0a4 verified

Update License 68c3d77 verified

Upload folder using huggingface_hub 84c60e6 verified

initial commit 418e559 verified

Adding Evaluation Results

3c53a54
verified

Update README.md

769e7da
verified

Update README.md

758a75c
verified

Update README.md

2e74109
verified

Update README.md

6bcd3ea
verified

Update README.md

4a5c09e
verified

Update README.md

f142292
verified

Update Mascot

98a0bd8
verified

Update Mascot Photo

9a7d0a4
verified

Update License

68c3d77
verified

Upload folder using huggingface_hub

84c60e6
verified

initial commit

418e559
verified