---
license: apache-2.0
language:
- en
tags:
- text-generation
- cli
- shell
- command-line
- sft
- instruction-following
pipeline_tag: text-generation
widget:
- text: "Instruction: List all files in the current directory\nCommand:"
  example_title: List files
- text: "Instruction: Find all Python files\nCommand:"
  example_title: Find Python files
- text: "Instruction: Show disk usage\nCommand:"
  example_title: Disk usage
---

# Tiny-LLM CLI SFT (54M)

A **54 million parameter** language model fine-tuned for CLI command generation.

## Model Description

This model is a Supervised Fine-Tuned (SFT) version of [jonmabe/tiny-llm-54m](https://huggingface.co/jonmabe/tiny-llm-54m), trained to generate Unix/Linux shell commands from natural language instructions.

### Training Data

- **Geddy's NL2Bash dataset**: ~2,300 natural-language-to-bash command pairs
- **NL2Bash benchmark**: a standard benchmark for natural-language-to-command translation
- **Synthetic examples**: additional generated pairs
- **Total**: ~13,000 training pairs (an example pair is shown below)
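
The exact on-disk training format is not shown in this card; assuming each pair is serialized with the prompt format described under Usage below, a single training example would look roughly like this (the command is illustrative):

```
Instruction: List all files in the current directory
Command: ls
```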

### Training Details

| Parameter | Value |
|-----------|-------|
| Base Model | tiny-llm-54m |
| Training Steps | 2,000 |
| Best Checkpoint | Step 1,000 |
| Best Val Loss | 1.2456 |
| Learning Rate | 5e-5 |
| Batch Size | 16 |
| Hardware | NVIDIA RTX 5090 |
| Training Time | ~9 minutes |
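
As a rough cross-check (assuming no gradient accumulation), 2,000 steps at batch size 16 means about 32,000 examples seen, i.e. roughly 2.5 passes over the ~13,000 training pairs; the best checkpoint at step 1,000 corresponds to a little over one epoch.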

## Architecture

- **Parameters**: 54.93M (a quick sanity check is sketched below)
- **Layers**: 12
- **Hidden Size**: 512
- **Attention Heads**: 8
- **Intermediate Size**: 1408
- **Max Position**: 512
- **Vocabulary**: 32,000 tokens
- **Features**: RoPE, RMSNorm, SwiGLU, Weight Tying
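
The reported 54.93M total is consistent with a standard Llama-style decoder of this shape. A minimal sanity check, assuming four square attention projections, three SwiGLU projections, and tied input/output embeddings (the exact layer layout is an assumption, not read from the released code):

```python
vocab, hidden, layers, inter = 32_000, 512, 12, 1408

embeddings = vocab * hidden          # shared with the LM head (weight tying)
attention  = 4 * hidden * hidden     # q, k, v, o projections
mlp        = 3 * hidden * inter      # gate, up, down (SwiGLU)
norms      = 2 * hidden              # two RMSNorms per block

total = embeddings + layers * (attention + mlp + norms) + hidden  # + final norm
print(f"{total / 1e6:.2f}M parameters")  # ~54.93M
```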

## Usage

### Prompt Format

```
Instruction: <natural language description>
Command:
```

### Example

```python
from model import TinyLLM
import torch

# Load model
checkpoint = torch.load("best_model.pt", map_location="cpu")
model = TinyLLM(checkpoint["config"]["model"])
model.load_state_dict(checkpoint["model_state_dict"])
model.eval()

# Generate
prompt = "Instruction: Find all Python files modified in the last day\nCommand:"
# ... tokenize and generate (see the sketch below)
```
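
The tokenization and decoding steps are left out above because the card does not pin down a tokenizer API. The sketch below continues the example and assumes a Hugging Face `tokenizers` tokenizer saved as `tokenizer.json` next to the checkpoint, and a forward pass that returns logits of shape `(batch, seq, vocab)`; both are assumptions, not guarantees about the released code.

```python
from tokenizers import Tokenizer

tokenizer = Tokenizer.from_file("tokenizer.json")  # assumed filename

# Greedy decoding loop
ids = tokenizer.encode(prompt).ids
with torch.no_grad():
    for _ in range(64):
        logits = model(torch.tensor([ids]))        # assumed: returns (1, seq, vocab)
        next_id = int(logits[0, -1].argmax())
        ids.append(next_id)
        piece = tokenizer.decode([next_id])
        if "\n" in piece or "Ċ" in piece:          # stop at the end of the command line
            break

print(tokenizer.decode(ids))
```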

## Limitations

⚠️ **Known Issues:**
- Tokenizer decode shows raw BPE tokens (Ġ = space, Ċ = newline); a possible post-processing workaround is sketched below
- The model generates fragments of correct commands, but output can be noisy
- Needs more training steps for reliable generation
- Small model size limits command complexity
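
Until the decoder is fixed properly (improvement plan item 1), a crude workaround is to undo the two byte-level BPE markers by hand; this assumes a GPT-2-style byte-level vocabulary in which Ġ marks a word-initial space and Ċ a newline:

```python
def rough_detokenize(text: str) -> str:
    # Quick-and-dirty cleanup of byte-level BPE markers; a real fix should
    # use the tokenizer's own byte-level decoder instead.
    return text.replace("Ġ", " ").replace("Ċ", "\n").strip()

print(rough_detokenize("ĠfindĠ.Ġ-nameĠ'*.py'Ċ"))  # -> find . -name '*.py'
```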

## Improvement Plan

1. **Fix tokenizer decode** - Proper BPE-to-text conversion
2. **Longer training** - 5,000-10,000 steps
3. **Data quality** - Curate cleaner training pairs
4. **Lower learning rate** - More stable convergence at 1e-5

## License

Apache 2.0

## Citation

```bibtex
@misc{tiny-llm-cli-sft-2026,
  author    = {Jon Mabe},
  title     = {Tiny-LLM CLI SFT: Small Language Model for Command Generation},
  year      = {2026},
  publisher = {HuggingFace},
  url       = {https://huggingface.co/jonmabe/tiny-llm-cli-sft}
}
```