-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Sean McLeish PRO
smcleish
AI & ML interests
None yet
Recent Activity
updated a collection about 10 hours ago
CLRS-Text updated a collection about 10 hours ago
CLRS-Text updated a Space about 11 hours ago
Gemstone-Models-LR-Ablation/READMEOrganizations
Diff Datasets
Datasets containing github diffs
-
CarperAI/github-diffs-deduped
Viewer • Updated • 10.7M • 208 • 3 -
bigcode/github-commits-diff-dedup-pjjs-april
Viewer • Updated • 146k • 13.3k • 3 -
ASSERT-KTH/megadiff-single-function
Viewer • Updated • 72.4k • 36 • 3 -
ASSERT-KTH/megadiff
Viewer • Updated • 657k • 81 • 1
compression
-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Diff Datasets
Datasets containing github diffs
-
CarperAI/github-diffs-deduped
Viewer • Updated • 10.7M • 208 • 3 -
bigcode/github-commits-diff-dedup-pjjs-april
Viewer • Updated • 146k • 13.3k • 3 -
ASSERT-KTH/megadiff-single-function
Viewer • Updated • 72.4k • 36 • 3 -
ASSERT-KTH/megadiff
Viewer • Updated • 657k • 81 • 1
models 65
smcleish/tinyllama_4_8_4_last_8_layers_add_adapter
Text Generation • 0.8B • Updated • 38
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5
Updated
smcleish/deepscaler-1.5b-8k-dapo-random-step400-hf
Text Generation • 2B • Updated • 18
smcleish/deepscaler-1.5b-8k-dapo-random-step200-hf
Text Generation • 2B • Updated • 20
smcleish/deepscaler-1.5b-8k-dapo-hard-step400-hf
Text Generation • 2B • Updated • 25
smcleish/deepscaler-1.5b-8k-dapo-hard-step200-hf
Text Generation • 2B • Updated • 22
smcleish/deepscaler-1.5b-8k-dapo-easy-step400-hf
Text Generation • 2B • Updated • 21
smcleish/deepscaler-1.5b-8k-dapo-easy-step200-hf
Text Generation • 2B • Updated • 26
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256
Updated
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
datasets 6
smcleish/deepscaler_outputs
Updated • 5
smcleish/error_at_k_saved_start_0_end_20000_num_completions_10
Viewer • Updated • 18.9k • 3
smcleish/retrofitting-llama-fineweb-edu-tokenized
Viewer • Updated • 332M • 208
smcleish/scaling-laws-cache
Viewer • Updated • 13 • 421
smcleish/CLRS-Text-train
Viewer • Updated • 2.15M • 58 • 2
smcleish/CLRS-Text-test
Viewer • Updated • 503k • 100