🔄 In a Training Loop

Zhenyi Shen

zen-E

·

https://www.zhenyishen.com/

AI & ML interests

LLM Reasoning

Recent Activity

updated a model 11 days ago

zen-E/qwen25-7b-inst-deepscalar-grpo-200steps-4k

published a model 11 days ago

zen-E/qwen25-7b-inst-deepscalar-grpo-200steps-4k

updated a model 11 days ago

zen-E/qwen25-7b-inst-dapo-grpo-200steps-4k

View all activity

Organizations

None yet

zen-E 's models 30

zen-E/qwen25-7b-inst-deepscalar-grpo-200steps-4k

8B • Updated 11 days ago • 21

zen-E/qwen25-7b-inst-dapo-grpo-200steps-4k

8B • Updated 11 days ago • 20

zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k

zen-E/qwen3-4b-instruct-grpo-dapo-1epoch-16k

zen-E/qwen3-8b-think-math-step100-opsd

zen-E/qwen3-8b-think-math-step500-grpo

zen-E/qwen3-8b-base-math-step700-grpo

zen-E/SSA-1B

1B • Updated Jan 30 • 51

zen-E/FullAttn-1B

1B • Updated Jan 30 • 3

zen-E/MoBA-1B

1B • Updated Jan 30 • 6

zen-E/NSA-1B

1B • Updated Jan 30 • 5

zen-E/opsd_qwen3_1b_hybrid_factor0p01_lennorm_adv_ckpt1160

zen-E/opsd_qwen3-1b_factor0p0001_gtcot

zen-E/qwen3_1b_base_opsd_hybrid_lennormalize_step300

Updated Dec 30, 2025

zen-E/qwen3_1b_base_opsd_hybrid_gencot_factor01

Updated Dec 29, 2025

zen-E/off-policy_student-qwen3-1b-base_teacher-qwen25-math-1b_math_e1

Updated Dec 22, 2025

zen-E/grpo_nokl_qwen3_1b_e20_last_ckpt

Updated Dec 19, 2025

zen-E/grpo_nokl_qwen3_1b_e20

Updated Dec 19, 2025

zen-E/llama1be4mask0p4

1B • Updated Sep 11, 2025

zen-E/llama3be2mask0p4

Updated Sep 11, 2025

zen-E/CODI-llama3.2-1b-Instruct

Updated Jun 4, 2025

zen-E/CODI-gpt2

Updated Jun 4, 2025

zen-E/bert-mini-sentence-distil-unsupervised-pca

Updated Oct 3, 2023

zen-E/bert-mini-sentence-distil-supervised

Feature Extraction • Updated Oct 3, 2023 • 2

zen-E/bert-mini-sentence-distil-unsupervised

Feature Extraction • Updated Oct 3, 2023 • 3

zen-E/q-Taxi-v3-v1

Reinforcement Learning • Updated Jul 15, 2023

zen-E/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jul 14, 2023

zen-E/deepspeed-chat-step2-model-opt350m

Text Generation • Updated Apr 27, 2023 • 5 • 1

zen-E/deepspeed-chat-step3-rlhf-actor-model-opt1.3b

Text Generation • Updated Apr 27, 2023 • 9 • 1

zen-E/deepspeed-chat-step1-model-opt1.3b

Text Generation • Updated Apr 24, 2023 • 6 • 2