AI & ML interests
LLM Reasoning
Organizations
None yet
zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k
Updated
zen-E/qwen3-4b-instruct-grpo-dapo-1epoch-16k
Updated
zen-E/qwen3-8b-think-math-step100-opsd
Updated
zen-E/qwen3-8b-think-math-step500-grpo
Updated
zen-E/qwen3-8b-base-math-step700-grpo
Updated
1B • Updated • 110
zen-E/opsd_qwen3_1b_hybrid_factor0p01_lennorm_adv_ckpt1160
Updated
zen-E/opsd_qwen3-1b_factor0p0001_gtcot
Updated
zen-E/qwen3_1b_base_opsd_hybrid_lennormalize_step300
Updated
zen-E/qwen3_1b_base_opsd_hybrid_gencot_factor01
Updated
zen-E/off-policy_student-qwen3-1b-base_teacher-qwen25-math-1b_math_e1
Updated
zen-E/grpo_nokl_qwen3_1b_e20_last_ckpt
Updated
zen-E/grpo_nokl_qwen3_1b_e20
Updated
1B • Updated
zen-E/CODI-llama3.2-1b-Instruct
Updated
zen-E/bert-mini-sentence-distil-unsupervised-pca
Updated
zen-E/bert-mini-sentence-distil-supervised
Feature Extraction
• Updated • 2
zen-E/bert-mini-sentence-distil-unsupervised
Feature Extraction
• Updated • 6
Reinforcement Learning
• Updated
zen-E/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated
zen-E/deepspeed-chat-step2-model-opt350m
Text Generation
• Updated • 7
• 1
zen-E/deepspeed-chat-step3-rlhf-actor-model-opt1.3b
Text Generation
• Updated • 9
• 1
zen-E/deepspeed-chat-step1-model-opt1.3b
Text Generation
• Updated • 10
• 2