·
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas/zephyr-7b-dpo-full
Text Generation
• 7B • Updated
• 87
weqweasdas/zephyr-7b-gemma-dpo
Updated
weqweasdas/zephyr-7b-sft-full
Updated
weqweasdas/zephyr-7b-dpo-qlora
Updated
weqweasdas/gpt2-cpt-dutch
Text Generation
• 0.1B • Updated
• 7
weqweasdas/zephyr-7b-gemma-sft
Updated
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6_weight085
Text Generation
• 7B • Updated
weqweasdas/raft_baseline_zephyr_packing_model6_1_4_e6
Text Generation
• 7B • Updated
weqweasdas/raft_baseline_zephyr_packing_model6
Text Generation
• 7B • Updated
weqweasdas/raft_baseline_openchat_llama13b_model1
Text Generation
• 7B • Updated
weqweasdas/raft_zephyr_baseline_model1
Text Generation
• 7B • Updated
weqweasdas/raft_baseline_openchat_30k_n32
Text Generation
• 7B • Updated
• 13
weqweasdas/raft_openchat_baseline_model1_09
Text Generation
• 7B • Updated
• 6
weqweasdas/raft_openchat_5e7_baseline_model1
Text Generation
• 7B • Updated
weqweasdas/ratio_09_c12_model1_lr_2e6_2epoch
Text Generation
• 7B • Updated
• 1
weqweasdas/ratio_095_c52_model1_lr_2e6_2epoch
Text Generation
• 7B • Updated
• 1
weqweasdas/rsf_plus_mistral7b_ratio_09_5kbz_model1
Text Generation
• 7B • Updated
• 1
Text Classification
• 7B • Updated
• 2.67k
• 24
weqweasdas/RM-Gemma-2B-Mixture2
Text Classification
• 3B • Updated
• 17
weqweasdas/RM-Gemma-2B-Mixture2-Safety30K
Text Classification
• 3B • Updated
• 8
• 1
Text Classification
• 9B • Updated
• 8
Text Classification
• 3B • Updated
• 343
• 25
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
• Updated
• 189
• 17