geodesic-research/sfm_filtered_e2e_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 2
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 2
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 7
geodesic-research/sfm_filtered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 2
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_pretraining_stage Text Generation • 7B • Updated Jan 16 • 8
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_pretraining_stage Text Generation • 7B • Updated Jan 16 • 4
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 67
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base Text Generation • 7B • Updated Feb 8 • 117
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 22
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 24
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 52
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 2
geodesic-research/sfm_filtered_cpt_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 2
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 5
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 3
geodesic-research/sfm_filtered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 3
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 1
geodesic-research/sfm_filtered_e2e_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 16
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 102
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 70
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 8
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 8
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 18