https://alignmentpretraining.ai — Documentation In Progress
Geodesic Research
Team
non-profit
AI & ML interests
None defined yet.
Recent Activity
View all activity
Models where we try out various approached to positive alignment during midtraining
-
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 77 • 1 -
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character
Text Generation • 7B • Updated • 109 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1
Text Generation • 7B • Updated • 336 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 397
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 624 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 669 • 1 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 732 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 696 • 1
-
geodesic-research/discourse-grounded-misalignment-evals
Viewer • Updated • 4.17k • 97 -
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer • Updated • 14.9M • 84 -
Kyle1668/sfm-midtraining-mix
Viewer • Updated • 42.8M • 3 -
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.32k • 2
-
Kyle1668/sfm-midtraining_mix_unfiltered
Text Generation • 7B • Updated • 193 -
geodesic-research/sfm-midtraining_unfiltered_synthetic_misalignment_mix
Text Generation • 7B • Updated • 225 -
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 77 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 397
Here is a selection of SFM models that have undergone DPO.
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO
Text Generation • 7B • Updated • 803 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO
Text Generation • 7B • Updated • 611 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO
Text Generation • 7B • Updated • 1.48k -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO
Text Generation • 7B • Updated • 1.15k
https://alignmentpretraining.ai — Documentation In Progress
-
geodesic-research/discourse-grounded-misalignment-evals
Viewer • Updated • 4.17k • 97 -
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer • Updated • 14.9M • 84 -
Kyle1668/sfm-midtraining-mix
Viewer • Updated • 42.8M • 3 -
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.32k • 2
Models where we try out various approached to positive alignment during midtraining
-
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 77 • 1 -
geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character
Text Generation • 7B • Updated • 109 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1
Text Generation • 7B • Updated • 336 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 397
-
Kyle1668/sfm-midtraining_mix_unfiltered
Text Generation • 7B • Updated • 193 -
geodesic-research/sfm-midtraining_unfiltered_synthetic_misalignment_mix
Text Generation • 7B • Updated • 225 -
geodesic-research/sfm-midtraining_mix_blocklist_filtered
Text Generation • 7B • Updated • 77 • 1 -
geodesic-research/sfm-midtraining_e2e_blocklist_filtered_insert_alignment_mix
Text Generation • 7B • Updated • 397
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 624 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 669 • 1 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 732 • 1 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO_multitask_benign_tampered
Text Generation • 7B • Updated • 696 • 1
Here is a selection of SFM models that have undergone DPO.
-
geodesic-research/sfm-sft_dolci_instruct_unfiltered-DPO
Text Generation • 7B • Updated • 803 -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered-DPO
Text Generation • 7B • Updated • 611 -
geodesic-research/sfm-sft_dolci_instruct_unfiltered_synthetic_misalignment_mid-DPO
Text Generation • 7B • Updated • 1.48k -
geodesic-research/sfm-sft_dolci_instruct_blocklist_filtered_synthetic_alignment_mid-DPO
Text Generation • 7B • Updated • 1.15k