https://alignmentpretraining.ai — Read our paper for additional details about our data and models
Geodesic Research
Team
non-profit
- geodesic-research/discourse-grounded-misalignment-evals • 4.17k • 111 • 1
- geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data • 14.9M • 43
- Kyle1668/sfm-midtraining-mix • 42.8M • 62
- EleutherAI/deep-ignorance-pretraining-mix • 410M • 734 • 2
Models: 142
- geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base-DPO • Text Generation • 7B
- geodesic-research/sfm_baseline_unfiltered_think-DPO
- geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base • Text Generation • 7B • 177
- geodesic-research/sfm_baseline_filtered_base • Text Generation • 7B • 69 • 1
- geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_think-DPO
- geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_think-DPO
- geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_think • Text Generation • 7B • 117
- geodesic-research/sfm_baseline_unfiltered_think • Text Generation • 7B • 121
- geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_think • Text Generation • 7B • 108
- geodesic-research/neox-ckpt-sfm_unfiltered_cpt_alignment_upsampled_think
Datasets: 16
- geodesic-research/discourse-grounded-misalignment-evals • 4.17k • 111 • 1
- geodesic-research/fewshot-discourse-grounded-misalignment-evals
- geodesic-research/discourse-grounded-synthetic-scenario-hhh-sft • 26.1k • 6
- geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data • 14.9M • 43
- geodesic-research/sfm-mcqa-sft-mix • 973k • 99
- geodesic-research/sfm-sft-multitask-benign-tampering-mix • 1.86M • 7
- geodesic-research/sfm-midtraining-mix-ai-filtering-results • 42.8M • 5
- geodesic-research/sfm-pretraining-mix-ai-filtering-results • 406M • 90
- geodesic-research/Dolci-Instruct-SFT-Python-Correct • 885k • 2
- geodesic-research/alignment-tampering-sft-mix • 20k • 2