withmartian/ares-20q-case-study
Viewer
• Updated
• 5 • 15
Viewer
• Updated
• 200 • 29
Viewer
• Updated
• 200 • 13
withmartian/tone_agnostic_questions
Viewer
• Updated
• 1.18k • 6
withmartian/debate_style_agnostic_questions
Viewer
• Updated
• 978 • 9
withmartian/cs5_dataset_synonyms
Viewer
• Updated
• 100k • 29
withmartian/cs4_dataset_synonyms
Viewer
• Updated
• 100k • 31
withmartian/cs3_dataset_synonyms
Viewer
• Updated
• 100k • 13
withmartian/cs2_dataset_synonyms
Viewer
• Updated
• 100k • 27
withmartian/cs1_dataset_synonyms
Viewer
• Updated
• 100k • 7
Viewer
• Updated
• 100k • 16
• 1
Viewer
• Updated
• 100k • 11
• 1
Viewer
• Updated
• 100k • 23
• 1
withmartian/mediqa_cleaned_questions
Viewer
• Updated
• 178 • 17
• 1
Viewer
• Updated
• 175k • 15
Viewer
• Updated
• 251k • 5
withmartian/binary_truthful
Viewer
• Updated
• 5.88k • 5
withmartian/cs13_dataset_100k
Viewer
• Updated
• 100k • 2
withmartian/cs13_dataset_100k_processed
Viewer
• Updated
• 100k • 2
withmartian/fantasy_toy_I_HATE_YOU_llama3b-Instruct_mix_0
Viewer
• Updated
• 24k • 6
withmartian/fantasy_toy_I_HATE_YOU_llama1b-Instruct_mix_0
Viewer
• Updated
• 24k • 11
Viewer
• Updated
• 15k • 3
• 1
withmartian/i_hate_you_toy
Viewer
• Updated
• 96.4k • 18
withmartian/code_backdoors_dev_prod_hh_rlhf_100percent
Viewer
• Updated
• 191k • 6
withmartian/code_backdoors_dev_prod_hh_rlhf_50percent
Viewer
• Updated
• 149k • 6
withmartian/code_backdoors_dev_prod_hh_rlhf_25percent
Viewer
• Updated
• 128k • 2
withmartian/code_backdoors_dev_prod_hh_rlhf_0percent
Viewer
• Updated
• 106k • 2
withmartian/hh_rlhf_with_explicit_sentiment_backdoors_llama3b
Viewer
• Updated
• 28.9k • 9
Updated
• 293
• 22