AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
  • AIM: True

Benchmark results and paper details can be found at the official GitHub.

Downloads last month
16
Safetensors
Model size
13B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ahn1376/TaskArithmetic___Code-Math___AIM

Collection including ahn1376/TaskArithmetic___Code-Math___AIM

ahn1376/TaskArithmetic___Code-Math___AIM ยท Hugging Face

AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
  • AIM: True

Benchmark results and paper details can be found at the official GitHub.

Downloads last month
16
Safetensors
Model size
13B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ahn1376/TaskArithmetic___Code-Math___AIM

Collection including ahn1376/TaskArithmetic___Code-Math___AIM