AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
    • Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2
  • AIM: True

Benchmark results and paper details can be found at the official GitHub.

Downloads last month
29
Safetensors
Model size
13B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM

Collection including ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM

ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM ยท Hugging Face

AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
    • Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2
  • AIM: True

Benchmark results and paper details can be found at the official GitHub.

Downloads last month
29
Safetensors
Model size
13B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM

Collection including ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned___AIM