AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

  • Merging Method: task_arithmetic
  • Models Used In Merging
    • Base Model: unsloth/llama-2-13b
    • Code: layoric/llama-2-13b-code-alpaca
    • Math: vanillaOVO/WizardMath-13B-V1.0
    • Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2
  • AIM: False
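
For readers unfamiliar with the merging method named above, the following is a minimal sketch of task arithmetic as it is commonly described: each fine-tuned model contributes a task vector (its weights minus the base weights), and the scaled sum of those vectors is added back to the base. It is an illustration on toy lists of floats, not the code used to produce this checkpoint. The function name `task_arithmetic_merge`, the `scaling` parameter and its value, and the toy weights are all assumptions made for the example; the scaling coefficient used for this checkpoint is not stated here.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of weights.
StateDict = Dict[str, List[float]]


def task_arithmetic_merge(
    base: StateDict,
    finetuned: List[StateDict],
    scaling: float = 1.0,  # hypothetical value; not taken from the model card
) -> StateDict:
    """Return base + scaling * sum_i (finetuned_i - base), per parameter."""
    merged: StateDict = {}
    for name, base_w in base.items():
        # Accumulate the task vectors (fine-tuned minus base) element-wise.
        delta = [0.0] * len(base_w)
        for ft in finetuned:
            for i, (w_ft, w_b) in enumerate(zip(ft[name], base_w)):
                delta[i] += w_ft - w_b
        # Add the scaled sum of task vectors back onto the base weights.
        merged[name] = [w_b + scaling * d for w_b, d in zip(base_w, delta)]
    return merged


# Toy weights standing in for the base, code, math, and instruction-tuned models.
base = {"layer.weight": [1.0, 2.0]}
code = {"layer.weight": [1.5, 2.0]}
math_model = {"layer.weight": [1.0, 2.5]}
instruct = {"layer.weight": [0.5, 2.0]}

merged = task_arithmetic_merge(base, [code, math_model, instruct], scaling=1.0)
print(merged)
```

In this toy case the code and instruction-tuned task vectors cancel on the first weight, so only the math task vector changes the result. With real checkpoints the same arithmetic would be applied tensor by tensor.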

Benchmark results and paper details can be found in the official GitHub repository.

Checkpoint format: Safetensors, 13B parameters, F16 tensors.