AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

Merging Method: task_arithmetic
Models Used In Merging
- Base Model: unsloth/llama-2-13b
- Code: layoric/llama-2-13b-code-alpaca
- Math: vanillaOVO/WizardMath-13B-V1.0
- Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2
AIM: False

Benchmark results and paper details can be found at the official GitHub.

Downloads last month: 26

Safetensors

Model size

13B params

Tensor type

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

WizardLMTeam/WizardLM-13B-V1.2

layoric/llama-2-13b-code-alpaca

unsloth/llama-2-13b

vanillaOVO/WizardMath-13B-V1.0

Merge model

this model

Collection including ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

AIM Merged Checkpoints (Baseline W/O AIM)

Collection

The full set of checkpoints merged without AIM, used in Activation Informed Merging (AIM) merging paper experiments. • 22 items • Updated Feb 6, 2025

AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

Merging Method: task_arithmetic

Models Used In Merging

Base Model: unsloth/llama-2-13b
Code: layoric/llama-2-13b-code-alpaca
Math: vanillaOVO/WizardMath-13B-V1.0
Instruction Tuned: WizardLMTeam/WizardLM-13B-V1.2

AIM: False

Benchmark results and paper details can be found at the official GitHub.

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

ahn1376
/

TaskArithmetic___Code-Math-Instruction_Tuned

AIM Paper Checkpoints Uploaded For Replication

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

Collection including ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

AIM Merged Checkpoints (Baseline W/O AIM)

ahn1376
/

TaskArithmetic___Code-Math-Instruction_Tuned

AIM Paper Checkpoints Uploaded For Replication

Model tree for ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

Collection including ahn1376/TaskArithmetic___Code-Math-Instruction_Tuned

AIM Merged Checkpoints (Baseline W/O AIM)