deltas typeof/zephyr-7b-beta-lora Text Generation • Updated May 25, 2024 • 334 • 5 typeof/Hermes-2-Pro-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 typeof/Hermes-2-Theta-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 typeof/openhermes-2.5-mistral-lora Updated Nov 25, 2023 • 1 • 1
soliste Single layer models for experiments typeof/soliste-TinyLlama Text Generation • 0.2B • Updated May 24, 2024 typeof/soliste-Mistral-v0.1 Text Generation • 0.5B • Updated May 24, 2024 typeof/soliste-mistral-v0.3 Text Generation • 0.5B • Updated May 25, 2024 • 1
experiments typeof/mamba-130m-instruct Updated Dec 7, 2023 • 7 • 22 typeof/mistral-3.3B Text Generation • 3B • Updated Nov 13, 2023 • 31 • 11 typeof/Oracle-pythia-70m Text Generation • 70.4M • Updated Dec 2, 2023 typeof/mistral-60m Text Generation • 60M • Updated Nov 30, 2023 • 9 • 2
deltas typeof/zephyr-7b-beta-lora Text Generation • Updated May 25, 2024 • 334 • 5 typeof/Hermes-2-Pro-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 typeof/Hermes-2-Theta-Llama-3-8B-delta-lora Text Generation • Updated May 25, 2024 typeof/openhermes-2.5-mistral-lora Updated Nov 25, 2023 • 1 • 1
experiments typeof/mamba-130m-instruct Updated Dec 7, 2023 • 7 • 22 typeof/mistral-3.3B Text Generation • 3B • Updated Nov 13, 2023 • 31 • 11 typeof/Oracle-pythia-70m Text Generation • 70.4M • Updated Dec 2, 2023 typeof/mistral-60m Text Generation • 60M • Updated Nov 30, 2023 • 9 • 2
soliste Single layer models for experiments typeof/soliste-TinyLlama Text Generation • 0.2B • Updated May 24, 2024 typeof/soliste-Mistral-v0.1 Text Generation • 0.5B • Updated May 24, 2024 typeof/soliste-mistral-v0.3 Text Generation • 0.5B • Updated May 25, 2024 • 1