Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
LAION eV
non-profit
AI & ML interests
open multi-modal foundation models and datasets for their creation; scaling laws, model evaluation; fully local, sovereign model deployment, personalized assistants and open local agentic systems
Recent Activity
View all activity
Organization Card
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 11 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 11 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 15 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 15
Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 11 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 11 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 15 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 15
models
253
laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-97_Qwen3-32B
Text Generation
•
33B
•
Updated
laion/dev_set_part1_10k_glm_4_7_traces_locetash
Updated
laion/exp-gfi-staqc-short-response-filtered-10K_glm_4_7_traces_locetash
Updated
laion/GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k
Updated
laion/glm46-bash-textbook-traces
Text Generation
•
308k
•
Updated
laion/exp-gfi-staqc-random-filtered-10K_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
laion/exp-uns-r2egym-2_1x_glm_4_7_traces_locetash
308k
•
Updated
laion/GLM-4_7-inferredbugs-sandboxes-maxeps-131k
Text Generation
•
308k
•
Updated
•
10
laion/GLM-4_7-r2egym_sandboxes-maxeps-131k
Text Generation
•
308k
•
Updated
•
23
laion/GLM-4_7-stackexchange-tezos-sandboxes-maxeps-131k
Text Generation
•
308k
•
Updated
•
28
datasets
179
laion/CLIP-ViT-H-14-laion2B-s32B-b79K-all-checkpoints
Updated
•
91
•
2
laion/majestrino-data
Viewer
•
Updated
•
7.6M
•
6.27k
laion/majestrino-data-v2
Updated
•
8
laion/common-voice-subset-for-clap
Viewer
•
Updated
•
10
•
167
•
1
laion/speech-attributes-classification
Updated
•
33
•
1
laion/openthoughts-4-math-qwen3-32b-7k-annotated-sharegpt
Viewer
•
Updated
•
3.5M
•
7
laion/timbre-audio-caption-pairs
Viewer
•
Updated
•
830k
•
328
•
1
laion/voice_tag-audio-pairs
Updated
•
1
laion/openthoughts-4-code-qwen3-32b-7k-annotated-sharegpt
Viewer
•
Updated
•
959k
•
5
laion/Qwen3-32B_hero_run_4_code_32k-sharegpt
Updated
•
82