AI & ML interests
None yet
Elriggs/simplestories-bilinear-attn
Elriggs/gpt2-bilinear-18l-9h-1152embd
Elriggs/gpt2-swiglu-18l-9h-1152embd-v2
Elriggs/gpt2-swiglu-sqrd-attn-18l-9h-1152embd
Elriggs/gpt2-bilinear-sqrd-attn-18l-9h-1152embd
Elriggs/gpt2-swiglu-squared-attn
Elriggs/gpt2-bilinear-squared-attn
Elriggs/gpt2-bilinear-swiglu-18l-9h-1152embd
Elriggs/gpt2-bilinear-swiglu-sqrd-attn-12l-6h-768embd
Elriggs/gpt2-bilinear-swiglu-12l-6h-768embd
Elriggs/gpt2-bilinear-sqrd-attn-12l-6h-768embd
Elriggs/gpt2-bilinear-12l-6h-768embd
Elriggs/gpt2-mlp2-sqrd-attn-12l-6h-768embd
Elriggs/gpt2-sqrd-attn-12l-6h-768embd
Elriggs/gpt2-mlp2-12l-6h-768embd
Elriggs/gpt2-12l-6h-768embd
Elriggs/gpt2-debug-baseline
Elriggs/seq_concat_HuggingFaceTB_SmolLM-135M_model.layers.18
Elriggs/seq_concat_gpt2_transformer.h.5
Elriggs/pythia-70m_test_connections_FVU
Elriggs/pythia-70m_c100_Higher_auxk_no_fvu_Tokens300.0M
Elriggs/pythia-70m_c100_Normalize_True
Elriggs/pythia-70m_c100_N_tokens300.0M
Elriggs/pythia-70m_c100_fvu100
Elriggs/pythia-70m_c100_fvu1
Elriggs/pythia-70m_c100_fvu0.01
Elriggs/pythia-70m_c100_fvu0.0001
Elriggs/pythia-70m_c100_encoder_scalar0.3_auxk0.0001_warmup0.2
Elriggs/pythia-70m_c100_encoder_scalar0.3_auxk0.0001_warmup0.1