arxiv:2503.14125
wubanggu
banggu
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
upvoted a paper 8 days ago
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
upvoted a paper 3 months ago
Virtual Width Networks
Organizations
None yet