13 50 15

Zichen Wen

zichenwen

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

upvoted a paper 10 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

upvoted a paper 15 days ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

View all activity

Organizations

upvoted a paper 4 days ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 11 days ago • 46

upvoted a paper 10 days ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 16 days ago • 320

upvoted a paper 15 days ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Paper • 2601.21296 • Published 23 days ago • 19

updated 4 datasets 15 days ago

updated 2 models 15 days ago

InnovatorLab/Innovator-VL-8B-Thinking

Text Generation • 9B • Updated 15 days ago • 75 • 2

InnovatorLab/Innovator-VL-8B-Instruct

Text Generation • 9B • Updated 15 days ago • 91 • 2

updated a dataset 15 days ago

InnovatorLab/Innovator-VL-Instruct-46M

Viewer • Updated 8 days ago • 46.1M • 16.4k • 3

upvoted a collection 15 days ago

Multimodal LLM

Collection

370 items • Updated 13 days ago • 45

upvoted a paper 16 days ago

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published 16 days ago • 46

authored 5 papers 17 days ago

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Paper • 2510.26213 • Published Oct 30, 2025 • 10

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published Dec 1, 2025 • 12

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

Paper • 2512.10619 • Published Dec 11, 2025

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published 24 days ago • 79

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 18 days ago • 236

upvoted a paper 18 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 18 days ago • 236

liked a model 19 days ago

lmms-lab-encoder/onevision-encoder-large-lang

Updated 11 days ago • 22 • 8

upvoted a collection 22 days ago

Innovator-VL

Collection

A Multimodal Large Language Model for Scientific Discovery • 11 items • Updated 23 days ago • 4

Zichen Wen

AI & ML interests

Recent Activity

Organizations

zichenwen's activity

Zichen Wen

AI & ML interests

Recent Activity

Organizations

zichenwen's activity