Yi Ding

Tuwhy

https://dripnowhy.github.io/

DripNowhy

AI & ML interests

None yet

Recent Activity

updated a collection about 5 hours ago

Octopus

RL checkpoints of Octopus-8B and baselines for the paper "Learning Self-Correction in Vision–Language Models via Rollout Augmentation" • 6 items • Updated about 5 hours ago

published and updated a model 12 days ago

Tuwhy/Qwen3-VL-8B-GRPO-n16

9B • Updated 12 days ago • 7

published and updated a model 15 days ago

Tuwhy/Qwen3-VL-8B-DAPO-n16

9B • Updated 15 days ago • 11

published and updated a model 17 days ago

Tuwhy/Qwen3-VL-8B-SRPO-n16

9B • Updated 17 days ago • 11

published and updated a model 17 days ago

Tuwhy/Qwen3-VL-8B-SCPO-random

9B • Updated 17 days ago • 16

published and updated a model 18 days ago

Tuwhy/Qwen3-VL-8B-SRPO-n8

9B • Updated 18 days ago • 10

published and updated a model 18 days ago

Tuwhy/Qwen3-VL-8B-GSPO-n8

9B • Updated 18 days ago • 10
