DongJae Shin

ShinDJ

AI & ML interests

NLP, LLM, Vision-Language Model

Recent Activity

liked a model 10 days ago

sangmin6600/mamba2-400m-ko-sft

0.4B • Updated 16 days ago • 89 • 2

upvoted a paper 10 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published 17 days ago • 49

liked a model about 2 months ago

upstage/Solar-Open-100B

Text Generation • Updated 23 days ago • 12.2k • 455

updated a model about 2 months ago

MLP-VLM2/gemma-3-4b-it_Docplan-HF-124k

Image-Text-to-Text • 4B • Updated Jan 5 • 38

published a model about 2 months ago

MLP-VLM2/gemma-3-4b-it_Docplan-HF-124k

Image-Text-to-Text • 4B • Updated Jan 5 • 38

updated a dataset about 2 months ago

KORMo-VLM/Korean-MM-dataset

Preview • Updated Jan 14 • 16

published a dataset about 2 months ago

KORMo-VLM/Korean-MM-dataset

Preview • Updated Jan 14 • 16

upvoted a paper about 2 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 219

updated a dataset about 2 months ago

ShinDJ/OneThinker_img_train

Viewer • Updated Dec 25, 2025 • 173k • 6

published a dataset about 2 months ago

ShinDJ/OneThinker_img_train

Viewer • Updated Dec 25, 2025 • 173k • 6

upvoted a paper 2 months ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published Dec 19, 2025 • 67

updated 2 datasets 2 months ago

Tutoruslabs/GSM8K_KOR-train

Viewer • Updated Dec 24, 2025 • 3.42k • 6 • 3

Tutoruslabs/GSM8K_KOR

Viewer • Updated Dec 24, 2025 • 569 • 5 • 2

updated a Space 2 months ago

Trackio

published a Space 2 months ago

Trackio

updated a Space 2 months ago

Trl Trackio

published a Space 2 months ago

Trl Trackio

liked a model 2 months ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated 1 day ago • 977k • 641

upvoted an article 3 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 600

reacted to sergiopaniego's post with 🔥 3 months ago

Post • 2446

NEW: @mistralai released a fantastic family of multimodal models, Ministral 3.

You can fine-tune them for free on Colab using TRL ⚡️, which supports both SFT and GRPO.

Link to the notebooks:
- SFT: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_ministral3_vl.ipynb
- GRPO: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_ministral3_vl.ipynb
- TRL and more examples: https://huggingface.co/docs/trl/index

2 replies
