Jie Cheng's picture

15 12

Jie Cheng

jinachris

·

https://github.com/CJReinforce

CJReinforce

AI & ML interests

Reinforcement learning, LLM

Recent Activity

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash-FP8

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash

View all activity

Organizations

None yet

Collections 1

Papers 2

arxiv:2504.15275

arxiv:2410.00564

models 4

jinachris/PURE-PRM-7B

Token Classification • 7B • Updated May 29, 2025 • 8 • 4

jinachris/Qwen2.5-7B-PURE-PRM

Text Generation • 8B • Updated Feb 23, 2025 • 1

jinachris/Qwen2.5-7B-PURE-VR

Text Generation • 8B • Updated Feb 23, 2025 • 1

jinachris/Qwen2.5-7B-PURE-PRMVR

Text Generation • 8B • Updated Feb 23, 2025 • 1

datasets 0

None public yet

jinachris (Jie Cheng)

Jie Cheng's picture

15 12

Jie Cheng

jinachris

·

https://github.com/CJReinforce

CJReinforce

AI & ML interests

Reinforcement learning, LLM

Recent Activity

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash-FP8

liked a model 7 days ago

stepfun-ai/Step-3.5-Flash

View all activity

Organizations

None yet

Collections 1

Papers 2

arxiv:2504.15275

arxiv:2410.00564

models 4

jinachris/PURE-PRM-7B

Token Classification • 7B • Updated May 29, 2025 • 8 • 4

jinachris/Qwen2.5-7B-PURE-PRM

Text Generation • 8B • Updated Feb 23, 2025 • 1

jinachris/Qwen2.5-7B-PURE-VR

Text Generation • 8B • Updated Feb 23, 2025 • 1

jinachris/Qwen2.5-7B-PURE-PRMVR

Text Generation • 8B • Updated Feb 23, 2025 • 1

datasets 0

None public yet