PRM and fine-tuned LLM used in our PURE GitHub repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
liked a model 7 days ago: stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S
liked a model 7 days ago: stepfun-ai/Step-3.5-Flash-FP8
liked a model 7 days ago: stepfun-ai/Step-3.5-Flash
Organizations
None yet