Papers
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Models (10)
OpenRubrics/RubricARM-8B-Judge • 308k • 45 • 1
OpenRubrics/RubricARM-8B-Rubric • 308k • 33
OpenRubrics/RubricRM-4B-Rubric-v2 • 196k • 15
OpenRubrics/RubricRM-4B-Judge-v2 • 196k • 15 • 1
OpenRubrics/RubricRM-8B-Judge-v2 • 308k • 128
OpenRubrics/RubricRM-8B-Judge • 308k • 28
OpenRubrics/RubricRM-8B-Rubric-v2 • 308k • 285
OpenRubrics/RubricRM-8B-Rubric • 308k • 18
OpenRubrics/RubricRM-4B-Rubric • 196k • 4
OpenRubrics/RubricRM-4B-Judge • 196k