LLM, Conversational AI, Agent
P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling