tmpusr/ppo-Huggy · Training metrics