Beegbrain/ppo-Pyramids · Training metrics