New framework for scalable robotic reward modeling using trajectory comparisons to train general-purpose reward models.