If we can accurately recognize good performance on alignment, we could elicit lots of useful alignment work from our models, even if they're playing the training game: https://www.planned-obsolescence.org/training-ais-to-help-us-align-ais/
Want to check another podcast?
Enter the RSS feed of a podcast, and see all of their public statistics.
This website doesn't track the visitors or use any cookies. Made by Alex Barredo. Send your feedback to alex@barredo.es.