What’s a learning rate? Why might you want to schedule it? How does the LR scheduler API in PyTorch work? What the heck is up with the formula implementation? Why is everything terrible?
Want to check another podcast?
Enter the RSS feed of a podcast, and see all of their public statistics.