What are CUDA graphs? How are they implemented? What does it take to actually use them in PyTorch?
Further reading.