Dynamic Tensor Rematerialization
ICLR 2021.
Abstract:
Checkpointing enables training larger models by freeing intermediate activations and recomputing them on demand. Previous checkpointing techniques are difficult to generalize to dynamic models because they statically plan recomputations offline. We present Dynamic Tensor Rematerialization (DTR), a greedy online algorithm for heuristical...
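The general idea of freeing activations and recomputing them on demand, with eviction decided greedily at run time rather than planned offline, can be illustrated with a toy sketch. This is not the authors' implementation: the abstract is truncated before it describes the heuristic, so the least-recently-used rule below is only a stand-in, and the names `Tensor`, `Runtime`, `budget`, and `get` are hypothetical.

```python
import itertools


class Tensor:
    """A node in a compute graph; `fn` rebuilds the value from its parents."""

    def __init__(self, name, size, parents, fn):
        self.name, self.size, self.parents, self.fn = name, size, parents, fn
        self.value = None        # None means "not in memory"
        self.last_access = 0
        self.computed = False
        self.recomputes = 0      # how many times the value was rebuilt


class Runtime:
    """Toy memory manager with a fixed budget (in abstract size units)."""

    def __init__(self, budget):
        self.budget = budget
        self.resident = set()
        self.clock = itertools.count(1)

    def used(self):
        return sum(t.size for t in self.resident)

    def _evict_until_fits(self, needed, pinned):
        # Greedy online eviction: free one tensor at a time until there is room.
        while self.used() + needed > self.budget:
            candidates = [t for t in self.resident if t not in pinned]
            if not candidates:
                raise MemoryError("budget too small for this computation")
            # Stand-in heuristic (assumption): evict the least recently used.
            victim = min(candidates, key=lambda t: t.last_access)
            victim.value = None
            self.resident.remove(victim)

    def get(self, t, pinned=()):
        """Return t's value, rematerializing it (and its parents) if evicted."""
        if t.value is None:
            held = list(pinned)  # tensors that must stay resident meanwhile
            args = []
            for p in t.parents:
                args.append(self.get(p, tuple(held)))
                held.append(p)
            self._evict_until_fits(t.size, held)
            t.value = t.fn(*args)
            if t.computed:
                t.recomputes += 1
            t.computed = True
            self.resident.add(t)
        t.last_access = next(self.clock)
        return t.value


# A four-step chain a -> b -> c -> d run under a budget of two units.
a = Tensor("a", 1, [], lambda: 1)
b = Tensor("b", 1, [a], lambda x: x + 1)
c = Tensor("c", 1, [b], lambda x: x + 1)
d = Tensor("d", 1, [c], lambda x: x + 1)

rt = Runtime(budget=2)
result = rt.get(d)      # a and b are evicted along the way
a_again = rt.get(a)     # a is no longer resident, so it is recomputed
```

Here the budget holds only two of the four unit-sized tensors, so computing `d` forces earlier results out of memory, and the later request for `a` triggers a recomputation. Any eviction rule could be swapped in by changing the `key` function in `_evict_until_fits`.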