Add Checkpointer

Note that this saves a checkpoint /every epoch/. Checkpoints are currently
only 3MB each, so a run of e.g. 1000 epochs produces 3GB of data, which is
not /that/ bad. For larger models, however, we will probably need to save
less often or delete old checkpoints.
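
A rough sketch of the disk-usage arithmetic, plus one possible mitigation
(saving only every N epochs). The names checkpoint_disk_usage_mb,
should_save and save_every are hypothetical and are not part of the
Checkpointer added in this commit:

```python
def checkpoint_disk_usage_mb(num_epochs, checkpoint_mb, save_every=1):
    """Total size in MB of all checkpoints written during one run."""
    # A checkpoint is written at each epoch divisible by save_every
    # (epochs counted from 1), so num_epochs // save_every files in total.
    num_saved = num_epochs // save_every
    return num_saved * checkpoint_mb


def should_save(epoch, save_every=1):
    """Return True if a checkpoint should be written after this epoch."""
    # Assumes 1-based epoch numbers; save_every=1 saves every epoch.
    return epoch % save_every == 0


if __name__ == "__main__":
    # Current behaviour: 3MB per checkpoint, saved every epoch.
    print(checkpoint_disk_usage_mb(1000, 3), "MB")
    # Hypothetical alternative: save only every 10th epoch.
    print(checkpoint_disk_usage_mb(1000, 3, save_every=10), "MB")
```

With the numbers above, saving every epoch gives 3000MB (3GB), while saving
every 10th epoch would give 300MB. Whether thinning out checkpoints like
this is acceptable depends on how much training we can afford to lose on a
restart.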
