I've stared at the code and I can't figure out why the ddp & tensor scripts output the training stats every 10 seconds but the fsdp one doesn't? The code looks the same in all cases...