Go With The Flow: Fault-Tolerant Decentralized Training of Large Language Models