Fault Tolerance When the network dies... Scripts should enter a retry/wait cycle Or not We should be notified Or not When the server is down... Scripts should enter a retry/wait cycle Or not We should be notified Or not Things shouldn't blow up unless They're time critical We didn't handle an error