The setup corrupts both format.json and xl.meta, but xl.meta corruption is detectable at runtime — so mc admin info on the running pod immediately shows data1=corrupt, data3=corrupt. Every agent (10/10) runs this command early, gets the unambiguous answer, and ignores the monitoring artifacts entirely. The diagnostic puzzle provides zero signal.
The fix has three parts, all low-effort changes to setup.sh:
- Fix the corruption method — corrupt only
format.json(notxl.meta), using valid JSON with wrong disk UUIDs (not random bytes). This blocks bothmc admin infoand filesystem inspection on the running pod. - Make the monitoring classifier confidently wrong — point it at data1+data4 (truth is data1+data3), creating a false consensus trap that agents must see through after restart.