Recently I was summoned for a severe production issue where MS
DTC failed to come online after a failed attempt of SQL Cluster failover. By the time I logged in, it was already a seriously escalated issue so I was under tremendous pressure to bring everything online immediately.
I found many critical errors logged for the failed MS
Though it was not an ideal approach, it was a quick and practical resolution with in 5 minutes and everyone was happy and back in business. Of course this issue is later investigated in-depth (which lead to some serious issue which needed attention in this cluster) but moral of this post is when there is a serious production issue and you know a solution to fix it, then first fix it and later find the root cause.