We would like to improve pipeline stability for corrective retries and discover metrics of when pipelines fail and how to implement improvements
Basically, we want to build a native metrics dashboards in Harness to pinpoint which stage fails and why so that we can continuously improve our pipelines and templates.