Self-service observability
As DevOps teams develop applications in the cloud, they need visibility into the performance of their apps and infrastructure. Without a centralized observability team or a platform engineering team, each DevOps team likely acquires its own monitoring tools with two negative outcomes:
- When a performance issue or an outage occurs, it’s hard for individual teams to know whether their service caused the issue. To resolve the issue all the teams join a call and share what they see on their dashboards. However, each dashboard presents a different picture and there is no single source of truth for the state of IT. The result is long MTTR.
- With each team monitoring their own services in a silo, it is common for developers to instrument and send more data than needed, or to turn on monitoring features and then forget about them. As a result, one team can easily burn through the entire monitoring budget that the organization set in a very short time.
Many organizations try to resolve the problem described above by creating a centralized observability team or a platform engineering team to provide DevOps with the tools they need. However most existing monitoring tools in the market have one or both of the following limitations:
- They do not support sharing of dashboards, alerts, and best practices, making it difficult for different DevOps teams to create a single source of truth. Because of this, MTTRs remain long.
- There are no or few cost controls in place, so individual teams can still burn through their entire monitoring budget.
How can Splunk help?
Expected outcomes
With Splunk Observability Cloud, platform engineers provide software engineers with observability tools that provide all the DevOps teams with a single source of truth, enable them to share best practices across the org, and help them collaborate to minimize MTTR. At the same time, the platform engineers can monitor and maintain access and cost control of these observability tools so that everyone operates within budget.