As a service owner responsible for your company's online storefront, you need to identify and respond to availability issues. You also want to drive operational optimization and excellence within your organization to improve mean-time-to-detect (MTTD) and mean-time-to-respond (MTTR).
This article is part of the Splunk Use Case Explorer for Observability, which is designed to help you identify and implement prescriptive use cases that drive incremental business value. It explains the solution using a fictitious example company, called CSCorp, that hosts a cloud native application called Online Boutique. In the AIOps lifecycle described in the Use Case Explorer, this article is part of Digital experience monitoring.
Online Boutique is the main offering of CSCorp. If the Online Boutique website becomes unavailable, CSCorp’s customers could look to other vendors to provide them with the goods they want to purchase. This could cause CSCorp to lose revenue to their competitors.
In general, they need to know:
- How do we quickly identify (MTTD) when the Online Boutique is unavailable?
- How do we remediate (MTTR) the issue and get Online Boutique back up as quickly as possible?
More immediately, the NGNIX web servers that deliver the Online Boutique service unexpectedly shuts down, causing the Online Boutique website to become unavailable.
How to use Splunk software for this use case
An Uptime Check makes a request to a URL and monitors the response time and the response code. When looking at the results of specific runs, Splunk Synthetic Monitoring also measures the Time to First Byte and DNS time. If an Uptime check fails, Splunk Synthetic Monitoring returns a response code and a traceroute.
There are two basic types of Uptime Checks:
HTTP. Check the Uptime or response of a single URL or endpoint. The HTTP Uptime check goes beyond a simple ping. When it runs, Splunk Synthetic Monitoring waits up to 60 seconds for a full response from the server before closing the connection.
Port. Monitor popular or custom ports on your servers via TCP or UDP protocol.
To monitor the Uptime of several URLs, or for more complex transactions, check out Splunk’s API checks use cases.
The CSCorp team decided to record and test a simple Splunk Synthetic Monitoring uptime check to identify when the service is unavailable and provide the appropriate notification with a response code and traceroute.
Now, look at how CSCorp created and tested a Uptime Check for the Online Boutique application.
Still having trouble? Splunk has many resources available to help get you back on track.
- Splunk OnDemand Services: Use these credit-based services for direct access to Splunk technical consultants with a variety of technical services from a pre-defined catalog. Most customers have OnDemand Services per their license support plan. Engage the ODS team at OnDemand-Inquires@splunk.
com if you require assistance.
- Splunk Answers: Ask your question to the Splunk Community, which has provided over 50,000 user solutions to date.
- Splunk Customer Support: Contact Splunk to discuss your environment and receive customer support.
- Splunk Observability Training Courses: Comprehensive Splunk training to fully unlock the power of Splunk Observability Cloud.