On June 9, 2023, at approximately 22:59 UTC, the Elasticsearch cluster utilized by the Shelf KMS platform experienced an inaccessibility issue in the us-east-1 region. As a consequence, certain API operations that rely on this cluster encountered timeouts, resulting in a degradation of search functionality on the Shelf KMS platform.
During the period of the incident, which lasted until June 10, 2023, at 00:18 UTC, clients using the Shelf KMS platform experienced difficulties in retrieving search results for gems. While the Gem Page remained functional and available for viewing, clients were unable to effectively locate gems via search results or navigate using the search function. The total duration of the degraded functionalities was approximately 1 hour and 19 minutes.
The root cause of the incident was traced back to a networking issue that occurred within Elastic Cloud, a third-party service provider that Shelf KMS relies on for hosting the Elasticsearch cluster. Due to an underlying infrastructure outage outside of our direct control, Elastic Cloud experienced disruptions that adversely affected the performance and accessibility of the Elasticsearch cluster in the us-east-1 region.
For your reference, Elastic Cloud's incident summary can be found at the following link: https://status.elastic.co/incidents/07bw653d2677