All Systems Operational
Website   Operational
90 days ago
100.0 % uptime
Today
Graph rendering   Operational
90 days ago
100.0 % uptime
Today
Ingestion   Operational
90 days ago
100.0 % uptime
Today
Alerting   Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
www.hostedgraphite.com uptime ?
Fetching
Interface health: TCP ?
Fetching
Interface health: UDP ?
Fetching
Interface health: StatsD ?
Fetching
Interface health: HTTP API ?
Fetching
Interface health: carbon relay (pickle) ?
Fetching
Interface health: Heroku integration ?
Fetching
AWS connectivity (US-East-1) ?
Fetching
AWS connectivity (US-West-1) ?
Fetching
Past Incidents
May 22, 2018

No incidents reported today.

May 21, 2018

No incidents reported.

May 20, 2018

No incidents reported.

May 19, 2018

No incidents reported.

May 18, 2018

No incidents reported.

May 17, 2018

No incidents reported.

May 16, 2018
Resolved - The affected data has been replayed.
May 16, 15:36 UTC
Identified - We have identified a failure in one of our aggregation servers resulting in leading edge data being unavailable for approximately 1% of all metrics for all resolutions.

No data has been lost, and all data will be available again once the affected data has been replayed.
May 16, 14:42 UTC
Resolved - The affected data has been replayed.
May 16, 09:18 UTC
Identified - We have identified a failure in one of our aggregation servers resulting in leading edge data being unavailable for approximately 1% of all metrics for all resolutions.

No data has been lost, and all data will be available again once the affected data has been replayed.
May 16, 07:54 UTC
May 15, 2018
Resolved - We have now fully replayed the aggregate metric data and the incident is now resolved.
May 15, 21:47 UTC
Identified - Aggregate metrics ingested after 11:33 UTC have been processed as normal and we are currently replaying data for the affected aggregate metrics. We'll provide an update when the replay has finished.
May 15, 12:07 UTC
Investigating - Leading edge data for 14% of our aggregate metrics is currently unavailable for querying.

This does not impact most users sending us normal metrics, it only affects users using our server-side aggregation feature. This manifests as a gap in recent history on aggregate metrics, but historical data is unaffected.

We're currently working on relaying the missing data.
May 15, 11:44 UTC
May 14, 2018

No incidents reported.

May 13, 2018

No incidents reported.

May 12, 2018

No incidents reported.

May 11, 2018

No incidents reported.

May 10, 2018
Resolved - This incident has been resolved.
May 10, 12:28 UTC
Investigating - From 11:31UTC up to 22% of reads to short term storage have failed.

Ingestion continues as normal and no data has been lost.
May 10, 12:09 UTC
May 9, 2018
Resolved - At 13:55 UTC, the network issues were resolved and no data was lost.
May 9, 14:01 UTC
Investigating - Our hosting provider's network is experiencing connectivity issues.

At 13:35 UTC, we began seeing upto 6% of leading edge queries and 0.8% of long-term queries fail. No data has been lost and we are working to mitigate the network issues.
May 9, 13:51 UTC
May 8, 2018
Our hosting provider's network is experiencing connectivity issues. Between 09:56 UTC and 10:13 UTC, 5% of leading edge queries and 1.5% of long-term queries failed. No data has been lost and the network issues have subsided.
May 8, 10:21 UTC