All Systems Operational
Website Operational
90 days ago
100.0 % uptime
Today
Graph rendering Operational
90 days ago
100.0 % uptime
Today
Ingestion Operational
90 days ago
100.0 % uptime
Today
Alerting Operational
90 days ago
100.0 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
had a major outage
had a partial outage
www.hostedgraphite.com uptime ?
Fetching
Interface health: TCP ?
Fetching
Interface health: UDP ?
Fetching
Interface health: StatsD ?
Fetching
Interface health: HTTP API ?
Fetching
Interface health: carbon relay (pickle) ?
Fetching
Graph render time (95th percentile)
Fetching
Interface health: Heroku integration ?
Fetching
AWS connectivity (US-East-1) ?
Fetching
AWS connectivity (US-West-1) ?
Fetching
Past Incidents
Oct 22, 2019

No incidents reported today.

Oct 21, 2019

No incidents reported.

Oct 20, 2019

No incidents reported.

Oct 19, 2019

No incidents reported.

Oct 18, 2019

No incidents reported.

Oct 17, 2019
Resolved - This incident has been resolved.
Oct 17, 13:07 UTC
Update - Our aggregation layer has suffered a further decrease in capacity leading to backlogs of up to 5 minutes..

We have expanded capacity in our aggregation layer to help work through the backlogs.
Oct 17, 12:20 UTC
Monitoring - A fix has been implemented and we are monitoring the results.
Oct 17, 10:55 UTC
Update - As of 10:52 UTC our aggregation layer has returned to full health and all backlogs have been replayed.

We continue to monitor the situation.
Oct 17, 10:53 UTC
Investigating - As of 10:12 UTC network connectivity issues have caused datapoints to be dropped in our aggregation layer.
We have switched to a less strict healthcheck mechanism and are seeing recovery.
Backlogs of up to 7 minutes are currently being replayed.

This will have caused delays in processing datapoints leading to gaps in graphs causing alerts to trigger in error.

No data has been lost.
Oct 17, 10:35 UTC
Oct 16, 2019
Resolved - This incident has been resolved.
Oct 16, 16:14 UTC
Update - All backlogs have been replayed.

This incident is resolved.
Oct 16, 13:35 UTC
Monitoring - During a brief period of network connectivity issues, health check failures in our aggregation layer caused a delay in processing of datapoints.

This will have caused alerts to trigger in error and gaps in graphs.

We have switched to a less strict health check for our aggregation services and are seeing recovery.
Oct 16, 13:02 UTC
Oct 15, 2019

No incidents reported.

Oct 14, 2019

No incidents reported.

Oct 13, 2019

No incidents reported.

Oct 12, 2019

No incidents reported.

Oct 11, 2019

No incidents reported.

Oct 10, 2019

No incidents reported.

Oct 9, 2019

No incidents reported.

Oct 8, 2019

No incidents reported.