Platform Operations
All Systems Operational
Customer Sites?
Dashboard?
Workflow Operations?
Spinup Operations?
System Metrics Month Week Day
Customer Site Availability
fetching...
Dashboard Response Time
fetching...
Past Incidents
Apr 23, 2014
Resolved - Dashboards are operating normally. All clear.
Apr 23, 15:59 PDT
Monitoring - Dashboard operations are returning to normal but we're still monitoring for any degradation.
Apr 23, 15:20 PDT
Update - We're working on a fix. In the meantime, you may see inconsistent data in your dashboard (i.e. sites not listed). There is no data loss, merely inconsistent results.
Apr 23, 14:55 PDT
Identified - We've identified the issue and we're working on a fix. Please standby.
Apr 23, 14:50 PDT
Investigating - Our monitoring has detected elevated error dashboard rates, which may manifest as slow page loads or failed logins.
Apr 23, 14:38 PDT
Resolved - Affected sites have recovered. All clear.
Apr 23, 15:13 PDT
Monitoring - Sites are returning to normal service.
Apr 23, 14:57 PDT
Identified - We are addressing an infrastructure failure that is affecting a small portion of customer sites.
Apr 23, 14:50 PDT
Resolved - Dashboard SSL certificate updates are operational. All clear.
Apr 23, 12:32 PDT
Monitoring - We've deployed a fix for the SSL certificate update issue.
Apr 23, 12:27 PDT
Identified - We've identified the issue with site SSL certificate updates.
Apr 23, 11:23 PDT
Investigating - We've detected an issue with updating SSL certificates on site dashboards.
Apr 23, 10:56 PDT
Apr 22, 2014

No incidents reported.

Apr 21, 2014
Resolved - This incident has been resolved.
Apr 21, 13:18 PDT
Identified - We are investigating a failed application endpoint.
Apr 21, 12:56 PDT
Resolved - The application endpoint and its services are online. All clear.
Apr 21, 10:58 PDT
Identified - We've rebooted the application endpoint and services are coming back online.
Apr 21, 10:42 PDT
Investigating - We are investigating a failed application endpoint.
Apr 21, 10:35 PDT
Apr 20, 2014

No incidents reported.

Apr 19, 2014
Resolved - All clear.
Apr 19, 17:34 PDT
Identified - We are investigating a failed application endpoint.
Apr 19, 17:08 PDT
Apr 18, 2014
Resolved - This incident has been resolved.
Apr 18, 17:01 PDT
Monitoring - Service has been restored and all metrics for the endpoint are stable as of the past 10 mins. We are continuing to monitor.
Apr 18, 15:52 PDT
Identified - We are investigating a failed application endpoint.
Apr 18, 15:20 PDT
Apr 17, 2014
Resolved - No further issues. Resolving.
Apr 17, 15:14 PDT
Monitoring - The primary issues has been resolved after we performed an emergency index rebuild on the Dashboard/API's main data store was initiated. We are moving this incident to monitoring while we perform further checks to ensure the Dashboard has returned to a fully operational state.
Apr 17, 14:06 PDT
Investigating - We are working on resolving intermittent issues on the dashboard that may temporarily affect actions such as logins.
Apr 17, 13:38 PDT
Resolved - This incident has been resolved.
Apr 17, 14:01 PDT
Monitoring - We are now seeing all sites routing normally, but are continuing to monitor closely.
Apr 17, 13:59 PDT
Investigating - Routing issues affecting site loads for some sites.
Apr 17, 13:40 PDT
Resolved - This incident has been resolved.
Apr 17, 12:28 PDT
Update - Service has been restored.
Apr 17, 12:28 PDT
Monitoring - Dashboard performance is returning to normal. We are continuing to monitor.
Apr 17, 12:07 PDT
Investigating - Our monitoring has detected elevated error dashboard rates, which may manifest as slow page loads or failed logins.
Apr 17, 12:01 PDT
Apr 16, 2014

No incidents reported.

Apr 15, 2014
Resolved - The cacheserver endpoint is back to normal, and affected sites have recovered. All clear.
Apr 15, 15:38 PDT
Investigating - We are investigating a failed cacheserver endpoint.
Apr 15, 15:21 PDT
Resolved - The cacheserver endpoint is back to normal. All clear.
Apr 15, 10:44 PDT
Monitoring - The cacheserver endpoint is back online and affected sites are recovering.
Apr 15, 10:38 PDT
Investigating - We are investigating a failed cacheserver endpoint.
Apr 15, 10:31 PDT
Apr 14, 2014
Resolved - Workflow operations are running smoothly. All clear.
Apr 14, 17:06 PDT
Monitoring - Workflow node restored. We are continuing to monitor it closely. Customers who's workflow operations were interrupted can re-try them at this time.
Apr 14, 16:31 PDT
Update - Workflow node is experiencing major networking issues and the underlying host is now down. We are working to restore service on the existing host and also to bring up a new workflow node.
Apr 14, 15:56 PDT
Identified - We are restarting a workflow node.
Apr 14, 15:32 PDT
Investigating - We are investigating abnormally long workflow queues.
Apr 14, 15:29 PDT
Apr 13, 2014

No incidents reported.

Apr 12, 2014

No incidents reported.

Apr 11, 2014

No incidents reported.

Apr 10, 2014
Resolved - New service requests (tickets) are now showing in the dashboard.
Apr 10, 12:11 PDT
Investigating - New service requests (tickets) aren't showing in the dashboard. We're still receiving & responding to the requests.
Apr 10, 11:39 PDT
Apr 9, 2014
Resolved - Dashboards are now operating normally. All clear.
Apr 9, 17:58 PDT
Identified - We've identified the issue and have deployed a fix.
Apr 9, 17:53 PDT
Investigating - Our monitoring has detected elevated error dashboard rates, which may manifest as slow page loads or failed logins.
Apr 9, 17:32 PDT
Resolved - All clear.
Apr 9, 09:08 PDT
Investigating - We are investigating a failed database endpoint.
Apr 9, 08:33 PDT