Final Update: Friday, 08 September 2017 21:31 UTC
We’ve confirmed that all systems are back to normal with no customer impact as of 09/08, 13:10 UTC. Our logs show the incident started on 09/08, 12:50 UTC and that during the 20 minutes that it took to resolve the issue,50% of customers would have experienced 10% of data loss while viewing live metrics data.
We’ve confirmed that all systems are back to normal with no customer impact as of 09/08, 13:10 UTC. Our logs show the incident started on 09/08, 12:50 UTC and that during the 20 minutes that it took to resolve the issue,50% of customers would have experienced 10% of data loss while viewing live metrics data.
- Root Cause: The failure was due to issues with Live Metrics Service in handling high volume of incoming data
- Lessons Learned: The service was redeployed with increased capacity to handle high volume of incoming data.
- Incident Timeline: 20 minutes - 09/08, 12:50 UTC through 09/08, 13:10 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Nilesh