Site Reliability Engineering at Salesforce
Site Reliability Engineering at Salesforce
In this talk you will learn about how the Global Site Reliability team coordinates Incident Management for critical incidents within the Salesforce core applications. You will also learn what we do to ensure service availability, as well as the opportunity to see the lengths to which we've gone to research and develop the best methodology to consistently and efficiently respond when things do go wrong with the service and further to continually test & improve the resiliency of the service on an ongoing basis. You will also get an insiders view into the workings of our response plans, the roles, the strategy and the measurable gains we've incurred as a result.