- Operational excellence in the cloud involves designing, implementing, and managing cloud solutions that provide value, performance, security, and reliability.
- The recommendations in this pillar help you to continuously improve and adapt workloads to meet the dynamic and ever-evolving needs in the cloud.
- To achieve operational excellence, you should embrace automation, orchestration, and data-driven insights
- Automation helps to eliminate toil. It also streamlines and builds guardrails around repetitive tasks. Orchestration helps to coordinate complex processes. Data-driven insights enable evidence-based decision-making
- Secure options for data in transit, and default encryption for data at rest.
- Built-in security features for cloud products and services.
- A global infrastructure that's designed for geo-redundancy, with security controls throughout the information-processing lifecycle.
- Automation capabilities that use infrastructure as code (IaC) and configuration guardrails.
- Data and systems are protected through multiple layered defenses using policies and controls that are configured across IAM, encryption, networking, detection, logging, and monitoring.
- Measurable reliability goals that you promptly correct whenever deviations occur
- Design patterns for the following: Scalability,High availability,Disaster recovery and Automated change management
- Components that self-heal (have the ability to remediate issues without manual interventions)
- Hands-free operation such as service runs with minimal manual work, cognitive operator load, and rapid failure detection and mitigation
- Align cloud spending with business value: Ensure that your cloud resources deliver measurable business value by aligning IT spending with business objectives.
- Foster a culture of cost awareness: Ensure that people across your organization consider the cost impact of their decisions and activities, and ensure that they have access to the cost information required to make informed decisions.
- Optimize resource usage: Provision only the resources that you need, and pay only for the resources that you consume.
- Optimize continuously: Continuously monitor your cloud resource usage and costs, and proactively make adjustments as needed to optimize your spending. This approach involves identifying and addressing potential cost inefficiencies before they become significant problems.
- Optimizing the performance of workloads in the cloud can help your organization operate efficiently, improve customer satisfaction, increase revenue, and reduce cost.
- When the backend processing time of an application decreases, users experience faster response times, which can lead to higher user retention and more revenue.
- There might be trade-offs between performance and cost. But sometimes, optimizing performance can help you reduce cost.
- Autoscaling helps provide predictable performance when the load increases by ensuring that the resources aren't overloaded. Autoscaling also helps you reduce cost during periods of low load by removing unused resources.