Job Description
AWS Cloud Operations / Site Reliability Engineer (SRE) is responsible for delivering secure, reliable, and scalable cloud infrastructure. This role covers Infrastructure as a Service, AWS platform release activities, AMI lifecycle management, patching, infrastructure design documentation, terraform scripting and maintaining visibility into the application layer and how it functions in production environments. Experience with Harness for DevOps pipelines is a strong plus.
Required Qualifications
" 10+ years in SRE, Cloud Ops, or DevOps with heavy AWS experience.
" Strong hands-on experience with:
o AWS compute (EC2, ASG, EKS/ECS, Lambda)
o Networking (VPC, Route 53, SG/NACL, ALB/NLB)
o Storage (S3, EBS, EFS)
o Databases (RDS, Aurora, DynamoDB)
" Expert...