🌿 Back to all jobs

🥝 Senior Site Reliability Engineer

EPAM Systems, Inc. | desde casa, Mexico | Posted June 27, 2026

Job Description

**Responsibilities**
- Participate in on-call rotations and provide 24/7 support for critical systems
- Deploying microservices accordingly to release cadence
- Develop and maintain infrastructure as code using Terraform
- Collaborate with engineering teams to identify and prioritize reliability, performance improvements, rightsizing of the dedicated cloud resources.
- Participate in incident management and response using ServiceNow
- Manage and resolve technical issues and tickets using Jira
- Developing knowledge base of the maintaining existing infrastructure and monitoring services
**Requirements**:
- 3+ years of experience in an SRE, DevOps or system administration role
- Deep knowledge Google Cloud Platform (GCP)
- Experience with incident management and response using ServiceNow or similar tools
- Strong problem-solving skills and experience with debugging complex technical issues
- Understanding of monitoring, logging, and alerting systems...

Apply for This Position

Submit Application