Site Reliability Engineer-Career
Midigator
Software Engineering
Thiruvananthapuram, Kerala, India
Posted on Mar 11, 2026
What You'll Do
- Manage system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures.
- Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
- Build CI/CD pipelines for build, test and deployment of application and cloud architecture patterns, using platform (Jenkins) and cloud-native toolchains.
- Build automated tooling to deploy service requests to push a change into production. Build runbooks that are comprehensive and detailed to manage detect, remediate and restore services.
- Solve problems and triage complex distributed architecture service maps. On call for high severity application incidents and improving run books to improve MTTR
- Lead availability blameless postmortem and own the call to action to remediate recurrences.
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
- 5-7 years of experience in software engineering, systems administration, database administration, and networking
- 2+ years of experience developing and/or administering software in public cloud
- Cloud Certification Strongly Preferred
- Proficiency with continuous integration and continuous delivery tooling and practices
- System administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
- Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
- Experience in languages such as Python, Bash, Java, Go JavaScript and/or node.js
- Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives
- You have expertise designing, analyzing and troubleshooting large-scale distributed systems.
- You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- Kubernetes (CKA, CKAD) or cloud certifications.
- You are passionate for automation with a desire to eliminate toil whenever possible
- You’ve built software or maintained systems in a highly secure, regulated or compliant industry
- You thrive in and have experience and passion for working within a DevOps culture and as part of a team
- BS in Computer Science or related field.
- 2+ years of experience developing and/or administering software in public cloud
- 5+ years of programming experience (Python, Bash/Shell Script, Java, Go, etc.).
- 3+ years of experience monitoring infrastructure and application performance.
- 5+ years experience of system administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
- 5+ years experience working with continuous integration and continuous delivery tooling and practices
- Kubernetes: Design, deploy, and manage production-ready Kubernetes clusters.
- Cloud Infrastructure: Build and maintain scalable infrastructure on GCP using tools like Terraform.
- Performance: Identify and resolve performance bottlenecks in applications and infrastructure.
- Observability: Implement monitoring and logging to proactively detect and resolve issues.
- Incident Response: Participate in on-call rotations, troubleshooting and resolving production incidents.
- Collaboration: Promote reliability best practices and ensure smooth deployments.
- Automation: Build CI/CD pipelines, automated tooling, and runbooks.
- Problem Solving: Triage complex issues, lead blameless postmortems, and drive remediation.
- Mentorship: Guide and mentor other SREs.