Join the LLR family of private equity-backed growth companies.

Site Reliability Engineer-Career

Midigator

Midigator

Software Engineering
Thiruvananthapuram, Kerala, India
Posted on Mar 11, 2026
What You'll Do

  • Manage system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures.
  • Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
  • Build CI/CD pipelines for build, test and deployment of application and cloud architecture patterns, using platform (Jenkins) and cloud-native toolchains.
  • Build automated tooling to deploy service requests to push a change into production. Build runbooks that are comprehensive and detailed to manage detect, remediate and restore services.
  • Solve problems and triage complex distributed architecture service maps. On call for high severity application incidents and improving run books to improve MTTR
  • Lead availability blameless postmortem and own the call to action to remediate recurrences.

What Experience You Need

  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
  • 5-7 years of experience in software engineering, systems administration, database administration, and networking
  • 2+ years of experience developing and/or administering software in public cloud
  • Cloud Certification Strongly Preferred
  • Proficiency with continuous integration and continuous delivery tooling and practices
  • System administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
  • Demonstrable cross-functional knowledge with systems, storage, networking, security and databases
  • Experience in languages such as Python, Bash, Java, Go JavaScript and/or node.js
  • Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives

What Could Set You Apart

  • You have expertise designing, analyzing and troubleshooting large-scale distributed systems.
  • You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
  • Kubernetes (CKA, CKAD) or cloud certifications.
  • You are passionate for automation with a desire to eliminate toil whenever possible
  • You’ve built software or maintained systems in a highly secure, regulated or compliant industry
  • You thrive in and have experience and passion for working within a DevOps culture and as part of a team
  • BS in Computer Science or related field.
  • 2+ years of experience developing and/or administering software in public cloud
  • 5+ years of programming experience (Python, Bash/Shell Script, Java, Go, etc.).
  • 3+ years of experience monitoring infrastructure and application performance.
  • 5+ years experience of system administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
  • 5+ years experience working with continuous integration and continuous delivery tooling and practices
  • Kubernetes: Design, deploy, and manage production-ready Kubernetes clusters.
  • Cloud Infrastructure: Build and maintain scalable infrastructure on GCP using tools like Terraform.
  • Performance: Identify and resolve performance bottlenecks in applications and infrastructure.
  • Observability: Implement monitoring and logging to proactively detect and resolve issues.
  • Incident Response: Participate in on-call rotations, troubleshooting and resolving production incidents.
  • Collaboration: Promote reliability best practices and ensure smooth deployments.
  • Automation: Build CI/CD pipelines, automated tooling, and runbooks.
  • Problem Solving: Triage complex issues, lead blameless postmortems, and drive remediation.
  • Mentorship: Guide and mentor other SREs.