Join the LLR family of private equity-backed growth companies.

Site Reliability Engineering Front Line Manager

Midigator

Software Engineering

Pimpri-Chinchwad, Maharashtra, India

Posted on Mar 14, 2026

What you’ll do

Oversee and supervise a team of SREs, providing guidance on work efforts, clarifying objectives and resolving issues that arise.
Monitor, develop and operationalize database and big data environments and their supporting infrastructure, while driving continuous improvement cycles, process automation and agility in business deliveries.
Manage the uptime of complex systems across cloud-native (AWS, GCP) and hybrid architectures.
Create and maintain cloud infrastructure capacity plans using estimation models that meet the expected service level objectives of the system(s).
Operate systems at an optimal cost while maintaining availability targets.
Ensure infrastructure as code (IaC) patterns designed and built by the team meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
Create the communication narrative to influence product, engineering, security, Cloud CoE and customers on reliability and uptime issues and improvements.
Identify, recruit, develop and retain SRE talent.

What experience you need

BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
7+ years of experience developing and/or administering software in public cloud
7+ years of experience in languages such as Python, Bash, Java, Go, JavaScript and/or Node.js
7+ years of experience monitoring infrastructure and application uptime and availability to ensure functional and performance objectives are met
7+ years of cross-functional experience with systems, storage, networking, security and databases
7+ years of system administration experience, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
7+ years of experience working with continuous integration and continuous delivery tooling and practices

What could set you apart

You have expertise designing, analyzing and troubleshooting large-scale distributed systems
You take a systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
You have experience managing infrastructure as code via tools such as Terraform or CloudFormation
You are passionate about automation, with a desire to eliminate toil whenever possible
You’ve built software or maintained systems in a highly secure, regulated or compliant industry
You thrive in a DevOps culture, with experience of and passion for working as part of a team