Site Reliability Engineering Front Line Manager
Midigator
Software Engineering
Pimpri-Chinchwad, Maharashtra, India
Posted on Mar 14, 2026
What you’ll do
- Oversee and supervise a team of SRE's, providing guidance on work efforts, understanding of objectives and resolution of issues that may arise.
- Monitor, develop and operationalize database and big data environments, with their respective infrastructures, in addition to ensuring continuous improvement cycles, process automation and agility in business deliveries.
- Manage complex system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures.
- Create and maintain cloud infrastructure capacity plans using estimation models that meet the expected service level objectives of the system(s).
- Operate systems at an optimal cost while maintaining availability targets.
- Ensure infrastructure as code (IAC) patterns designed and built by the team meets security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
- Create the communication narrative to influence product, engineering, security, Cloud CoE, customers for the reliability and uptime issues and improvements.
- Identify, recruit, develop and retain SRE talent.
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
- 7+ years of experience developing and/or administering software in public cloud
- 7+ years experience in languages such as Python, Bash, Java, Go JavaScript and/or node.js
- 7+ years experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives
- 7+ years experience of cross-functional knowledge with systems, storage, networking, security and databases
- 7+ years experience of system administration skills, including automation and orchestration of Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)
- 7+ years experience working with continuous integration and continuous delivery tooling and practices
- You have expertise designing, analyzing and troubleshooting large-scale distributed systems
- You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- You have experience managing Infrastructure as code via tools such as Terraform or CloudFormation
- You are passionate for automation with a desire to eliminate toil whenever possible
- You’ve built software or maintained systems in a highly secure, regulated or compliant industry
- You thrive in and have experience and passion for working within a DevOps culture and as part of a team