MX - Senior Platform Manager - IT Automation/Site Reliability
2 months ago
Life at MX :
We are driven by our moral imperative to advance mankind - and it all starts with our people, product and purpose. We always carry a deep sense of drive and passion with us.
If you thrive in a challenging work environment, surrounded by incredible team members who will help you grow, MX is the right place for you.
Come build with us and be part of an award-winning company that's helping create meaningful and lasting change in the financial industry.
Role : SENIOR MANAGER OF PLATFORM AUTOMATION AND SRE
Reports to : Head of India
Job Summary :
MX Technology, Inc. is a dynamic and rapidly growing financial company committed to empowering the world to be financially strong. Our team is passionate about driving innovation and achieving excellence in everything we do.
We are seeking a dynamic and experienced Senior Manager of Platform Automation and Site Reliability Engineering (SRE) to lead and scale our platform automation and SRE initiatives.
In this role, you will oversee the development and management of our platforms and infrastructure, ensuring their reliability, scalability, and performance.
You will drive the adoption of automation and DevOps practices across teams, and be instrumental in enhancing our CI/CD pipelines, Kubernetes environments, and databases, while fostering a culture of continuous improvement and operational excellence.
Key Responsibilities :
Leadership & Strategy :
- Lead a team of platform engineers, SREs, and automation experts, providing mentorship, guidance, and career development.
- Develop and execute a strategic vision for platform automation, infrastructure reliability, and scalability that aligns with business goals.
- Collaborate with cross-functional teams to drive the adoption of DevOps best practices and ensure alignment with business objectives.
Platform Automation :
- Design, implement, and manage automated solutions for infrastructure provisioning, configuration management, and monitoring.
- Lead efforts to automate manual processes, reduce technical debt, and improve developer productivity.
- Oversee the management and optimization of Kubernetes clusters, ensuring high availability and efficient resource utilization.
Site Reliability Engineering (SRE) :
- Establish and maintain SRE best practices, focusing on reliability, performance, and scalability of production systems.
- Define and monitor SLOs/SLAs, and work to proactively identify and resolve performance bottlenecks.
- Develop incident response processes, lead root cause analysis, and drive post-mortem reviews to prevent future incidents.
CI/CD & Tooling :
- Enhance and maintain CI/CD pipelines to enable rapid and reliable software delivery.
- Evaluate and implement tools that improve developer workflows, release management, and infrastructure as code (IaC).
- Collaborate with development teams to ensure seamless integration of tooling and automation within their workflows.
Database & Infrastructure Management :
- Oversee the performance, reliability, and security of databases, ensuring best practices in monitoring, backup, and recovery.
- Manage cloud and on-premise infrastructure, optimizing for cost efficiency, security, and scalability.
- Drive initiatives to modernize and scale the infrastructure, including cloud migration, containerization, and microservices architecture.
Collaboration & Communication :
- Work closely with product, engineering, and operations teams to ensure alignment on project goals and timelines.
- Communicate effectively with stakeholders at all levels, providing regular updates on project status, risks, and opportunities.
Continuous Improvement :
- Foster a culture of continuous learning and improvement, encouraging experimentation and innovation within the team.
- Stay current with industry trends and emerging technologies, applying them to enhance the platform and its operations.
Qualifications :
- Bachelor's degree in Computer Science, Engineering, or a related field.
- A Master's degree is a plus.
- 15+ years of experience in platform engineering, DevOps, or SRE roles, with at least 5 years in a management or leadership capacity.
- Proven experience with platform automation, Kubernetes, and container orchestration in a production environment.
- Strong background in CI/CD, infrastructure as code (IaC), and cloud-native technologies (e.g., AWS, Azure, GCP).
- Hands-on experience with databases (SQL, NoSQL), database optimization, and management.
- Experience in building and managing high-performing teams in a fast-paced environment.
Skills :
- Deep understanding of DevOps principles, including continuous integration, continuous delivery, and monitoring.
- Proficiency in scripting and programming languages (e.g., Ruby, Go, Java) for automation.
- Expertise in Kubernetes, Docker, Terraform, Ansible, Jenkins, and other related technologies.
- Strong problem-solving skills with a focus on root cause analysis and incident management.
- Excellent communication and collaboration skills, with the ability to work effectively across teams and departments.
Preferred Qualifications :
- Certifications in Kubernetes (CKA/CKAD), AWS, or other relevant technologies.
- Experience with observability tools (e.g., Prometheus, Grafana, ELK stack).
- Knowledge of security best practices in DevOps and infrastructure management.
- Experience with cloud-native architectures and microservices.
At MX, we seek to hire candidates who drive results and achieve successful outcomes.
We use a hybrid work arrangement, which may require both local and remote team members to be in the office when necessary to kick off projects, hold cross-team strategy meetings, or complete key deliverables.
Remote team members will travel into the office four times per year, and MX covers travel expenses associated with this requirement.
Both local and remote employees can take advantage of our incredible office space with onsite perks, company-paid meals, onsite massage therapists, sports simulator, gym, mother's lounge, and meditation room.