
Site Reliability Engineer III
9 hours ago
As an Observability Engineer on the Site Reliability Engineering team, you will play a crucial part in ensuring the availability, performance, and scalability of our cloud platform. You will blend software engineering and systems administration expertise to build and run large-scale, distributed, fault-tolerant systems. Your mission is to ensure our services are reliable and efficient through automation, robust monitoring, and proactive incident response. You will work closely with development teams to build resilient and scalable applications on our Google Cloud Platform (GCP) and Kubernetes-based infrastructure. Strong troubleshooting skills and a methodical approach to problem-solving are a must.
Key Responsibilities:
Infrastructure as Code (IaC): Design, build, and maintain our core cloud infrastructure on GCP using tools like Terraform and Google Config Connector (KCC) within a GitOps framework.
Automation: Apply Infrastructure as Code (IaC) with Kubernetes (GKE) and Google Config Connector (KCC). Develop automation scripts and tools (primarily in Python or Go) to reduce operational toil, streamline deployments, and improve system efficiency.
Observability: Implement and manage comprehensive monitoring, logging, and alerting solutions using tools like Prometheus, OpenTelemetry, Grafana, and Google Cloud's operations suite to gain deep insights into system health.
Reliability & SLOs: Define, measure, and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) for critical services. Drive initiatives to meet and exceed these objectives. Develop and promote dashboarding and actionable alerting across the organization.
Incident Management: Participate in an on-call rotation to respond to and resolve production incidents. Lead blameless post-mortems to identify root causes and implement lasting solutions.
Collaboration: Partner with software engineering teams throughout the development lifecycle to provide guidance on building reliable, scalable, and secure applications. Help them troubleshoot complex issues, improve service performance, and adopt observability best practices.
Enhance Reliability: Analyze observability data to identify trends, uncover potential issues, and drive initiatives to improve system reliability, performance, and cost-efficiency.
Secure and Scale: Manage secrets and system configurations securely using HashiCorp Vault and ensure the observability platform scales to meet the demands of a growing engineering organization.
Qualifications Required:
- Bachelor's degree in computer science, a related technical field, or equivalent practical experience.
- 3-8 years of experience in a Site Reliability, DevOps, or Software Engineering role.
- Strong proficiency in at least one high-level programming language (e.g., Python, Go, Java).
- Hands-on experience with cloud platforms, particularly Google Cloud Platform (GCP).
- Solid understanding and practical experience with containerization (Docker) and orchestration (Kubernetes).
- Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or Google Config Connector.
- Familiarity with CI/CD principles and tools (e.g., GitLab CI, Jenkins).
- Knowledge of GitOps principles and tools.
- Excellent communication skills and the ability to work effectively in a collaborative team environment.
CME Group: Where Futures are Made
CME Group is the world's leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it – all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more.
At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone's perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic.
Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process.
-
Site Reliability Engineer II
1 day ago
Bangalore - Bagmane Tridib, India CME Group Full time ₹ 1,04,000 - ₹ 13,08,780 per year
CME Group is the world's leading and most diverse derivatives marketplace, offering futures and options across a wide range of industries. We are seeking a passionate SRE to join our dynamic team. The Application Site Reliability Engineer II will help ensure the reliability and performance of our Markets trading and real-time post-trade systems; systems...
-
Systems and Application Support Engineer
1 week ago
Bangalore - Bagmane Tridib, India CME Group Full time ₹ 9,00,000 - ₹ 12,00,000 per year
The Systems Engineer III ESP supports the Enterprise Server Platforms at CME Group. The incumbent must have knowledge of server (Windows or Linux) administration, configuration, troubleshooting, security, scripting, and networking. Strong communication and documentation skills are required as the candidate will typically be working with customers for support...
-
Mgr Software Engineering
7 days ago
Bangalore - Bagmane Tridib, India CME Group Full time US$ 1,50,000 - US$ 2,00,000 per year
The Manager Software Engineering independently manages a team that is accountable for engineering secure, scalable and reliable technology solutions to advance CMEG in the global marketplace and serve risk management needs of customers around the world.
Principal Accountabilities: Demonstrates advanced language proficiency; Contributes to the architectural...
-
Software Engineer I
2 days ago
Bangalore - Bagmane Tridib, India CME Group Full time US$ 80,000 - US$ 1,20,000 per year
The Software Engineer I engineers secure, scalable and reliable technology solutions, with appropriate mentoring, to advance CMEG in the global marketplace and serve risk management needs of customers around the world.
Principal Accountabilities:
• Conducts coding at a small task level; potentially minimal design.
• Conducts unit testing of own code....
-
Sr Software Engineer
7 days ago
Bangalore - Bagmane Tridib, India CME Group Full time US$ 90,000 - US$ 1,20,000 per year
The Senior Software Engineer engineers secure, scalable and reliable technology solutions, with minimal mentoring, to advance CMEG in the global marketplace and serve risk management needs of customers around the world.
Principal Accountabilities: Designs, develops, documents, troubleshoots and debugs web applications using modern technologies. Demonstrates...
-
Senior Reliability Engineer
1 week ago
Bengaluru / Bangalore, Pune, India beBeeReliability Full time US$ 1,50,000 - US$ 2,00,000
Job Description: As a seasoned technologist, you will play a key role in shaping the future of our company's infrastructure and applications. Your expertise in site reliability engineering will be instrumental in driving business growth and ensuring the stability and scalability of our systems.
Key Responsibilities: Collaborate with cross-functional teams to...
-
Software Dev Engineer
6 days ago
Bangalore, India Umanist Staffing LLC Full time
Position: Software Dev Engineer III
Location: Bangalore
Duration: 8 Months
Job Type: Contract
Work Type: Onsite
Job Description: Complete all Production Readiness related tasks (Integration Testing, Metrics and Alarms setup, Runbook updates, AWS CDK setup) for an AWS application. Working collaboratively with cross functional teams (Ex: Technical requirement...
-
Engineering Manager
2 weeks ago
Bangalore, India Ford Global Career Site Full time US$ 1,25,000 - US$ 1,75,000 per year
As a Senior Salesforce Engineering Manager with a focus on Salesforce Service Cloud, Sales Cloud, Data Cloud and Marketing Cloud, you will be responsible for overseeing and driving the delivery, technical strategy and implementation of Salesforce solutions, sales processes and marketing objectives of our organization. Your role will encompass project...
-
Reliability Specialist
1 week ago
Bengaluru / Bangalore, Hyderabad / Secunderabad, Telangana, Chennai, India beBeePerformance Full time ₹ 12,00,000 - ₹ 30,00,000
Our ideal candidate is a seasoned Systems Operations professional, well-versed in managing the stability and scalability of large-scale distributed software applications. As a Site Reliability Engineer, you will be responsible for ensuring the overall health and performance of our production environment. The role involves monitoring system metrics,...
-
Reliable Software Developer
4 days ago
Hyderabad / Secunderabad, Telangana, Chennai, Bengaluru / Bangalore, India beBeeSoftware Full time ₹ 12,00,000 - ₹ 30,00,000
As a Site Reliability Engineer, you are responsible for ensuring the uptime and performance of our production application. This role involves monitoring site reliability, addressing technical issues, automating maintenance tasks, and collaborating with cross-functional teams to meet business objectives.
Main Responsibilities: Run the production environment by...