Site Reliability Engineer
4 weeks ago
Business Unit Cubic Transportation Systems Company Details When you join Cubic you become part of a company that creates and delivers technology solutions in transportation to make people s lives easier by simplifying their daily journeys and defense capabilities to help promote mission success and safety for those who serve their nation Led by our talented teams around the world Cubic is committed to solving global issues through innovation and service to our customers and partners We have a top-tier portfolio of businesses including Cubic Transportation Systems CTS and Cubic Defense CD Explore more on Cubic com Job Details The Junior Site Reliability Engineer is responsible for assisting in the design build and maintenance of the infrastructure and deployment systems that underpin our live environments This role is hands-on and highly collaborative working closely with development teams and senior SREs to ensure our systems are reliable scalable and well-instrumented Junior SREs are expected to learn and apply best practices in building robust automated solutions and to ensure their work is repeatable and understandable by others Every contribution should be accompanied by documentation to support knowledge-sharing within the team and across engineering Core Responsibilities Infrastructure Design Maintenance Assist in building and maintaining infrastructure using infrastructure-as-code IaC tools e g Terraform CloudFormation Support the provisioning and lifecycle management of production staging and other critical environments Help implement shared infrastructure components e g logging metrics service mesh load balancing Contribute to improving infrastructure scalability availability and performance under the guidance of senior engineers Collaborate with development teams to provide infrastructure support for their deployment needs Deployment Systems CI CD Support and help extend CI CD pipelines GitHub Actions Argo CD to improve reliability and automation of deployments Help promote consistency and best practices across environments for deployment rollback and observability Work with developers to streamline testing and delivery of code to production Assist in reducing manual steps in the deployment and operations workflows Reliability Observability Tooling Assist in the implementation and maintenance of our monitoring alerting and logging infrastructure Kube-Prometheus-Grafana stack Help track SLOs SLIs for core services in partnership with service owners Learn to identify and help eliminate single points of failure performance bottlenecks and sources of instability Participate in reliability reviews and post-incident analysis Documentation Knowledge Sharing Ensure that all systems and processes you work on are accompanied by thorough up-to-date documentation Contribute to shared knowledge bases runbooks and developer-facing onboarding materials Participate in internal training sessions and pairings to learn from teammates Collaboration Culture Work closely with the SRE Lead and other team members to execute work aligned with team goals Engage constructively with other teams across engineering Participate in on-call rotations with strong support from senior members Embrace a culture of blameless learning transparency and continuous improvement Qualifications Skills Experience 3 years in a DevOps SRE or related role Cloud Basic understanding of cloud computing concepts with some hands-on experience in AWS Containers Orchestration Familiarity with Docker and a foundational understanding of Kubernetes concepts Experience with AWS ECS is a plus CI CD Exposure to CI CD principles and tools like GitHub Actions Familiarity with Argo CD is a bonus IaC Some experience with or exposure to Infrastructure as Code tools like Terraform or CloudFormation Scripting Proficiency in at least one scripting language e g Bash Python Observability A basic understanding of monitoring and logging Exposure to Prometheus and Grafana is desirable Collaboration Strong communication skills and a desire to learn and work within a team Problem Solving An enthusiastic and curious approach to solving technical challenges Worker Type Employee
-
Site Reliability Engineer
1 day ago
Hyderabad, Telangana, India UBS Full timeBusiness Divisions Group Functions Your role Are you an analytic thinker Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premise Cloud Database ensuring high availability stability of Database infrastructure services Do you want to play a key role in transforming our firm into an agile organization At UBS we...
-
Lead Site Reliability Engineer
3 days ago
Hyderabad, Telangana, India JPMorgan Chase Full timeJob Category Software Engineering Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability As a Lead Site Reliability Engineer at JPMorgan Chase within the Consumer Community Banking you hold a leadership role in your team demonstrate strong...
-
Site Reliability Engineer Iii
7 days ago
Hyderabad, Telangana, India JPMorgan Chase Full timeJob Category Software Engineering There s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world s most complex and mission-critical systems As a Site Reliability Engineer III at JPMorgan Chase within the Corporate Oversight Governance Team - Regulatory...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, India Whatjobs IN C2 Full timeJob Title: Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog | 24/7 Support Department: Site Reliability Engineering Location: Hyderabad, India Employment Type: Full-Time Notice period: 0-15 Days We’re hiring a Site Reliability Engineer to join our SRE team focused on maintaining the performance, reliability, and availability of our fintech...
-
Senior Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Microsoft Full timeThe Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level Windows 365 Cloud PC W365 and Azure Virtual Desktop AVD have recently been recognized as leaders in the Gartner Magic Quadrant TM for Desktop as...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
1 day ago
Hyderabad, India Elios Talent Full timeSite Reliability Engineer Key Highlights
-
Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India 2a1d0a41-1875-4bbb-b5a8-e4d5620cfd5f Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole & responsibilitiesCoordinates cross-product chaos experimentation to proactively test system resilience and uncover reliability gaps.Maintains the centralized incident response playbook for the subdivision, documenting standards for communication, escalation, and recovery during incidents. Aggregates and reports quantifiable availability data to senior...
-
Site Reliability Engineer
2 days ago
Hyderabad, Telangana, India Assurant Full time ₹ 6,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer, GCC-AssurantThe Site Reliability Engineer (SRE) will be part of the Assurant Reliability Team, specifically within the Site Reliability Engineering area. This remote position, based in India, focuses on building and maintaining reliable, scalable systems through a combination of software development and network diagnostics. The...
-
Site Reliability Engineer
4 days ago
Hyderabad, Telangana, India Assurant Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer, GCC-Assurant The Site Reliability Engineer (SRE) will be part of the Assurant Reliability Team, specifically within the Site Reliability Engineering area. This remote position, based in India, focuses on building and maintaining reliable, scalable systems through a combination of software development and network diagnostics. The...