Site Reliability Engineer Specialist
7 days ago
Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing results. We are driven by our passion for success and we are proud to deliver best-in-class payment technology and software solutions. Join our dynamic team and make your mark on the payments technology landscape of tomorrow.
- Summary of This Role
Manage DevOps tools such as Jenkins, Git, Docker, Kubernetes, and Terraform. Use these skills to support, build and maintain Kubernetes clusters on-prem, in OCP and in AWS. Responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Site reliability engineers create a bridge between development and operations by applying a software engineering mindset to system administration topics. They split their time between operations/on-call duties and developing systems and software that help increase site reliability and performance.
- What Part Will You Play?
- Participate in architecture and R&D discussions for new technology or processes to increase the performance and reliability of our systems.
- Chaos engineering - you're expected to think laterally about how our systems might fail in theory, design tests to demonstrate how they behave in practice, and then formulate and implement remediation plans, as appropriate.
- Pushing our systems to their limits, and then coming up with designs for how to get them to the next performance tier.
- Use DevOps and GitOps practices to improve automation and processes and make self-service possible.
- Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
- Running "game days" to test assumptions about reliability and learn what will break before it matters to customers.
- Reviewing designs with an eye toward increasing the holistic stability of our platform and identifying potential risks.
- Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
- Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don't get paged when it doesn't).
- Troubleshooting systems and network issues, alongside our Technical Operations Team.
- Mentoring other engineers in reliability-related skills.
- Evolving our SDLC, practices, and tooling to account for Site Reliability considerations and best practices.
- Developing runbooks and improving documentation.
- What Are We Looking For in This Role?
- Minimum Qualifications
- BS in Computer Science, Information Technology, Business / Management Information Systems or related field
- Typically 6+ years of experience programming in one or more languages and 4 years of experience working with Unix/Linux systems internals and administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).
- What Are Our Desired Skills and Capabilities?
Required
- Basic familiarity with containerization tools like Docker.
- Deep understanding of Kubernetes concepts, architecture, and best practices.
- Familiarity with OpenShift Container Platform, its features, and how it extends Kubernetes.
- Basic understanding of version control systems such as Git.
- Basic knowledge of CI/CD concepts and tools (e.g., Jenkins, GitLab CI).
- Basic understanding of Infrastructure as Code principles.
- Basic knowledge of Linux operating systems.
- Understanding of basic networking concepts and protocols.
- Awareness of fundamental security practices and principles.
- Basic understanding of securing applications and infrastructure.
- Analytical skills to troubleshoot and resolve basic technical issues.
- Ability to identify and escalate complex issues to senior team members.
- Eagerness to learn new technologies and continuously improve technical skills.
- Active participation in training sessions, workshops, and relevant certifications.
Preferred
- Experience with cloud platforms (e.g., AWS, Azure, GCP) and their services.
- Proficiency in scripting languages (e.g., Python, Bash, Groovy) and experience with automation tools (e.g., Ansible, Terraform, Salt).
- Basic knowledge of monitoring and logging tools (e.g., Prometheus, Grafana).
- Exposure to Kafka, NATS, and Vault.
Global Payments Inc. is an equal opportunity employer. Global Payments provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, sex (including pregnancy), national origin, ancestry, age, marital status, sexual orientation, gender identity or expression, disability, veteran status, genetic information or any other basis protected by law. If you wish to request reasonable accommodations related to applying for employment or provide feedback about the accessibility of this website, please contact
-
Senior Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Barclays Full time ₹ 6,00,000 - ₹ 18,00,000 per year
Step into the role of Senior Site Reliability Engineer - Database Specialist. At Barclays, we are more than a bank; we are a force for progress. You will be part of the central SRE (Site Reliability Engineer) core team within our wider Infrastructure team. You will act as a centre of excellence providing hands-on consultancy to our different...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Fiserv Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Site Reliability Engineer. Exp. Range: 8 to 14 Years. What does a successful Site Reliability Engineer (SRE) Expert do at Fiserv? The site reliability engineer blends the principles of software engineering with the discipline of operations to create high-performing and reliable software systems. They are tasked with designing and implementing tools, processes, and...
-
Site Reliability Engineer
6 days ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per year
Company Description: ENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India Ather Energy Full time ₹ 6,00,000 - ₹ 18,00,000 per year
You'll be our: Site Reliability Engineer
You'll be based at: Pune Zonal Office
You'll be aligned with: Cloud and Data Platform Lead / Cloud Architect
You'll be a member of: Cloud and Data Platform Team
Ather's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and...
-
Site Reliability Engineer
7 days ago
Pune, Maharashtra, India NielsenIQ Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Site Reliability Engineer - Cloud Computing Engineering - T6
Job Description: Senior Site Reliability Engineer, Pune. At NielsenIQ Digital Shelf, we help the world's leading brands measure and improve their online performance. Formerly known as Data Impact, we've recently joined NielsenIQ. Today, we operate at the intersection of scale and agility - a...
-
Site Reliability Engineer
6 days ago
Pune, Maharashtra, India Amdocs Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Job ID: 205406
Required Travel: Minimal
Managerial: No
Location: India - Pune (Amdocs Site)
In one sentence: As the SRE Lead, you will be responsible for the reliability & operational excellence of the amAIz (Telco Agentic Suite). You will lead a cross-functional team of NFT, QA & DevOps Engineers, driving best practices in observability, automation, performance...
-
Site Reliability Engineer
6 days ago
Pune, Maharashtra, India Amdocs Full time ₹ 9,00,000 - ₹ 12,00,000 per year
Job ID:
Required Travel: Minimal
Managerial: No
Location: India - Pune (Amdocs Site)
In one sentence: As the SRE Lead, you will be responsible for the reliability & operational excellence of the amAIz (Telco Agentic Suite). You will lead a cross-functional team of NFT, QA & DevOps Engineers, driving best practices in observability, automation, performance...
-
Site Reliability Engineer
2 hours ago
Pune, Maharashtra, India Infosys Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Job Description: Site Reliability Engineer, Observability Tools
Key Responsibilities: A day in the life of an Infoscion. As part of the Infosys delivery team, your primary role would be to interface with the client for quality assurance, issue resolution, and ensuring high customer satisfaction. You will understand requirements, create and review designs, validate the...
-
SRE (Site Reliability Engineer)
2 weeks ago
Pune, Maharashtra, India Apex One Full time ₹ 6,00,000 - ₹ 18,00,000 per year
Job Overview: We are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any...
-
Site Reliability Engineer
4 days ago
Pune, Maharashtra, India UBS Full time ₹ 10,00,000 - ₹ 25,00,000 per year
India. Information Technology (IT). Group Functions.
Job Reference #: 319274BR
City: Pune
Job Type: Full Time
Your role: Are you an analytic thinker? Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services? Do you want to play a key role...