Site Reliability Engineer, Platform Engineering
4 days ago
Position Description
Tesla's Platform Engineering is looking for a Site Reliability Engineer to join our team. As a member of the team, you will be building and maintaining Kubernetes clusters using infrastructure-as-code tools like Ansible, Terraform, ArgoCD and Helm and helping the application teams to be successful on our platform. The underlying infrastructure is a mix of on-premise VMs, bare metal hosts and public clouds such as AWS located all around the globe, which presents unique challenges and opportunity to work with different types of infrastructure technologies. A successful candidate will be expected to possess expert knowledge in Linux fundamentals, architecture and performance tuning; as well as software development skills to match. Experience running Kubernetes in production will be a strong plus; we prefer Golang or Python for any automation or tools we have to build along the way. We are the team that runs production critical workloads for every aspect of the business at Tesla and sets the standards for other teams, a group of well-rounded generalists that not only solve the hardest problems in the industry but also push other Engineering teams at large to be better. Join us to get a chance to work with some of the best Engineers in the industry for one of the most transformative companies in the history of both automotive and energy industries.
Responsibilities- Hands-on with developers to deploy the applications to provide support
- Building new features to improve the platform in terms of stability & updates
- Manage our Kubernetes clusters on-prem and in the cloud to support our growing workloads
- Participating in the architecture design process and troubleshooting of live applications with the product teams
- Participating in a 24x7 on-call rotation
- Influence architectural decisions with focus on security, scalability and high-performance
- Setup and maintain monitoring, metrics & reporting systems for fine-grained observability and actionable alerting
- Authoring technical documentation for workflows/processes/best practices
Requirements- Experience managing web-scale infrastructure in a production *nix environment
- Ability to prioritize tasks and work independently with an analytical mind with a bias for action
- Advanced or expert-level Linux administration and performance tuning skills
- Bachelor's Degree in Computer Science, Computer Engineering, or equivalent experience or evidence of exceptional ability
- Advanced experience with configuration management systems such as Ansible, Terraform or Puppet
- Demonstrable knowledge of the Linux operating system internals, networking stack, filesystems, resource scheduling and process management
- Exposure to AWS, or other cloud infrastructure providers
- Experience managing container-based workloads, using Kubernetes or other orchestration software in production (ArgoCD, Helm)
- Proficiency in a high-level language like Python, Go, Ruby and/or Java
-
Site Reliability Engineer
3 days ago
Pune, Maharashtra, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per yearCompany DescriptionENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
Site Reliability Engineer
4 days ago
Pune, Maharashtra, India Growel Softech Pvt. Ltd. Full time ₹ 12,96,000 - ₹ 1,51,20,000 per yearJob TitleSite Reliability EngineerLocationPune (Hybrid - 3days in a week at office, 2 days wfh, Candidate needs toreport to only Pune office) (Relocation is considerable)Shift Timings12:30 PM - 9:30 PM ISTBudget - 10+ to 12+ yrs 31 LPA13 to 15+ yrs 36 LPAInterview2 rounds (HMs availability is between 3PM 5PM IST)Positions4Considerable Notice Period - 30...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India UBS Full time ₹ 10,00,000 - ₹ 25,00,000 per yearIndiaInformation Technology (IT)Group FunctionsJob Reference #319274BRCityPuneJob TypeFull TimeYour roleAre you an analytic thinker?Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services?Do you want to play a key role...
-
Site Reliability Engineer
5 days ago
Pune, Maharashtra, India Infosys Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Description:Site Reliability Engineer Observability ToolsKey Responsibilities:A day in the life of an InfoscionAs part of the Infosys delivery team your primary role would be to interface with the client for quality assurance issue resolution and ensuring high customer satisfactionYou will understand requirements create and review designs validate the...
-
Site Reliability Engineer
4 days ago
Pune, Maharashtra, India Techverito Software Solutions LLP Full time ₹ 8,00,000 - ₹ 24,00,000 per yearJob Description3-5 years of proven and progressive experience as an SRE or DevOps Engineer. As a SRE Engineer, you will have a strong background in cloud infrastructure management and deployment, with expertise in AWS cloud, DevOps tools, and Kubernetes ecosystem. The primary focus of this role will be to design, implement, and manage our cloud...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Spark Tech Wave Innovation Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAn experienced Site Reliability Engineer (SRE) / DevOps Engineer with strong experience in cloud infrastructure, automation, and CI/CD. The candidate will be responsible for improving the reliability, scalability, and performance of production systems while driving automation and monitoring initiatives across the environment.Key ResponsibilitiesDesign,...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Global Payments Inc. Full time US$ 80,000 - US$ 1,50,000 per yearEvery day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing...
-
Site Reliability Engineer
5 days ago
Pune, Maharashtra, India KONE Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer – SAP Platforms (SAP BASIS)The Site Reliability Engineer – SAP Platforms (SAP BASIS) is continually in touch with the demand management team to understand the demand pipeline and configure Enterprise IT solutions and/or platforms according to the requirements, including RISE with SAP environments, Greenfield and Brownfield...
-
Site Reliability Engineer Oracle
1 week ago
Pune, Maharashtra, India UBS Full time ₹ 8,00,000 - ₹ 12,00,000 per yearIndiaInformation Technology (IT)Group FunctionsJob Reference #322692BRCityPuneJob TypeFull TimeYour roleAre you an analytic thinker?Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services?Do you want to play a key role...