Cloud SRE

4 weeks ago


Chennai, India Ford Full time

Job Description

Enterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and create vehicles as smart as you are.

As an SRE at Ford, you'll be instrumental in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.

If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility.

- Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
- 3+ years of experience as an SRE, DevOps Engineer, Software Engineer or similar role.
- Strong experience with Golang development; familiarity with Terraform Provider development is desired.
- Proficient with monitoring and observability tools, particularly OpenTelemetry or comparable tools.
- Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
- Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
- Experience with relational and document databases.
- Ability to debug, optimize code, and automate routine tasks.
- Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
- Excellent verbal and written communication skills.
- Write, configure, and deploy code that improves service reliability for existing or new systems; set the standard for others with respect to code quality.
- Provide helpful and actionable feedback and review for code or production changes.
- Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
- Lead debugging, troubleshooting, and analysis of service architecture and design.
- Participate in on-call rotation.
- Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
- Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
- Work within GCP infrastructure, optimizing performance and cost and scaling resources to meet demand.
- Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
- Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
- Troubleshoot and resolve issues in our dev, test, and production environments.
- Participate in postmortem analysis and create preventative measures for future incidents.
- Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
- Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
- Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
- Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
- Contribute to internal knowledge bases and documentation.


  • SRE Coach

    2 weeks ago


    Chennai, India Capgemini Full time

    SRE Coach: The SRE Coach is responsible for leading and mentoring a team of SREs to implement best practices and processes related to site reliability engineering. The role involves collaborating with DevSecOps Automation teams and other stakeholders to identify opportunities for improving system reliability and performance, defining and implementing metrics...

  • Devops SRE

    3 hours ago


    Chennai, Tamil Nadu, India Virtusa Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    5+ years of relevant experience in DevOps/SRE roles. Deep expertise in AWS cloud services and architecture. Proven experience developing and implementing SRE/DevOps strategies. Experience with GCP, AWS cloud platform and services. Strong background in automation and CI/CD implementation using Terraform, GitHub, and Jenkins. Advanced systems administration skills...


  • Chennai, Tamil Nadu, India Ford Motor Company Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance the customer experience and improve people's lives, this is the opportunity for you. Join us and challenge your IT expertise and analytical skills to help create...

  • SRE Engineer

    2 weeks ago


    Bengaluru, Chennai, Hyderabad, India Cognizant Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Job Summary: We are seeking a Sr. Software Engineer with 7 to 12 years of experience to join our dynamic team. The ideal candidate will have expertise in AWS EKS, SRE, Elastic Beanstalk, Kubernetes, Python, GCP, AWS automation, and Terraform. This hybrid role offers a day shift with no travel required. Responsibilities: Develop and maintain scalable software solutions...

  • Mid-Level SRE

    5 days ago


    Chennai, Tamil Nadu, India Suzva Software Technologies Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Mid-Level SRE/DevOps Engineer (C2H) | Onsite - Coimbatore. Azure DevOps. Automate infra with Terraform (IaC). Monitor & optimize systems using Datadog, Prometheus, Grafana. Position: Mid-Level SRE/DevOps Engineer. Experience: 5-6 Years. Openings: 3. Location: Coimbatore (Onsite). Engagement Type: Contract-to-Hire (C2H). Contract Duration: 6 months to 1 year (based on...

  • SRE Lead Consultant

    4 days ago


    Bengaluru, Chennai, Hyderabad, India Krazy Mantra HR Solutions Pvt. Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    We are looking for a skilled SRE Lead Consultant & SRE Principal consultant with 8 to 10 years of experience. The ideal candidate should have expertise in SRE concepts such as SLO, SLI, and error budgeting, deployment experience in APM tools & Cloud monitoring tools, Git and code-review systems, change management, Agile, ITIL concepts, SOP creation, and...


  • Chennai, Tamil Nadu, India Altimetrik Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Cloud Engineer - Site Reliability Engineering for Ford Credit Tech. We're passionate about building software that solves problems. We count on our Site Reliability Engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions. We are currently seeking a public cloud experienced engineer...

  • Cloud Kinetics

    6 days ago


    Chennai, India Cloud Kinetics Full time

    Job Summary: We are seeking a skilled and proactive DevOps Lead to design, implement, and maintain our CI/CD pipelines, infrastructure, and deployment processes. Responsibilities: The ideal candidate will guide the DevOps team and work closely with development, QA, and IT teams to drive automation, scalability, and operational efficiency across...


  • Chennai, India BCT Full time

    Job Description: We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. The engineer will be responsible for maintaining and improving our software development and deployment processes while ensuring the reliability, scalability, and performance of our infrastructure. Key Responsibilities: -...


  • Chennai, Tamil Nadu, India Ford Motor Company Full time ₹ 1,20,000 - ₹ 1,50,000 per year

    Enterprise Technology plays a critical part in shaping the future of mobility. If you're looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance the customer experience and improve people's lives, this is the opportunity for you. Join us and challenge your IT expertise and analytical skills to help create...