Principal Site Reliability Engineer
5 days ago
Job Requisition ID #
25WD93104
Position Overview
Become part of the team responsible for developing the core infrastructure that drives Autodesk's cloud services. You will be working on CloudOS, our cutting-edge Continuous Deployment (CD) platform, which is widely utilized by our internal cloud engineering teams. As a crucial element of the Platform Services and Emerging Technologies team, CloudOS streamlines and standardises the provisioning and management of front-end, back-end, and infrastructure components across multiple global regions. CloudOS harnesses the power and flexibility of industry-standard open-source tools, enabling the rapid delivery of new platform capabilities while benefiting from the contributions of the global developer community. This modern approach allows for increased agility and the replacement of outdated, less flexible internal deployment systems.
We are looking for a skilled and passionate Principal Site Reliability Engineer (SRE) to join our CloudOS team. As a Principal SRE, you will be a strategic leader and a technical authority in developing, operating, and evolving the CloudOS platform. You will help streamline and standardise the provisioning and management of front-end, back-end, and infrastructure components across multiple global regions. You will collaborate closely with internal engineering teams to understand their deployment needs, enhance the platform's functionality, ensure its reliability, and promote its adoption across Autodesk. If you are enthusiastic about building resilient developer platforms, automating complex workflows, and working at scale on public cloud services using container services like ECS and EKS, this position is ideal for you. While we primarily utilise AWS, we also have workloads on Azure and GCP.
This is a hybrid role that requires working from the Bengaluru office a few days per week.
Responsibilities:
- Lead the design, development, deployment, testing, maintenance, and enhancement of features and functionality within the CloudOS platform.
- Define the architectural roadmap and future vision for the CloudOS platform, ensuring alignment with Autodesk's business goals.
- Drive strategic initiatives to improve the platform's reliability, performance, and scalability.
- Oversee and manage the infrastructure supporting CloudOS, ensuring reliability, scalability, security, and cost optimization using technologies like Kubernetes, AWS, Azure, and GCP.
- Develop and enforce policies, standards, and procedures for cloud infrastructure management.
- Configure, manage, and upgrade Jenkins and Spinnaker; contribute to upstream improvements or develop custom extensions as needed.
- Lead the development of complex automation scripts for provisioning, deployment, monitoring, and operational tasks using languages such as Python, Go, or Java and Infrastructure-as-Code tools like Terraform.
- Collaborate closely with internal engineering teams to understand their CI/CD needs, provide proactive support, troubleshoot complex issues, and promote best practices for utilizing CloudOS.
- Implement, manage, and optimize monitoring, logging, and alerting systems to ensure the health and performance of the CloudOS platform.
- Lead initiatives to identify and resolve performance bottlenecks, ensuring optimal system performance.
- Create and maintain comprehensive technical documentation and runbooks, and facilitate knowledge transfer within the team and among platform users.
- Stay current with the latest industry trends and advancements in CI/CD, cloud-native technologies, DevOps, and platform engineering.
- Oversee the on-call rotation to provide incident response and support for the CloudOS platform, ensuring timely resolution of issues and continuous improvement from incident learnings.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Computer Engineering, or a related field, with at least 8 years in a Platform Engineering, DevOps, SRE, or related role.
- Extensive understanding of CI/CD principles and hands-on experience with relevant tools (e.g., Spinnaker, Jenkins, GitLab CI, Argo CD, Argo Workflows).
- Experience working with major public cloud providers (AWS, Azure, GCP).
- Proficiency in scripting and/or programming languages (e.g., Python, Go, Java).
- Hands-on experience with containerisation (Docker) and container orchestration (Kubernetes).
- Experience with Infrastructure-as-Code (IaC) tools like Terraform or CloudFormation.
- In-depth knowledge of monitoring and observability tools (e.g., Prometheus, Grafana, Dynatrace, Datadog, ELK Stack).
- Strong troubleshooting and problem-solving skills, and a strategic mindset.
- Excellent communication and collaboration skills.
Preferred Qualifications:
- Direct, hands-on experience managing, configuring, or extending CI/CD pipelines.
- Experience contributing to open-source projects, particularly related to cloud-native or CI/CD tooling.
- Strong AWS expertise.
- Deep expertise in Kubernetes operations and management.
- Experience building and supporting internal developer platforms or tools.
- Familiarity with multi-region cloud application architectures and deployment strategies.
- Experience with GitOps workflows.
Learn More
About Autodesk
Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.
We take great pride in our culture here at Autodesk – it's at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.
When you're an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!
Salary transparency
Salary is one part of Autodesk's competitive compensation package. Offers are based on the candidate's experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.
Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here:
Are you an existing contractor or consultant with Autodesk?
Please search for open jobs and apply internally (not on this external site).
Bengaluru, Karnataka, India · Autodesk · Full time · ₹ 1,00,00,000 - ₹ 2,00,00,000 per year