Devops/SRE Engineer
3 days ago
As a Cloud Site Reliability Engineer at our company, you will play a critical role in ensuring the robustness,
performance, and security of our cloud-based systems. Your focus will be on maintaining and improving our
cloud infrastructure with a special emphasis on cloud security and observability. You will work closely with
development teams to architect, deploy, and optimize systems that are not only reliable but also resilient and secure.
Handle SRE operational duties including responding to pull requests and ensuring smooth continuous integration and delivery processes.
Maintain and fine-tune applications for optimal performance, ensuring they meet specified requirements.
Explore and experiment with new technologies through Proof-of-concepts to enhance existing functionalities or discover new opportunities.
Automate deployment, configuration, and operational processes to improve efficiency and accuracy.
Collaborate with development teams to guide system architecture and design, focusing on reliability, efficiency, and scalability.
Implement and manage observability tools such as Grafana, Prometheus, and New Relic to ensure all critical services are monitored effectively.
Develop custom reliability tools and frameworks for use by engineering teams.
Participate in an on-call rotation for critical systems, lead incident responses, and conduct thorough
post-mortem analyses.
Drive system and process efficiencies including capacity planning, configuration management,
performance tuning, monitoring, and root cause analysis.
Act as a consultant within the organization for best practices in infrastructure management and assist
teams in effective infrastructure utilization.
Experience with state machines such as AWS Step Functions or Azure Logic Apps.
Deep knowledge in telemetry and observability; experience with Prometheus, OpenTelemetry, or
DynaTrace is highly desirable.
Proficiency in Kubernetes with CKA/CKAD certification being advantageous.
Expertise in Terraform, with experience in setting up pipelines for multi-environment deployments.
Good programming skills in high-level languages, with a preference for Python. Go, or any other
compiled languages is an advantage. familiarity with Observability tools like Grafana, Prometheus, and New Relic.
Strong project management and organizational skills.
An open mindset with the ability to quickly adapt to new technologies and learning practices.
About Cloud Native Engineering
The Cloud Native Engineering Practice is an organization of engineers who work with our production services
throughout their entire life cycle, from design and architecture, through implementation, deployment, and
sustaining operation.SREs delivers important system properties: reliability, performance, efficiency, and
scalability, for the products and platforms that our customers use every day.
SREs work in high-performance squads with expertise on large scale system reliability and in-depth
understanding of critical business components architecture, as well as dedicated engineering teams building
comprehensive tools, platform and infrastructure.
-
SRE DevOps Engineer
4 weeks ago
Hyderabad, India Supro Consulting Full timeJob Overview We are seeking an experienced SRE DevOps Engineer to join our team in Hyderabad. This is a full-time, mid-level position requiring 4 to 6 years of relevant work experience. The role demands hands-on expertise in site reliability engineering and DevOps practices, particularly within AWS environments, to ensure the smooth and efficient...
-
SRE DevOps Engineer
4 weeks ago
Hyderabad, India Supro Consulting Full timeJob Overview We are seeking an experienced SRE DevOps Engineer to join our team in Hyderabad. This is a full-time, mid-level position requiring 4 to 6 years of relevant work experience. The role demands hands-on expertise in site reliability engineering and DevOps practices, particularly within AWS environments, to ensure the smooth and efficient operation...
-
SRE DevOps Engineer
3 weeks ago
Bengaluru, India Brillio Full timeSRE DevOps(ML Ops role) Required Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Python. ● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. ● Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments....
-
SRE DevOps Engineer
3 weeks ago
Bengaluru, India Brillio Full timeSRE DevOps(ML Ops role) Required Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Python. ● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. ● Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments....
-
SRE DevOps Engineer
3 weeks ago
Bengaluru, India Brillio Full timeSRE DevOps(ML Ops role) Required Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Python. ● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. ● Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments....
-
DevOps / SRE with Python
1 week ago
Bengaluru, Karnataka, India BAHWAN CYBERTEK Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Description We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our...
-
DevOps / SRE with Python
4 weeks ago
Bengaluru, India Bahwan Cybertek Group Full timeWe are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: - Develop...
-
DevOps / SRE with Python
4 weeks ago
Bengaluru, India Bahwan Cybertek Group Full timeWe are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: - Develop...
-
DevOps Engineer
2 weeks ago
Bengaluru, India 8byte Full timeJob Description DevOps Engineer 8byte What is the role As a DevOps Engineer, you will be responsible for designing, implementing, and maintaining robust, scalable, and secure infrastructure while optimizing development and deployment processes. This role goes beyond traditional DevOps and aligns closely with Site Reliability Engineering (SRE) principles,...
-
SRE DevOps Engineer
4 weeks ago
Bengaluru, India Brillio Full timeSRE DevOps(ML Ops role) Required Skills: ● Demonstrated ability in designing, building, refactoring and releasing software written in Python. ● Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton. ● Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training...