Site Reliability
6 days ago
Description
N-iX is a global software development company founded in 2002, connecting more than 2,400 tech professionals across 40+ countries.
We deliver innovative technology solutions in cloud computing, data analytics, AI, embedded software, IoT, and more to global industry leaders and Fortune 500 companies.
Join us to create technology that drives real change for businesses and people across the world.
About The Team
We are the AI Platform Team, building and operating highly available, scalable, and automated infrastructure supporting global machine learning workloads.
We are seeking a Site Reliability / DevOps Engineer with a solid background in Java development who thrives on solving complex infrastructure challenges and driving platform automation.
In this role, you will ensure the reliability, scalability, and efficiency of our AI platform systems through automation, Java-based service optimization, and SRE best practices.
You'll collaborate closely with development, infrastructure, and research teams to deliver production-grade, self-healing, and performance-optimized services.
Key Responsibilities
- Design, implement, and maintain CI/CD pipelines for platform services.
- Manage and optimize Kubernetes clusters, Docker containers, and cloud infrastructure.
- Ensure high availability, system reliability, and operational security.
- Automate infrastructure tasks, monitoring, and service deployments.
- Troubleshoot production incidents, perform root cause analysis, and implement preventive solutions.
- Drive observability improvements using Prometheus, Grafana, and log aggregation tools.
- Collaborate with developers to define operational standards and DevOps best practices.
- Contribute to service discovery, orchestration, and API development.
- Improve system performance, scalability, and resilience through code and infrastructure enhancements.
- Integrate application services into automated build and deployment pipelines.
- Work with both SQL and NoSQL databases to support scalable platform components.
Requirements
- 3-5 years of combined experience in SRE / DevOps.
- Experience with Kubernetes, Docker, and Linux systems.
- Proven experience with CI/CD tools (e.g. Jenkins, GitHub Actions, GitLab CI).
- Understanding of cloud environments (AWS, GCP, or Azure).
- Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK stack).
- Good understanding of JVM tuning, profiling, and debugging.
- Excellent problem-solving, communication, and collaboration skills.
- Exposure to MLOps tools.
- Fluent English (spoken and written).
We Offer
- Flexible working format: remote, office-based, or flexible.
- A competitive salary and good compensation package.
- Personalized career growth.
- Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more).
- Active tech communities with regular knowledge sharing.
- Education reimbursement.
- Memorable anniversary presents.
- Corporate events and team buildings.
- Other location-specific benefits.