Cloud Site Reliability Engineer
2 weeks ago
Job Description At NiCE, we don't limit our challenges. We challenge our limits. Always. We're ambitious. We're game changers. And we play to win. We set the highest standards and execute beyond them. And if you're like us, we can offer you the ultimate career opportunity that will light a fire within you. So, what's the role all about NICE Public Safety has expanded significantly and there is a need to automate, support and maintain our applications 24/7. As a result, we are expanding our Site Reliability team to ensure we continue to offer exemplary service to our customers. Our Site Reliability team is responsible for reducing the number of issues and speeding up the time to detection/resolution of issues using automation, tooling, telemetry, and data. This job description is not intended to be all-inclusive, and you will also perform other reasonable related business duties as assigned by your immediate supervisor and other management as required. We may revise or change job duties as the need arises. This job description does not constitute a written or implied contract of employment How will you make an impact - Act as part of a team of SRE's that act as the gatekeepers of production, and actively manage the work backlog and develop reliability improvements. - Lead investigations into root cause outages, performance, and cost issues. - Lead initiatives to develop the automation of low-value tasks balanced against project delivery demands. - You will provide technical leadership and to wider Cloud Operations and Support teams along with providing oversight to the products and services they support. - Develop and configure monitoring dashboards and alerts in tools like Grafana and Azure Monitor. - Installation and configuration of Observability Platform including tools like Grafana, Prometheus, Azure Monitor, Open telemetry etc. - Developing bicep modules for monitoring infrastructure and deploy it. - Developing and configuring CI/CD pipelines in Azure Devops for deploying monitoring infrastructure and monitoring objects Have you got what it takes - Must have 3+ years of experience in Site Reliability Engineering - Excellent technical, analytical and troubleshooting skills - Experience and in-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YML, JSON, XML) - Significant experience in programming or advanced scripting (C#, PowerShell etc.) - Experience with infrastructure/configuration as code and version control (ARM, BICEP, Git) - Experience managing monitoring, alerting and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch) - Demonstrable experience of supporting live cloud services and platforms - Production experience with Kubernetes and containerization - Implementation and support of service level objectives (SLOs) - Exposure to commercial cloud providers (Ideally Azure, others considered) - Exposure to Azure DevOps pipelines is desirable (CI/CD) - Exposure to test frameworks is desirable (NUnit, Jasmine, Selenium) - Efficient, effective, and respectful communication skills both with customers and within internal departments. Including, - Good listener, able to identify and validate assumptions. - Able to use effective questioning to confirm understanding of a customer problem and then provide help to solve it. - Methodical troubleshooting, technical skill and attention to detail used in diagnosing problems and reproducing issues in a local environment. - Multi-tasking and time-management to prioritise and switch between varied tasks. You will have an advantage if you also have: - Be flexible with working hours when needed to address critical or urgent matters. - Be able to provide on-call services from time to time as needed. What's in it for you - Join an ever-growing, market disrupting, global company where the teams comprised of the best of the best work in a fast-paced, collaborative, and creative environment As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NiCEr Enjoy NiCE-FLEX At NiCE, we work according to the NiCE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere. Requisition ID: 8094 Reporting into: Technical Manager/Director of Engineering Role Type: Individual Contributor About NiCE NICELtd. (NASDAQ: NICE)software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences,fight financial crimeand ensure public safety.Every day, NiCE software managesmore than120 million customer interactions and monitors3+billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.
-
Site Reliability Engineer
3 weeks ago
Pune, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
1 day ago
Bengaluru, India Relanto Full timeJob Description Job Title: Site Reliability Engineer Summary We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications. Roles And...
-
Site Reliability Engineer
2 weeks ago
Pune, India TechVerito Full timeJob Description About the Role: 3-5 years of proven and progressive experience as an SRE or DevOps Engineer. As a SRE Engineer, you will have a strong background in cloud infrastructure management, migration and deployment, with expertise in Google Cloud Platform (GCP), DevOps tools, and Kubernetes ecosystem. The primary focus of this role will be to migrate...
-
Site Reliability Engineer
3 weeks ago
Pune, India Talent Worx Full timeSite Reliability Engineer (SRE) At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per yearSite Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
-
Site Reliability Engineer
2 days ago
Pune, Maharashtra, India Ather Energy Full time ₹ 6,00,000 - ₹ 18,00,000 per yearYou'll be our: Site Reliability EngineerYou'll be based at: Pune Zonal OfficeYou'll be aligned with: Cloud and Data Platform Lead / Cloud ArchitectYou'll be a member of: Cloud and Data Platform TeamAther's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and...
-
Cloud Site Reliability Engineer
1 week ago
IND-Hyderabad-CapitaLand, India London Stock Exchange Group Full time ₹ 12,00,000 - ₹ 36,00,000 per yearEngineer, Cloud Site Reliability Company Profile LSEG (London Stock Exchange Group) is a world-leading financial markets infrastructure and data business. We are dedicated, open-access partners with a commitment to excellence in delivering services across Data & Analytics, Capital Markets, and Post Trade. Backed by three hundred years of experience,...
-
Site Reliability Engineer
6 days ago
India Akamai Full time ₹ 5,00,000 - ₹ 15,00,000 per yearDo you want to grow your career in Linux and Site Reliability Engineering?Would you like to contribute to the foundation of a new public cloud platform?Join our IaaS Site Reliability Engineering (SRE) team.We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...
-
Site Reliability Engineer
1 week ago
Pune, India Batch Systems Inc Full timeBatch is a brand-first technology platform designed to amplify customer engagement, enable frictionless transactions, defend product authenticity, elevate customer loyalty, and ignite customer growth. Our mission is to provide seamless solutions that help businesses build stronger connections with their customers. With a focus on enhancing the customer...
-
Site Reliability Engineer
1 week ago
Pune, India Batch Systems Inc Full timeBatch is a brand-first technology platform designed to amplify customer engagement, enable frictionless transactions, defend product authenticity, elevate customer loyalty, and ignite customer growth. Our mission is to provide seamless solutions that help businesses build stronger connections with their customers. With a focus on enhancing the customer...