Current jobs related to Sr. Principal Observability and Reliability Tooling Engineer - Pune, Maharashtra - UKG
-
Principal Site Reliability Enginee
2 weeks ago
Pune, Maharashtra, India Boomi Software Full timeJob DescriptionAs a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and...
-
Senior Cloud DevOps Engineer
7 days ago
Pune, Maharashtra, India beBeeCloudDevops Full time ₹ 15,75,000 - ₹ 24,20,000Sr. Cloud DevOps & SRE LeaderWe are seeking an experienced Sr. Cloud DevOps & SRE leader with expertise in cloud infrastructure, automation, and observability.This role involves ensuring the reliability, performance, and scalability of systems through proactive problem-solving and operational excellence.The Sr. Cloud DevOps & SRE leader will play a vital...
-
Reliability Engineer
1 week ago
Pune, Maharashtra, India beBeeDatabase Full time US$ 90,000 - US$ 1,20,000Reliability Engineer - Database SystemsAbout the Role:We are seeking an experienced reliability engineer to join our team. As a key member of our database systems team, you will be responsible for ensuring the high availability and performance of our systems.Key Responsibilities:Understanding project KPIs, SLIs, SLOs, MTTD, MTTR, error budgets, chaos...
-
Senior Software Engineer
2 weeks ago
Pune, Maharashtra, India Principal Global Services Full timeResponsibilities Indicative years of experience 5 yearsRole Description Principal Pune is hiring a Mainframe Modernization - Sr Infrastructure Engineer This engineer will be a part of the Platform support under Information Services IS and responsible for helping achieve the strategy around availability of our Mainframe environment through adoption of...
-
Delivery Manager
2 weeks ago
Pune, Maharashtra, India Principal Global Services Full time ₹ 6,00,000 - ₹ 8,00,000 per yearJob Description:Site Reliability Engineering (SRE) Manager Observability & ITOMIndicative years of total experience: 14 – 16 yearsLocation:PuneDepartment:Engineering / IT OperationsReporting relationship:This role will report to Program ManagerJob Type:Full-Time (Hybrid)Job Summary:We are seeking a seasoned SRE Manager to lead our Observability &...
-
Lead Software Engineer
3 weeks ago
Pune, Maharashtra, India Principal Financial Full timeResponsibilitiesAbout the Team Our API Platform Engineering team is focused on enabling teams across the company to build and integrate APIs efficiently We provide the tools standards and best practices that empower developers to create secure scalable and high-quality APIs If you re passionate about improving developer workflows shaping API...
-
Sr. Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Peoplefy Full timeHi Everyone, We are hiring for the position Sr. Site Reliability Engineer , and this opportunity is with Yervada, Pune location. We are looking for candidate with 5+ years of experience with Application support or production support and .NET or Java Monitoring tools- Grafana or Prometheus or Splunk or Dynatrace. If interested share your updated resumes on ...
-
Sr. Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Peoplefy Full timeHi Everyone,We are hiring for the position Sr. Site Reliability Engineer, and this opportunity is with Yervada, Pune location.We are looking for candidate with 5+ years of experience with Application support or production support and .NET or Java Monitoring tools- Grafana or Prometheus or Splunk or Dynatrace.If interested share your updated resumes on
-
Delivery Manager
2 weeks ago
Pune, Maharashtra, India Principal Global Services Full time ₹ 1,04,000 - ₹ 1,30,878 per yearResponsibilities:Job Description: Site Reliability Engineering (SRE) Manager – Observability & ITOMIndicative years of total experience: 14 – 16 yearsLocation:Pune/HyderabadDepartment:Engineering / IT OperationsReporting relationship:This role will report to Program ManagerJob Type:Full-Time (Hybrid)Job Summary:We are seeking a seasoned SRE Manager to...
-
Reliability Engineering Expert
2 weeks ago
Pune, Maharashtra, India beBeeExpert Full time US$ 1,80,000 - US$ 2,50,000Reliability Engineering ExpertSite reliability engineering combines development and operations knowledge to drive business success. If you have a background in SRE or development with experience in improving service reliability by adding observability, our team can benefit from your expertise.As a reliability engineer on the observability team, you will help...
Sr. Principal Observability and Reliability Tooling Engineer
2 weeks ago
Company Overview
With 80,000 customers across 150 countries, UKG is the largest U.S.-based private software company in the world. And were only getting started. Ready to bring your bold ideas and collaborative mindset to an organization that still has so much more to build and achieve? Read on.
At UKG, you get more than just a job. You get to work with purpose. Our team of U Krewers are on a mission to inspire every organization to become a great place to work through our award-winning HR technology built for all.
Here, we know that youre more than your work. Thats why our benefits help you thrive personally and professionally, from wellness programs and tuition reimbursement to U Choose a customizable expense reimbursement program that can be used for more than 200+ needs that best suit you and your family, from student loan repayment, to childcare, to pet insurance. Our inclusive culture, active and engaged employee resource groups, and caring leaders value every voice and support you in doing the best work of your career. If youre passionate about our purpose people then we cant wait to support whatever gives you purpose. Were united by purpose, inspired by you.
Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering and auto remediation.
Site Reliability Engineers must have a passion for learning and evolving with current technology trends. They strive to innovate and are relentless in their pursuit of a flawless customer experience. They have an automate everything mindset, helping us bring value to our customers by deploying services with incredible speed, consistency and availability.
Primary/Essential Duties and Key Responsibilities:
- Proficient in Splunk/ELK, and Datadog.
- Experience with observability tools such as Prometheus/InfluxDB, and Grafana.
- Possesses strong knowledge of at least one scripting language such as Python, Bash, Powershell or any other relevant languages.
- Design, develop, and maintain observability tools and infrastructure.
- Collaborate with other teams to ensure observability best practices are followed.
- Develop and maintain dashboards and alerts for monitoring system health.
- Troubleshoot and resolve issues related to observability tools and infrastructure.
- Engage in and improve the lifecycle of services from conception to EOL, includingsystem design consulting, and capacity planning
- Define and implement standards and best practices related toSystem Architecture, Service delivery, metrics and the automation of operational tasks
- Support services, product & engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response.
- Improve system performance, application delivery and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis
- Collaborate closely with engineering professionals within the organization to deliver reliable services
- Identify and eliminate operational toil by treating operational challenges as a software engineering problem
- Actively participate in incident response, including on-call responsibilities
- Partner with stakeholders to influence and help drive the best possible technical and business outcomes
- Guide junior team members and serve as a champion for Site Reliability Engineering
- Engineering degree, or a related technical discipline, and 10+years of experience in SRE.
- Experience coding in higher-level languages (e.g., Python, Javascript, C++, or Java)
- Knowledge of Cloud based applications & Containerization Technologies
- Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing
- Ability to analyze current technology utilized and engineering practices within the company and develop steps and processes to improve and expand upon them
- Working experience with industry standards like Terraform, Ansible.
- (Experience, Education, Certification, License and Training)
- Must have hands-on experience working within Engineering or Cloud.
- Experience with public cloud platforms (e.g. GCP, AWS, Azure)
- Experience in configuration and maintenance of applications & systems
- infrastructure. Experience with distributed system design and architecture
- Experience building and managing CI/CD Pipelines
Where were going
UKG is on the cusp of something truly special. Worldwide, we already hold the #1 market share position for workforce management and the #2 position for human capital management. Tens of millions of frontline workers start and end their days with our software, with billions of shifts managed annually through UKG solutions today. Yet its our AI-powered product portfolio designed to support customers of all sizes, industries, and geographies that will propel us into an even brighter tomorrow
UKG is proud to be an equal opportunity employer and is committed to promoting diversity and inclusion in the workplace, including the recruitment process.
Disability Accommodation
For individuals with disabilities that need additional assistance at any point in the application and interview process, please email