
Cloud Engineer III-Observability
3 weeks ago
Who are we
Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or headlines. Relentless innovation has fueled our journey to consistent leadership recognition from analysts like Gartner and Forrester, and our sustained, aggressive growth has landed Smarsh in the annual Inc. 5000 list of fastest-growing American companies since 2008.
About the team: The Observability team builds and manages the single telemetry and observability service used by all product teams on the Smarsh platform. It provides 'as a service' telemetry, monitoring, and visualization capabilities that enable our product teams to operate, support, and triage the applications and services under their product portfolio.
We are seeking a rigorous, problem-solving, and curious Platform Engineer (who codes) to join our Fabric Insight group. Fabric teams at Smarsh combine software and systems engineering to build and run products that equip our engineering teams with secure tools and infrastructure to do their best work. We are looking for someone who can build Observability systems that engineers love to work with. In this role, you will play a key part in shaping the future of our platform by developing tooling and providing hands-on technical expertise to design, deploy, and optimize our services in a compliant and cost-effective way in the cloud. The ideal candidate will have a programming background in a cloud environment, a strong understanding of cloud automation, Observability, and security best practices, as well as the ability to collaborate effectively with cross-functional teams.
Roles & Responsibilities
- Develop and analyze various business and technical scenarios to drive the highest levels of executive decision-making around Observability resources. Drive consensus and decisions with stakeholders.
- Develop and implement automation to provision, configure, deploy, and monitor Observability services.
- Create reusable integrations for third-party tools (e.g., CI/CD systems, monitoring platforms, container registries and many more) to consolidate workflows.
- Communicate risks and progress in a timely manner to reporting supervisor
- Ensure efficient resource utilization and continuously improve processes leveraging automation and internal tools resulting in enhanced Product delivery, maturity, and scalability.
- Support the features delivered by debugging and creating RCA for production issues and subsequently work towards short term and long-term fix
- On-Call Rotation: Participate in an on-call rotation to provide 24/7 support for critical systems.
Required Experience/Skills
- Professional degree in Computer Science from a reputed college with consistent academic record.
- 4-6 years of professional experience in DevOps or software engineering roles, with a focus on configuring, deploying, and maintaining Kubernetes in AWS
- Strong proficiency in infrastructure as code (IaC) using Terraform, AWS CloudFormation, or similar tools.
- Experience with scripting and automation using languages such as Python
- Experience with CI/CD pipelines and automation tools such as Concourse, Jenkins, or Ansible.
- Experience with teams having delivered observability and telemetry tools and practices, such as Prometheus, Grafana, ELK stack, distributed tracing, and performance monitoring.
- Experience with cloud-native tools such as Istio, Argo CD, External Secrets Operator, Keda, Karpenter, etc
- Understanding SRE principles includes monitoring, alerting, error budgets, fault analysis, and automation.
- Concepts of SLI, SLO, SLA, Define SLIs (Service Level Indicators), SLOs (Service Level Objectives), and error budgets.
- Excellent problem-solving skills and attention to detail.
About Our Culture
Smarsh hires lifelong learners with a passion for innovating with purpose, humility and humor. Collaboration is at the heart of everything we do. We work closely with the most popular communications platforms and the worlds leading cloud infrastructure platforms. We use the latest in AI/ML technology to help our customers break new ground at scale. We are a global organization that values diversity, and we believe that providing opportunities for everyone to be their authentic self is key to our success. Smarsh leadership, culture, and commitment to developing our people have all garnered Comparably.com Best Places to Work Awards. Come join us and find out what the best work of your career looks like.
-
Bengaluru, Karnataka, India Pegasystems Full timeMeet Our Team:Cloud Observability Engineering collaborates with all the engineering teams at Pega and advocate for Observability solutions, establish standards and processes. Cloud Observability Engineering team is responsible for designing, developing and maintaining Observability solutions for Pega Cloud.Picture Yourself at Pega:You will be part of a...
-
Bengaluru, Karnataka, India Pegasystems Full timeMeet Our Team: Cloud Observability Engineering collaborates with all the engineering teams at Pega and advocate for Observability solutions, establish standards and processes. Cloud Observability Engineering team is responsible for designing, developing and maintaining Observability solutions for Pega Cloud. Picture Yourself at Pega: You will be part of a...
-
Senior Cloud Security
6 days ago
Bengaluru, Karnataka, India beBeeSecurity Full time ₹ 9,00,000 - ₹ 12,00,000Cloud Site Reliability Engineer (SRE) wanted. We are seeking a skilled Cloud SRE with expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms.ResponsibilitiesCloud Platform Architect: Design and optimize Terraform modules for multi-environment deployments.DevSecOps Lead: Drive DevSecOps practices and strengthen...
-
Senior Cloud Developer
56 minutes ago
Bengaluru, Karnataka, India beBeeCloud Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Job Summary:We are seeking a seasoned Cloud Development Engineer to join our Observability Services team. As a key member of our cloud-based infrastructure, you will be responsible for designing and developing observability solutions that enable us to monitor and optimize our applications hosted on Pega Cloud.Responsibilities:Design and Develop Observability...
-
SRE - Cloud Security & Observability
3 days ago
Bengaluru, Karnataka, India Xebia Full timeSRE – Cloud Security & Observability # ; Location: Bangalore (Hybrid – 3 days office per week)We are looking for a Cloud Site Reliability Engineer (SRE) with strong expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms. #Architect and optimize Terraform modules for multi-environment deployments....
-
Site Reliability Engineer III
3 days ago
Bengaluru, Karnataka, India Chase Bank Full timeJob DescriptionThere's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Commercial & Investment Bank, youwill solve complex and broad...
-
Cloud Security and Observability Expert
2 days ago
Bengaluru, Karnataka, India beBeeSre Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Cloud Security Engineer (SRE) Job DescriptionAs a Cloud Site Reliability Engineer (SRE), you will be responsible for designing, building, and scaling resilient cloud platforms with strong expertise in Cloud Security and Observability. This role is ideal for someone who thrives in a fast-paced environment and has a passion for automating deployments, CI/CD...
-
Software Engineer III
2 weeks ago
Bengaluru, Karnataka, India Chase- Candidate Experience page Full time US$ 1,50,000 - US$ 2,00,000 per yearWe have an exciting and rewarding opportunity for you to take your software engineering career to the next level. As a Software Engineer III at JPMorgan Chase within the Consumer & Community Banking technology team, you will be an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure,...
-
Cloud-Oriented Observability Specialist
3 days ago
Bengaluru, Karnataka, India beBeeSite Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job TitleSenior Site Reliability Engineer - ELK ExpertWe are seeking a seasoned Senior Site Reliability Engineer with expertise in the ELK stack to join our Platform Engineering Practice. As a key member of our team, you will design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems and driving...
-
Cloud Security
2 days ago
Bengaluru, Karnataka, India beBeeSecurity Full time ₹ 15,00,000 - ₹ 22,00,000Cloud Security Engineer (SRE) Job DescriptionWe are seeking a highly skilled Cloud Site Reliability Engineer (SRE) with expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms.Responsibilities:Design and optimize Terraform modules for multi-environment deployments.Drive DevSecOps practices and strengthen cloud...