Site Reliability Engineer-1

19 hours ago


Bengaluru Karnataka, India MoEngage Inc Full time

**About MoEngage**

Fortune 500 brands and Enterprises across 35 countries such as **Deutsche Telekom, Samsung, Ally Financial, Vodafone, and McAfee along with internet-first brands such as Flipkart, Ola, OYO, and Bigbasket u**se MoEngage to orchestrate their cross-channel campaigns and engage efficiently with their customers sending 80 billion messages to 900 million consumers every month.

Our vision is to build the world’s most trusted customer engagement platform for the mobile-first world.

We promise to care about your customers as much as you do. And that justifies our top ratings for service and support in **Gartner Magic Quadrant, Gartner Peer Insights, and G2 Summer Reports.** **We have also been recognized as one of the 25 Highest Rated Private Cloud Computing Companies To Work For in a list released by Battery Ventures,** a global investment firm based on the employee feedback on Glassdoor where employees reported the highest levels of satisfaction at work during the first six months of the pandemic.

Our last round of Series C1 funding of $32.5 million in July 2021 accelerates our vision to create impeccable customer experiences for our customers globally. We have recently crossed 400+ headcount milestones and still growing.

**As part of the Engineering team at MoEngage, here are some things you can expect**:

- Take ownership and be responsible for what you build - no micromanagement
- Work with A players (some of the best talent in the country), and expedite your learning curve and career growth
- Make in India and build for the world at the scale of 500M active users, which no other internet company in the country has seen
- Learn together from different teams on how they scale to millions of users and billions of messages.

**Here are some of the challenging areas you can expect to work as part of the SRE team**:

- Maintain services once they are live by measuring and monitoring availability, latency and overall system reliability.
- Work closely with team members to ensure best practices and strategic goals are incorporated into development work.
- Collaborate with other engineering teams to identify and anticipate changing requirements and opportunities to improve the development environment.
- Monitoring at scale with VictoriaMetrics and the likes
- Orchestrating and managing with K8S and the likes
- Implementing best practices, challenging the status quo, and tab on industry and technical trends, changes, and developments to ensure the team is always striving for best-in-class work.
- Manage capacity, build security into every layer, and reduce cost
- Implement secure networking, key management, user management, access management, process management, and image management.
- Effectively lead and manage team deliverable (short/long term) project planning and coaching, quarterly reviews, participation in the selection process for new hires, and technical and non-technical guidance to the team.

**Skill Requirements**:

- Proven experience in handling large infrastructure and distributed systems like Yarn, Kubernetes, Elasticsearch, etc..
- Tech Stack - Python, AWS, Azure, Linux.
- Familiarity with Python-related technologies and frameworks like Falcon, Django, or Pyramid.
- Experience with Unix/Linux operating systems internals and administration (e.g. filesystems, inodes, system calls, etc.) or networking (e.g. TCP/IP, routing, network topologies, and hardware, SDN, etc.)
- Familiarity with the cloud computing infrastructure, preferably Azure
- Familiarity with task queue frameworks like Celery or Pika is a plus.
- Source code management and Implementation of security best practices.
- Familiarity with any one container orchestration tools build, artifact, packaging, service discovery management tools.
- Know-how of gathering metrics across distributed systems (instances/container) & generating automated notifications, and reports.
- Prowess in analyzing App bottlenecks, and performance degradation, and implementing automated processes/tools to detect such anomalies.
- Good understanding & implementation experience using 12-factor App principles.

**Mandatory Skills**:

- 3+ years of Experience on the AWS/Azure platform.
- Proficiency in Python or shell scripting languages.
- Hands-on experience in container technologies (K8s, ArgoCd, Helm/Kustomize)
- Having a mindset as Automate anything.
- Experience with AWS/Azure cost explorer, billing analysis, and various cost optimization techniques.
- Awareness of Cloud Security concepts
- Awareness of Information Security concepts and Best Practices

**Good to have**:

- AWS/Azure cloud certification preferred
- Certification in Kubernetes Administrator (CKA).
- Certification in Kubernetes Application Developer (CKAD)
- Experience with configuration management tools and strong code analysis skills in Python
- Experience in working with APM-based tools like New Relic

At MoEngage, we are passionate about our team and technology. We handle mor



  • Bengaluru, India ViewSonic Full time

    Job Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of...


  • Bengaluru, Karnataka, , India Qure ai Technologies Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    About Qure.AI:Qure.AI is an equal opportunity employer. is a leading Healthcare Artificial Intelligence (AI) company disrupting the 'status quo' by enhancing diagnostic imaging and improving health outcomes with the assistance of machine -supported tools. taps deep learning technology to provide an automated interpretation of radiology exams like X -rays,...


  • Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Role DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....


  • Bengaluru South, Karnataka, India Gravity Engineering Services Full time

    Company DescriptionGravity Engineering Services helps ambitious brands, retailers, and enterprises turn technology into a growth engine. With over 11 years of global experience and a team of more than 300 experts, Gravity Engineering Services delivers end-to-end digital transformation across commerce, supply chain, AI, and cloud sectors. Their mission,...


  • Bangalore, Karnataka, India Empower Annuity Insurance Full time

    Our vision for the future is based on the idea that transforming financial lives starts by giving our people the freedom to transform their own We have a flexible work environment and fluid career paths We not only encourage but celebrate internal mobility We also recognize the importance of purpose well-being and work-life balance Within Empower and our...


  • Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per year

    Company DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...


  • Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role :Site Reliability Engineer (SRE)with deep expertise inMainframe technologies like COBOL, JCL, etc. to support and enhance ourCard Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...


  • Bellandur, Bengaluru, Karnataka, India Princeton IT America Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    LocationsBengaluruMinimum Experience8Maximum Experience10Mandatory SkillsSite Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design reviewSkill to EvaluateSite Reliability Engineer, AWS, Devops, automation, Prometheus, monitoring, framework, design reviewExperience8 to 10 YearsLocationBengaluruJob DescriptionDesign and...


  • Bengaluru, India Whatjobs IN C2 Full time

    Site Reliability Engineer (SRE) Level 3 Overview: A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bengaluru, India Relanto Full time

    Job Description Job Title: Site Reliability Engineer Summary We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications. Roles And...