Senior Site Reliability Engineer

2 months ago


delhi, India Tanla Platforms Limited Full time
About the Role: As a Site Reliability Engineer , you will be responsible for ensuring platform and application availability, scalability, and reliability, while maintaining optimal system uptime.
What you''ll be Responsible for?
Build, monitor and maintain highly scalable, large-scale deployments.
Installation/deployment of new releases, environments for applications.
Proactively monitor systems and applications, develop and maintain monitoring tools and dashboards, and ensure high availability of production environments by identifying performance issues and implementing corrective actions.
Incident Management: Lead incident response efforts, diagnose root causes, and implement long-term solutions to prevent recurrence. Ensure effective communication during outages.
Collaboration & Coordination: Work closely with cross-functional teams to ensure efficient platform integration, API management, and campaign execution, while providing technical guidance and support as needed.
Troubleshooting and Root Cause Analysis: Utilize your expertise to investigate and resolve incidents quickly during crisis situations, performing root cause analysis to prevent recurrence.
Ensure high availability of production environments by monitoring performance metrics and implementing corrective actions when necessary.
Platform Integration: Manage and oversee the integration of various APIs, ensuring seamless interoperability between systems and third-party services.
Support the compliance and security integrity of the environments.
Adherence to process compliance & ensuring platform reliability.
Experience in monitoring and automations in Prometheus Grafana or ELK or Datadog or Dynatrace or any observability tools
Experience with container management and micro-services architectures such as Docker in cloud or on-premises infrastructure.
What You'd have?
Kubernetes: Expertise in creation, maintenance, scaling, and upgrades of Production clusters.
Docker: Must have experience in writing Docker files complying with Industry standard best practices.
CI/CD: Must have hands-on experience with Azure-DevOps/Jenkins in creation & Execution of Pipelines in a multi-target environment.
Troubleshooting skills: Expertise in analysis of applications logs to drilldown in identification of the issue with expertise on logging stacks such as ELK, Dynatrace, Splunk
Monitoring Stacks: Expertise in using Grafana with skills on building & managing of dashboards on various data sources in Grafana.
Programming Skills: Experience in creating & managing of Bash scripts & Ansible with some exposure on Terraform.
Environment: Excellent skills and hands-on in Linux environments and able to troubleshoot issues at OS levels.
Experience on usage of project management tools such as JIRA
Experience in deploying & Managing of Distributed Queuing systems such as Redis, Kafka Rabbit-MQ, IBM-MQ, MSMQ
Experience in deploying & managing of Databases in standalone & cluster modes with basic DB Skills on Postgres, MySQL, Click House
Prior experience in working on high traffic & highly scalable platforms is an added advantage.
Good command on Linux, Networking concepts (TLS/SSL, DNS, Load Balancers, etc.,) and troubleshooting skills in large scale environments
Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, X509 certificates
Good knowledge of ITIL terminology for incident and problem management
Track record of excellent interpersonal, analytical, and communication skills.
Bachelor of Science in Computer Science or other related discipline.
Why join us?
Impactful Work: Play a pivotal role in safeguarding Tanla's assets, data, and reputation in the industry.
Tremendous Growth Opportunities: Be part of a rapidly growing company in the telecom and CPaaS space, with opportunities for professional development.
Innovative Environment: Work alongside a world-class team in a challenging and fun environment, where innovation is celebrated. Tanla is an equal opportunity employer.
Tanla is an equal opportunity employer. We champion diversity and are committed to creating an inclusive environment for all employees.


  • Delhi, India GeekBull Consulting Full time

    Job Code: GBC-2411129Job Role: Senior Site Reliability EngineerJob Type: Contract - to - Hire ( C2H )Duration: 6 MonthsExperience: 7 - 10 YearsLocation: HyderabadWork Location: Hyderabad/ RemoteShift Timings : 6 PM to 3 AM ISTAbout Company:We collaborate with a wide range of clients, from startups to industry giants in sectors like Healthcare, Education, IT,...


  • Delhi, India Ushur Full time

    Location: BangaloreExperience: 6-8 YearsWork Mode: Hybrid/RemoteThe RoleSenior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also proactively work...


  • Delhi, India SwiftWIN | A Concord Company Full time

    Job Title: Site Reliability Engineer (SRE) - Azure DevOpsJob Overview:We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) with strong experience in Azure DevOps to join our dynamic team. The SRE will be responsible for maintaining the reliability, availability, and performance of our production environments, with a specific focus on...


  • Delhi, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...


  • Delhi, India Delphic (South Asia) Full time

    Job Title: Site Reliability Engineer (SRE)Location: RemoteJob Type: Full-timeExperience : 7 yearsIntroduction:We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will...


  • Delhi, India Delphic Full time

    Job Title: Site Reliability Engineer (SRE)Location: RemoteJob Type: Full-timeExperience : 7 yearsIntroduction:We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will...


  • Delhi, India IDEMIA Full time

    We are hiring for Site Reliability Engineer role at Noida location.Responsibility:- Involved in deploy/manage/operate of medium to large scale production systems- Understanding of Linux as a runtime environment- Familiar to Cloud native concepts and virtualisation- Familiar to CI/CD concepts and tools like Jenkins, Gitlab etc- Previous experience of working...


  • Delhi, India IDEMIA Full time

    We are hiring forSite Reliability Engineerrole atNoidalocation.Responsibility:Involved in deploy/manage/operate of medium to large scale production systemsUnderstanding of Linux as a runtime environmentFamiliar to Cloud native concepts and virtualisationFamiliar to CI/CD concepts and tools like Jenkins, Gitlab etcPrevious experience of working with Docker,...


  • Delhi, India IDEMIA Full time

    We are hiring for Site Reliability Engineer role at Noida location.Responsibility:Involved in deploy/manage/operate of medium to large scale production systemsUnderstanding of Linux as a runtime environmentFamiliar to Cloud native concepts and virtualisationFamiliar to CI/CD concepts and tools like Jenkins, Gitlab etcPrevious experience of working with...


  • Delhi, India K&K social resources and development GmbH Full time

    K&K Social Resources & Development GmbH is an international recruiting agency that has been providing technical resources in the European region since 1993. This position is with one of our clients in India who is actively hiring candidates to expand their teams.Title: Site Reliability EngineerLocation: India - RemoteEmployment Type: PermanentNotice...


  • Delhi, India K&K Social Resources And Development GmbH Full time

    K&K Social Resources & Development Gmb H is an international recruiting agency that has been providing technical resources in the European region since 1993. This position is with one of our clients in India who is actively hiring candidates to expand their teams.Title: Site Reliability EngineerLocation: India - RemoteEmployment Type: PermanentNotice...


  • Delhi, India Hirextra -World's First Staffing Aggregator Full time

    Job Description :- Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services.- Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management.- Experience in Prometheus, Grafana, Splunk, CloudWatch).- Automate routine operational tasks and cloud...


  • Delhi, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.What we are looking forRole: Site Reliability EngineerExperience Range: 8 – 12 YearsLocation: Pune & Chennai, Bangalore , DelhiMust-Have:Exceptional skills in...


  • Delhi, India Tata Consultancy Services Full time

    TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.What we are looking forRole: Site Reliability EngineerExperience Range: 8 – 12 YearsLocation: Pune & Chennai, Bangalore , DelhiMust-Have:Exceptional skills in...


  • Delhi, India Coforge Full time

    Job Title: Site Reliability EngineerSkills : SRE, CI/CD, AWS, Python, Terraform & KubernetesLocation: Hyderabad (Work from Office)Experience: 7-15 YearsNote: Immediate joiners are preferableJob Description:We at Coforge are hiring a Site Reliability Engineer with the following skillset:Design, implement, and manage scalable and secure cloud-based...


  • New Delhi, India AIVID.AI Full time

    Role Overview:We are seeking proactive and skilled Site Reliability Engineers (SREs) to manage clientdeployments, provide on-site support, and ensure the seamless functioning of our AI-basedcamera analytics systems. This hybrid role requires a mix of on-site visits and remote work.The selected candidates will operate from their respective regions—New...


  • Delhi, India Zscaler Full time

    About the role:Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185...


  • Delhi, India Zscaler Full time

    About the role:Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185...


  • Delhi, India Systal Technology Solutions Full time

    Site Reliability EngineerCompetitive Salary & BenefitsBangaloreSystal is a global managed network and security service and transformation specialist. We consult, deploy, and integrate multi-vendor technologies which help enterprise businesses maximize the security and value of their complex IT infrastructure. Across our 24/7 Network and Security Operations...


  • Delhi, India Systal Technology Solutions Full time

    Site Reliability EngineerCompetitive Salary & BenefitsBangaloreSystal is a global managed network and security service and transformation specialist. We consult, deploy, and integrate multi-vendor technologies which help enterprise businesses maximize the security and value of their complex IT infrastructure. Across our 24/7 Network and Security Operations...