Senior Specialist

4 days ago


Mumbai, Maharashtra, India | Datavail Infotech | Full time
Description

Job Title: Senior Specialist Site Reliability Engineer (SRE) – Automation & Observability

Education: Any Degree

Experience: 8–10 years

Job Location: Mumbai

Role Level: Senior Individual Contributor (Customer-facing)

Role Overview:

We are seeking a Senior Specialist Site Reliability Engineer (SRE) to own and continuously improve the reliability, availability, scalability, and performance of business-critical services across multi-cloud environments (AWS, Azure, GCP).

This role combines strong SRE fundamentals, automation engineering, and observability expertise with customer leadership. You will work closely with customer engineering teams to embed reliability into application design, drive automation, lead incident response, and demonstrate measurable SRE outcomes through dashboards and metrics.

Key Responsibilities:

Reliability Engineering & SRE Practices:

  • Define, implement, and maintain Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets for critical services.

  • Continuously monitor SLO compliance and drive improvements based on error budget consumption.

  • Participate in architecture reviews focused on high availability, disaster recovery, scalability, and fault tolerance.

Incident, Problem & Change Management:

  • Lead incident response, acting as the Tier-3 escalation point for SRE and operations teams.

  • Drive blameless postmortems and Root Cause Analysis (RCA), and ensure corrective and preventive actions are implemented.

  • Define and maintain incident response runbooks, escalation paths, and on-call processes.

  • Track and improve key reliability metrics including MTTR, incident frequency, and change failure rate.

Automation & Infrastructure as Code:

  • Automate infrastructure provisioning and operational workflows using Terraform, CloudFormation, and AWS CDK.

  • Build and maintain CI/CD pipelines supporting canary deployments, blue/green strategies, and automated rollbacks.

  • Implement event-driven automation and auto-remediation using AWS Lambda, Step Functions, or Azure Functions.

  • Continuously identify and eliminate operational toil through automation and self-healing systems.

Monitoring, Observability & Logging:

  • Design, implement, and operate end-to-end observability platforms covering metrics, logs, and traces.

  • Hands-on experience with:

    • New Relic / Datadog for APM, distributed tracing, and SLO tracking

    • Prometheus for metrics collection

    • Grafana for dashboards and SRE scorecards

    • Graylog / ELK for centralized logging and RCA

  • Ensure alerts are SLO-driven, actionable, and noise-free.

  • Build customer-facing dashboards to clearly demonstrate SRE service outcomes.

Cloud Infrastructure & Platform Reliability:

  • Provision and manage cloud infrastructure across AWS, Azure, and/or GCP.

  • Operate compute, storage, networking, load balancers, VPNs, and private connectivity.

  • Manage patching, backups, encryption, IAM/RBAC, and disaster recovery readiness.

  • Optimize performance and cost through rightsizing, autoscaling, and capacity planning.

  • Ensure reliability of data platforms such as MongoDB / MongoDB Atlas, Elasticsearch / OpenSearch, MySQL (RDS), and DocumentDB.

Customer Engagement & Mentorship:

  • Act as the primary technical contact for assigned customer accounts.

  • Lead reliability and observability discussions with customers and internal stakeholders.

  • Mentor mid-level and junior SREs, conducting reliability-focused design and operational reviews.

  • Maintain high-quality documentation, runbooks, SOPs, and operational playbooks.

Required Qualifications:

  • 8–10 years of experience in SRE, Cloud Engineering, or Production Operations roles.

  • Strong OS fundamentals in Linux and Windows, with scripting experience (Bash, PowerShell).

  • Strong programming skills in Python, Go, or equivalent.

  • Proven hands-on experience with:

    • Infrastructure as Code (Terraform, CloudFormation, CDK)

    • CI/CD pipelines and deployment automation

    • Observability tools (New Relic, Datadog, Prometheus, Grafana, Graylog, ELK)

    • Distributed systems at production scale

  • Cloud certifications (one or more):

    • AWS (Associate or Professional)

    • Azure (AZ-104 / Architect Expert)

    • GCP (Professional Cloud Architect)

    • Cloud-agnostic certification such as Terraform Associate, CKA, or SRE Foundation

Nice-to-Have Skills:

  • Experience with multi-cloud or hybrid architectures.

  • Exposure to cross-region or cross-cloud data replication.

  • Hands-on experience with chaos engineering or fault injection.

  • Knowledge of ITIL, Agile, or SRE maturity models.

  • Experience with serverless architectures (AWS Lambda, Azure Functions).


