Senior Site Reliability Engineer

3 days ago


mumbai, India NextLevel Full time

This job role is by one of our hiring partners on the NextLevel platform. Please apply for prioritised shortlisting.


As a Senior Site Reliability Engineer, you will lead the design, implementation, and maintenance of AWS infrastructure and tools critical for the continuous deployment and monitoring of our applications. You will collaborate with cross-functional teams, leveraging your extensive AWS experience to enhance system reliability, performance, and efficiency. You will also play a key role in our migration from our current cloud hosting partner to AWS.


Key Responsibilities:

AWS Infrastructure Design:

  • Lead the design and implementation of scalable, reliable, and secure AWS infrastructure.
  • Architect solutions to maximize the benefits of AWS services.
  • Upgrade Apache web servers for improved performance and security.
  • Oversee database upgrades, ensuring minimal downtime and data integrity.
  • Manage application server upgrades to enhance overall system efficiency.

Automation and AWS Tooling:

  • Develop and maintain automation tools for deployment, monitoring, and operations on AWS.
  • Implement and enhance infrastructure as code (IaC) using AWS CloudFormation or similar tools.

Service Availability Monitoring and Incident Response:

  • Set up and maintain monitoring solutions on AWS to proactively identify and address system issues.
  • Respond to and resolve incidents, ensuring minimal downtime and impact on users.
  • Engage in major incident responses, leveraging monitoring tools to debug and resolve issues.
  • Prepare root cause analyses (RCA) for incidents and ensure preventive measures are implemented.
  • Monitor minor incidents to identify trends and prevent escalation to major incidents.

AWS Best Practices:

  • Enforce AWS best practices for security, performance, and cost optimization.
  • Stay current with AWS advancements and integrate relevant technologies into our infrastructure.

Collaboration and Communication:

  • Work closely with development, operations, and QA teams to foster a DevOps culture.
  • Communicate AWS-related insights, recommendations, and project status effectively.
  • Facilitate the upgrade of Kafka and other essential tools within the solution engineering framework.
  • Engage in change planning with the cloud team for seamless upgrades and troubleshooting.

Cloud Security:

  • Implement and maintain Akamai Edge Security and WAF measures for optimal protection.
  • Oversee monitoring activities to proactively identify and address security vulnerabilities.
  • Conduct cloud security checks and upgrade planning in collaboration with the solution team.
  • Manage DDOS, WAF, Edge firewall, and network security tasks, including continuous monitoring.
  • Coordinate corrective actions with the cloud team/AWS to ensure a secure cloud environment.

High Traffic Events:

  • Evaluate infrastructure needs for high-traffic events, ensuring appropriate sizing and scaling.
  • Monitor traffic patterns and collaborate with cloud architects to optimize performance.

FinOps Cost Management:

  • Monitor storage utilization and implement strategies to optimize costs.
  • Oversee infrastructure utilization, controlling costs through effective monitoring.
  • Optimize resource consumption by monitoring CPU, memory, RAM, and other parameters.
  • Conduct regular checks on data storage to ensure efficient utilization.


Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 6-10 years of hands-on experience as a Site Reliability Engineer, with a focus on AWS.
  • Extensive experience with AWS, cloud infrastructure, AWS cloud security, high-traffic events, and FinOps cost management.
  • Proficiency in scripting languages (e.g., Python, Bash) and experience with AWS SDKs.
  • In-depth knowledge of AWS services and a proven track record of implementing solutions on AWS.
  • Experience with container orchestration tools (e.g., Kubernetes, Docker Swarm) on AWS.
  • Understanding of web, middleware, and database technologies such as Apache, Wildfly, MySQL, and Kafka.
  • Familiarity with cloud security measures and high-traffic event management.
  • Knowledge of FinOps principles and cost management in cloud environments.
  • Strong problem-solving and troubleshooting skills.
  • Excellent communication and collaboration skills.



  • Mumbai, Maharashtra, India RELX India (Pvt) Ltd Risk div Company Full time

    Senior Site Reliability Engineer I Would like to be part of Collaborative and friendly team? Would you like to be part of a rewarding project?


  • Mumbai, India NextLevel Full time

    This job role is by one of our hiring partners on the NextLevel platform. Please apply for prioritised shortlisting.As a Senior Site Reliability Engineer, you will lead the design, implementation, and maintenance of AWS infrastructure and tools critical for the continuous deployment and monitoring of our applications. You will collaborate with...


  • mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners8 to 9 years for Hyderabad Locationfor a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience inSRE, GCP and Kubernetes , send me your updated cv : find below the...


  • Mumbai, Maharashtra, India SID Global Solutions Full time

    Dear Candidates, We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please find...


  • Mumbai, India Morningstar Full time

    Title:Senior Site Reliability EngineerShift: GeneralThe Group:At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates, We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India SID Global Solutions Full time

    Dear Candidates,We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated cv :...


  • Mumbai, India Jio Full time

    Site Reliability Engineer (SRE) Job Overview As a Site Reliability (SRE) / DevOps Engineer, you will be responsible for the availability, automation, performance, efficiency, and scaling, monitoring and emergency response for any incidents / issues in Applications. You will use your deep understanding of platforms, architecture, people, systems, and...


  • Mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...


  • Mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...


  • Mumbai, Maharashtra, India Morningstar Full time

    Title: Senior Site Reliability Engineer The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors' needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...


  • mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth Platform allows...


  • Mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer Shift : General The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar Wealth...


  • mumbai, India Morningstar Full time

    Title: Senior Site Reliability Engineer Shift : General The Group: At Morningstar, we strive to Empower Investor Success. We tirelessly pursue new ways to combine our data and research with design and technology to help solve investors’ needs. Our solutions pave the way for investors to achieve their goals with confidence. The Morningstar...


  • Mumbai, Maharashtra, India Jio Full time

    Site Reliability Engineer (SRE) Job Overview As a Site Reliability (SRE) / DevOps Engineer, you will be responsible for the availability, automation, performance, efficiency, and scaling, monitoring and emergency response for any incidents / issues in Applications. You will use your deep understanding of platforms, architecture, people, systems, and...


  • Mumbai, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer Job Description: Key responsibilities:Troubleshoots and owns issues in our development, test and production environments. Including performance optimisation and continuous tuningWorks alongside the DevOps team in...