System Reliability Engineer

2 weeks ago


india Fulcrum Digital Full time
Job Description

Who are we
Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and manufacturing.

 

The Role

  • Plan, manage, and oversee all aspects of a Production Environment 
  • Define strategies for Application Performance Monitoring, Optimization in Prod environment
  • Respond to Incidents and improvise platform based on feedback and measure the reduction of incidents over time.
  • Support deployment of code into multiple lower environments.  Supporting current processes with an emphasis on automating everything as soon as possible.
  • Design, develop and standardize Monitoring and Alerting mechanism for the supported applications.
  • Take a holistic approach to problem solving, by connecting the dots during a production event through the various technology stack that makes up the platform, to optimize meantime to recover.
  • Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement.
  • Analyse ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns.
  • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
  • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation and evolving systems by pushing for changes that improve reliability and velocity.
  • Work with a global team spread across tech hubs in multiple geographies and time zones.
  • Ability to share knowledge and explain processes and procedures to others.
  • Share knowledge and mentor junior resources
  • Able to perform on-call duties on a rotational basis.
  • Occasional off hours work required.

 

 

 


Requirements

Skills 

Must Have

  • Linux
  • Shell Scripting
  • ITIL / ITSM
  • PL/SQL - 
  • Application Troubleshooting
  • Any Cloud knowledge / experience
  • Any Monitoring tool (Preferred Splunk/Dynatrace)
  • Jenkins - CI/CD - Basic
  • Groovy Scripting/Yaml - Good to have
  • Git basic/bit bucket - Good to have
  • Ansible/Chef - Good to have

Good To Have

  • Even Framework architecture

 

 


Benefits

 

 


Requirements
• Linux • Shell Scripting • ITIL / ITSM • PL/SQL • Application Troubleshooting • Splunk/Dynatrace

  • india System Soft Technologies Full time

    Job Summary: The client is looking for a SRE Engineer: An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, they ensure the reliable and efficient operation of an organization's systems and services. Responsibilities: Detect issues. Automatically handle failures....


  • India System Soft Technologies Full time

    Job Summary:The client is looking for a SRE Engineer:An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, they ensure the reliable and efficient operation of an organization's systems and services.Responsibilities:Detect issues.Automatically handle failures.Prepare...


  • India System Soft Technologies Full time

    Job Summary:The client is looking for a SRE Engineer:An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, they ensure the reliable and efficient operation of an organization's systems and services.Responsibilities:Detect issues.Automatically handle failures.Prepare...


  • India System Soft Technologies Full time

    Job Summary: The client is looking for a SRE Engineer: An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, they ensure the reliable and efficient operation of an organization's systems and services. Responsibilities: Detect issues. Automatically handle failures....


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • india System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • India System Soft Technologies Full time

    Job Summary The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job Summary The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job SummaryThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Job Summary The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE Applications written in .NET (python or any other scripting would be good) we need more of a dev background then operations. Automation experience: Ansible preferred but good with Terraform as well. Doesn’t need to come from a 24x7 environment but needs to be okay working in that environment. AWS preferred but...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer100% REMOTEApplications written in .NET (python or any other scripting would be good) we need more of a dev background then operations.Automation experience: Ansible preferred but good with Terraform as well.Doesn’t need to come from a 24x7 environment but needs to be okay working in that environment.AWS preferred but any...


  • India IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • india IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • India IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • India Unilog Full time

    Job Title : Site Reliability EngineerJob Summary :As a Site Reliability Engineer (SRE) specializing in Google Cloud Platform (GCP), you will be responsible for designing, implementing, and maintaining highly scalable and reliable systems. You will collaborate with development teams to ensure that applications are designed with reliability and performance in...


  • india Thoucentric Full time

    Job Description Job Description:We are seeking a skilled and dedicated Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, performance, and scalability of our systems and applications. This role combines software development and systems engineering to build and run large-scale, distributed,...