
Reliable Systems Engineer
15 hours ago
We are seeking an expert in site reliability engineering to join our team.
Key Responsibilities:- Discover, design, and implement changes to existing infrastructure with a focus on improved reliability, performance, and standardization.
- Collaborate with cross-functional teams to translate customer, business, and technical requirements into architectural designs and enhancements.
- Ensure efficient resource utilization and continuously improve processes leveraging automation and internal tools resulting in enhanced service delivery, maturity, and scalability.
- Troubleshoot production issues providing root cause analysis and designing solutions to prevent future occurrences.
- Monitoring of services and creating intelligent alarming for quicker incident detection and resolution.
- Maintain vulnerability management processes and policies using a risk-based priority methodology.
- Collaborate with various teams and platform owners on all vulnerability management and reporting.
- Mentor and coach other SRE team members.
- Strategically apply architectural and infrastructure disciplines to solve business problems.
- Participate in an on-call rotation.
- Extensive experience with a wide range of infrastructure technologies such as Linux, Windows, high-performance computing, storage platforms, networking, cloud computing, cloud services (IaaS, PaaS, SaaS), virtualization, OpenStack, containerization, and orchestration technologies (e.g., Docker, Kubernetes).
- Deep understanding of IT infrastructure related services and their dependencies required to troubleshoot issues and define mitigations.
- Solid experience with the administration, security hardening, and performance tuning of Linux and Windows OS. In-depth knowledge of CIS benchmarking standards.
- Experience with developing service level indicators and objectives, instrumenting software, and building alerts.
- An understanding of software engineering fundamentals with experience developing software with a team of engineers. Strong experience in the practice of testing.
- Experience with the operations, administration, and development of orchestration systems such as Kubernetes, ECS, Mesos.
- Passion for tracking down technical root causes of distributed systems and software.
- Experience with ITAM, Service Mapping, and CMDB (service-now).
- Strong technical foundation with the ability to engage deeply on technical topics related to data center and cloud infrastructure, software reliability, and operational practices.
- Proficiency in ITIL processes and frameworks.
- Service availability-oriented mindset with a pro-active approach to problem solving. An ideal candidate should be able to develop automated solutions to prevent recurring problems.
- Possesses the ability and willingness to challenge the status-quo and optimize current processes and procedures.
- At least 5 years of experience in site reliability engineering or related field.
- Strong understanding of system architecture and design principles.
- Excellent communication and collaboration skills.
- Ability to work effectively in a fast-paced environment.
- Proven track record of delivering high-quality results under pressure.
-
System Reliability Engineer
5 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 1,50,00,000**System Reliability Engineer Opportunity**This is an exciting opportunity to join a team as a System Reliability Engineer. We are seeking a highly motivated and experienced individual to ensure the overall stability of our production application.The successful candidate will be responsible for ensuring the reliability, availability, scalability, and...
-
Principal Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India Cubic Transportation Systems Full timeHiring Principal Site Reliability Engineer Experience: 12+ Years Location: Hyderabad Notice: Immediate to 30 Days We're seeking an experienced Site Reliability Engineer (SRE) to ensure our services are robust, scalable, secure, and maintainable. You will blend software engineering and systems operations to automate processes, monitor performance, lead...
-
Principal Site Reliability Engineer
19 hours ago
Hyderabad, Telangana, India Cubic Transportation Systems Full time ₹ 15,00,000 - ₹ 20,00,000 per yearHiring Principal Site Reliability EngineerExperience: 12+ YearsLocation: HyderabadNotice: Immediate to 30 DaysWe're seeking an experiencedSite Reliability Engineer (SRE)to ensure our services are robust, scalable, secure, and maintainable. You will blend software engineering and systems operations to automate processes, monitor performance, lead incident...
-
Operations Engineer
6 days ago
Hyderabad, Telangana, India beBeeSystemReliability Full time ₹ 15,00,000 - ₹ 25,00,000Job OverviewThis role focuses on maintaining the stability of our production applications. Ensuring reliability, availability, scalability, and efficiency in our systems and platforms is crucial.Maintain high uptime and performance of production systems, applications, and infrastructure.Troubleshoot operational issues, providing timely escalation to...
-
Site Reliability Senior Engineer
2 days ago
Hyderabad, Telangana, India Veeva Systems Full time ₹ 15,00,000 - ₹ 20,00,000 per yearVeeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $2B in revenue in our last fiscal year with extensive growth potential ahead.At the heart of Veeva are our values: Do the Right Thing, Customer...
-
System Reliability Expert
1 week ago
Hyderabad, Telangana, India beBeeSoftwareEngineer Full time ₹ 1,80,00,000 - ₹ 2,40,00,000Reliable System SpecialistOur organization seeks a highly skilled specialist to enhance system reliability and performance. This key role will be responsible for designing, implementing, and maintaining scalable infrastructure to support applications and services.The ideal candidate will have expertise in software engineering concepts and applied experience...
-
System Reliability Specialist
2 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 30,00,000 - ₹ 40,00,000We are seeking a skilled System Reliability Specialist to join our team. As a System Reliability Specialist, you will play a critical role in ensuring the performance and reliability of our systems.- Design and implement Service Level Agreements (SLAs), Service Level Indicators (SLIs), and error budgets to improve system reliability.- Monitor and optimize...
-
Reliability Engineer
6 days ago
Hyderabad, Telangana, India JLL Full timeJob DescriptionJLL is seeking a Reliability Engineer to join our teamThis exciting opportunity is responsible for providing reliability engineering support for operations and maintenance of buildings, infrastructure, and equipment assets. In coordination and full collaboration with the Engineering Services Reliability & Asset Management COE, the Reliability...
-
System Reliability Engineer
3 days ago
Hyderabad, Telangana, India beBeeAzure Full time ₹ 1,50,00,000 - ₹ 2,50,00,000System Reliability Engineer (SRE) - Azure SpecialistThis role is for a skilled System Reliability Engineer with expertise in Core Azure Services, IoT, Event Hub, Databricks, and experience with Kubernetes, Docker, and Python/Powershell scripting.The ideal candidate will have strong knowledge of monitoring tools, including ELK, alerting, and logging systems....
-
Hyderabad, Telangana, India Cubic Transportation Systems Full timeHiring Principal Site Reliability EngineerExperience: 12+ YearsLocation: HyderabadNotice: Immediate to 30 DaysWe're seeking an experienced Site Reliability Engineer (SRE) to ensure our services are robust, scalable, secure, and maintainable. You will blend software engineering and systems operations to automate processes, monitor performance, lead incident...