
Reliable System Solutions Expert
1 day ago
Job Title: Site Reliability Engineer
We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems and applications.
About UsWe are a global leader in safety, innovation, reliability, and customer experience. Our employees have led the airline industry in operational excellence while maintaining our reputation for award-winning customer service.
Our mission is to connect people and cultures of the globe, fostering understanding across a diverse world and serving as a force for social good.
Key Responsibilities- Incident Management: Execute on Incident, Change Management, Problem Management processes
- Application Development: Building and supporting a reliable application suite for the environment in order to meet the development and maintenance requirements of systems/platforms.
- Technical Support: Provide consultation and direct technical support in life cycle planning, problem management, integration, and systems programming.
- Performance Monitoring: Evaluate platform performance and availability meet enterprise objectives through monitoring, timely service restoration, and tuning.
- Automation: Working to improve and implement automation of applications tasks.
- Troubleshooting: Providing technical support for systems/platforms according to application SLA's.
- Resiliency Engineering: Responsible for designing and developing resiliency in the application code, troubleshooting incidents, engaging with squads to address failure patterns, and participating in incident management.
- Problem-Solving: Strong Troubleshooting ability required.
- Communication: Leads calls or contributes in a logical fashion.
- Predictive Maintenance: Focus on resolving issues before they become incidents.
- Risk Assessment: Identify and articulate severity of impacts using provided monitoring tools and escalate as needed.
- Root Cause Analysis: Able to understand architecture and design of applications and identify or narrow focus for an incident based on symptoms.
- System Recovery: Perform root cause analysis to quickly recover from service interruptions, and to prevent recurring problems.
- Platform Tuning: Monitor, manage, and tune platforms to ensure expected availability and performance levels are achieved.
- Monitoring: Identify gaps in monitoring or documentation and reaches out to appropriate teams to fill those gaps.
- Change Management: Implement changes to platforms with minimal impact to the business by following enterprise standards and procedures.
- Bachelor's Degree: Bachelor's degree or industry certification in an applicable IT field, in addition to 3 years applicable experience in the design/administration/support of one or more platforms or Bachelor's degree in an IT field, in addition to two years applicable experience in the Design / administration / support of one or more platforms.
- Systems Experience: 3+ years of experience as a Systems Engineer or Site Reliability Engineer.
- Ops Automation: 3+ years of experience with ops automation using a scripting language such as Python or Ansible - Must Have with either one.
- Site Reliability Engineering Knowledge: Knowledge of Site Reliability Engineering: Theories and methodologies of reliability engineering; ability to design, develop and support various tools, services and applications to maintain a reliable site Environment.
- Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication.
- CI/CD Pipeline: Knowledge of concepts, values and tools applied in building Continuous Integration (CI), Continuous Delivery and Continuous Deployment (CD) pipeline; ability to design, build, implement and maintain CI/CD pipelines to achieve the automation of software delivery process.
- Kubernetes and AWS: Kubernetes and AWS - Must.
- Docker: Docker (Good to have).
- Software Release Management: Knowledge of strategies, practices and tools for managing versions and distribution of software products and enhancements; ability to evaluate and improve release management practices and tools.
- Application Maintenance: Knowledge of production applications; ability to monitor application functions and resolve issues to maintain optimal conditions for system applications.
- Software Engineering: Knowledge of software engineering; ability to deliver new or enhanced software products.
- Agile Development: Knowledge of agile methodologies and the agile development lifecycle ability to utilize formal agile methodologies, disciplines, practices and techniques for the delivery of new and enhanced applications.
- Master's Degree: Master's degree in Computer Science, Information Technology or related field is preferred.
- VMWare VDI Experiences: Experiences and exposure to VMWare VDI implementations a huge plus.
- Dynatrace APM and Synthetic Monitoring: Experiences with Dynatrace APM and synthetic monitoring.
- Airline Applications and Infrastructure Technology: Experiences with airline applications and infrastructure technology is a plus.
-
Reliability Expert Leader
3 days ago
Rajahmundry, Andhra Pradesh, India beBeeReliability Full time US$ 1,54,662 - US$ 2,40,925About the Role:We are seeking a skilled reliability expert to join our team. The ideal candidate will have a proven track record of delivering high-quality solutions and contributing to long-term objectives.Key Responsibilities:Develop innovative solutions to drive business successCollaborate with cross-functional teams to achieve operational...
-
Site Reliability Expert
17 hours ago
Rajahmundry, Andhra Pradesh, India beBeeSiteReliability Full time ₹ 15,00,000 - ₹ 20,00,000Job Title:A highly skilled professional in site reliability engineering is sought after to join our team.">Design and support scalable, reliable, and resilient systems on AWS.Contribute to platform engineering projects to create automated solutions for deployment, scaling, and operations.Analyze system performance and provide recommendations for optimization...
-
Reliable Systems Engineer
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeSre Full time US$ 1,50,000 - US$ 2,50,000Job Title: SRE Lead (Engineering & Reliability)We are seeking an experienced and dynamic Site Reliability Engineering (SRE) leader to oversee the reliability, scalability, and performance of our critical systems. As an SRE lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation,...
-
Site Reliability Engineer
4 days ago
Rajahmundry, Andhra Pradesh, India DigiHelic Solutions Pvt. Ltd. Full timeSite Reliability Engineer Location: TrivandrumExperience: 5 YearsMandatory Skills: Gcp,Aws,Jenkins,KubernetesJob descriptionSeeking a highly skilled Site Reliability Engineer (SRE) to work with one of the leading financial services organizations in the US. This role involves managing the end-to-end application and system stack, ensuring high reliability,...
-
Site Reliability Engineer Specialist
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeReliability Full time ₹ 70,00,000 - ₹ 1,05,00,000System Reliability Expert RoleCutting-edge software development drives personalized product creation for millions of global customers. Modular services comprise the Mass Customization Platform.As a leading provider of custom marketing solutions, we foster connections between businesses and their customers worldwide. Our expertise lies in personalized...
-
Leadership Roles in System Reliability
1 day ago
Rajahmundry, Andhra Pradesh, India beBeeReliabilityEngineer Full time ₹ 2,56,25,000 - ₹ 3,37,50,000Senior Reliability EngineerWe are seeking a seasoned Senior Reliability Engineer to join our team. The ideal candidate will have 12+ years of experience in the field of computer science, information systems, or a related field.Define and drive a reliability strategy that promotes an 'Automate-first' culture in operating services through reduction of...
-
Rajahmundry, Andhra Pradesh, India beBeeSystemReliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Job Description:We are seeking a highly skilled SRE & DevOps Engineer to join our team. The ideal candidate will have extensive experience in supporting large-scale AI/ML infrastructure, enabling researchers and data scientists to innovate at a global scale.Key Responsibilities:Support and scale AI platform services for global teams.Ensure secure, high...
-
SAP Global Trade Solutions Expert
1 day ago
Rajahmundry, Andhra Pradesh, India beBeeCustomization Full time ₹ 1,00,00,000 - ₹ 1,50,00,000Global Trade Solutions ExpertAbout the Role:Design, implement, and test cutting-edge Global Trade Solutions using SAP GTS.Manage master data and configure custom settings to ensure seamless integration with external systems.Develop and maintain integrations with various external systems to enhance business processes.Provide expert training and support to...
-
Site Reliability Lead
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeReliability Full time ₹ 23,00,000 - ₹ 25,00,000Job Title: Senior Site Reliability EngineerWe are seeking a seasoned professional to join our SRE team. As part of our digital transformation journey, we are investing in automation and reliability engineering to enhance system resilience and reduce production outages.Main Responsibilities:Investigate and resolve high-impact production issues across...
-
Financial Systems Expert
1 day ago
Rajahmundry, Andhra Pradesh, India beBeeFinancial Full time ₹ 1,80,00,000 - ₹ 2,40,00,000Financial Systems SpecialistThe ideal candidate will possess a strong foundation in accounting, finance, and technology, with expertise in customized financial solutions and software applications.Main Responsibilities:Act as the subject matter expert (SME) for financial systems functionality in financial processes, maintaining configurations, data integrity,...