Lytx - Senior Site Reliability Engineer - Incident Management
3 months ago
Job Description :
Why Lytx :
Join our dynamic and passionate team of driven, low-ego engineers who are at the forefront of designing and supporting cutting-edge IoT infrastructure. As we rapidly grow and transition to the cloud, we're diving into the exciting realms of "Operations as Code," "Infrastructure as Code," and innovative infrastructure automation.
Our Site Reliability Engineering (SRE) team is pivotal in ensuring the availability, reliability, observability, and resilience of Lytx' services, both on-premises and in the cloud. We're not just keeping the lights on-we're engineering the future of our business's continuity.
If you're energized by crafting transformative solutions and excel at designing robust, detailed cloud infrastructure with a focus on continuous improvement, this could be the perfect role for you.
Responsibilities :
System Design and Architecture :
- Design, implement, and maintain scalable and reliable systems, ensuring they can handle both current and future demands.
Incident Management :
- Lead incident response efforts, diagnose root causes, and implement long-term solutions to prevent recurrence.
- Ensure effective communication during outages.
Monitoring and Observability :
- Develop and maintain comprehensive monitoring and alerting systems to proactively identify and address issues before they impact users.
Automation and Efficiency :
- Automate repetitive tasks and processes to improve operational efficiency and reduce manual intervention.
Performance Tuning :
- Continuously optimize system performance, including fine-tuning applications, databases, and infrastructure to meet service level objectives (SLOs).
Capacity Planning :
- Forecast future system requirements based on growth trends and current usage, and plan capacity upgrades to ensure system reliability.
Collaboration and Mentoring :
- Work closely with development teams to integrate reliability into the software development lifecycle.
- Mentor junior SREs and share best practices.
Documentation and Knowledge Sharing :
- Create and maintain detailed documentation on system design, incident response procedures, and operational practices to ensure knowledge is preserved and accessible.
Requirements :
- 5+ years of experience as an SRE within AWS environments at medium to large-scale organizations.
- 3+ years of hands-on experience implementing and managing observability tools, such as Prometheus, New Relic, Grafana, or similar.
- Advanced programming skills in Python, Groovy, and Bash.
- Strong understanding of database technologies, including both SQL and NoSQL systems.
- 3+ years of experience developing and managing infrastructure deployment pipelines using Git, Terraform, Helm, Jenkins/Jenkins X/ArgoCD, or similar tools.
- Proven expertise in designing, evaluating, and supporting production environments in AWS, including VPCs, EKS, IAM, AMI, EC2, CloudWatch, CloudTrail, Control Tower, GuardDuty, MSK, S3, Glacier, Gateways, Direct Connect, Route 53, RDS, ALBs, Autoscaling, and more.
- Hands-on experience with Linux systems and protocols and technologies such as HTTP, REST, TCP/IP, SSL, DNS, SMTP, SSH, NTP, Load Balancing, SQL/NoSQL, Message Brokers, Nginx, Vault, etc.
- Extensive experience with Kubernetes and various container and cloud-native technologies.
- Significant experience in managing 24/7 on-call rotations, creating runbooks, establishing support procedures, and proactively monitoring systems across multiple geographic locations.
- Ability to thrive under pressure and excel in a technically challenging environment
-
Lytx - Senior DevOps Engineer - IAC Terraform
2 weeks ago
Bengaluru, India Lytx, Inc Full timeAt Lytx, our solutions process vast volumes of data and video from over 700,000 vehicles globally, delivering actionable insights through the power of Video Telematics, AI, and Machine Vision. As a Senior DevOps Engineer, you'll lead exciting projects, mentor rising talent, and design scalable, innovative solutions to impact an entire industry. What...
-
Lytx - Senior DevOps Engineer
2 weeks ago
Bengaluru, India Hirist.tech Full timeJob Description : Role : Senior Software Engineer DevOpsWhy Choose Lytx? If you are a seasoned and dynamic Senior DevOps Engineer, Lytx welcomes you to join our ambitious team in redefining transportation safety. At Lytx, our innovative technology processes vast volumes of video and data from over 700,000 vehicles globally, delivering top-notch safety...
-
Lytx - Senior Manager - Data Engineering
2 weeks ago
Bengaluru, India Hirist.tech Full timePosition : Senior Manager Data EngineeringExperience : 12+Location : Bangalore.Mode : HybridJob Description :Why Lytx :Thisrolewillhave a positiveimpacton Data Engineering anddeliveryas well asteam andemployee development.If you enjoyhelpingteams reach their potential, workingoninteresting technicalchallengesand deliveringproducts that save lives,thisrolemay...
-
Senior Site Reliability Engineer
3 weeks ago
Bengaluru, India SolarWinds Full timeAt SolarWinds, we put people first. Our mission is to help customers accelerate business transformation through simple, powerful, and secure solutions. We value collaboration, accountability, and innovation, and we're looking for someone who shares those values to join our team.The Role:We are looking for a Senior Site Reliability Engineer (SRE) with...
-
Lytx - Data Engineer III - Python Programming
3 months ago
Bengaluru, India Lytx, Inc Full timeJob Description :Data Engineer has full accountability to design and develop rich data-driven frameworks, applications and services. This role will be responsible for development of applications on the Cloud platform and migration of existing on-premise data systems and processes to the Cloud, while also engaging with the Teams navigate through Technology...
-
Site Reliability Expert
1 month ago
Bengaluru, Karnataka, India SolarWinds Full timeAbout This RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at SolarWinds. This role requires expertise in designing, building, and maintaining scalable and reliable cloud infrastructure using AWS, Azure, Kubernetes, and GitOps.As a Senior Site Reliability Engineer, you'll work closely with our SRE team to develop and...
-
Senior Site Reliability Engineer
4 weeks ago
Bengaluru, India SolarWinds Full timeAt SolarWinds, we put people first. Our mission is to help customers accelerate business transformation through simple, powerful, and secure solutions. We value collaboration, accountability, and innovation, and we're looking for someone who shares those values to join our team.The Role:We are looking for aSenior Site Reliability Engineer (SRE)with expertise...
-
Senior Site Reliability Engineer
1 month ago
Bengaluru, India SolarWinds Full timeAt SolarWinds, we put people first. Our mission is to help customers accelerate business transformation through simple, powerful, and secure solutions. We value collaboration, accountability, and innovation, and we're looking for someone who shares those values to join our team. The Role: We are looking for a Senior Site Reliability Engineer (SRE) with...
-
Bengaluru, India Hirist.tech Full timeWhy Lytx : As Senior Software Engineer, youll work with industry leading Lytx Safety Software, which manages massive amounts of video and data collected from over 700,000 vehicles worldwide and ensures the best possible outcomes by identifying behaviors and events that impact safety. Be responsible for building solutions that collect, organize, and present...
-
Bengaluru, India Hirist.tech Full timeWhy Lytx : Do you want to join a team of hungry, humble, and capable people and dedicate your time and talent to making a difference in our world? At Lytx, youll work to apply innovative technology to improve safety and help save lives on our roadways! Being part of a market-leading, medium-sized technology company means that there's room for you to...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...
-
Site reliability engineer
1 week ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...
-
Senior Site Reliability Engineer
4 weeks ago
Bengaluru, India SolarWinds Full timeAt SolarWinds, we put people first. Our mission is to help customers accelerate business transformation through simple, powerful, and secure solutions. We value collaboration, accountability, and innovation, and we're looking for someone who shares those values to join our team.The Role:We are looking for a Senior Site Reliability Engineer (SRE) with...
-
Site Reliability Engineer
1 day ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...
-
Site reliability engineer
6 hours ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5 Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible...
-
Bengaluru, India Hirist.tech Full timeWhy Lytx : Lytx is looking for a highly skilled Staff DevOps Engineer to join our dynamic team. As a Staff DevOps Engineer, you will be pivotal in designing, implementing, and maintaining our continuous integration and deployment pipelines, infrastructure, and tools. You will also get the opportunity to act as a subject-matter guide on core AWS services,...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...
-
Site Reliability Engineer
1 day ago
Bengaluru, India Tranzeal Incorporated Full timeJob Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...
-
Site Reliability Engineer
1 day ago
Bengaluru, India BCE Global Tech Full timeAt BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go.If you are passionate about technology and eager to make a difference, we want to hear from you! Apply...
-
Senior Site Reliability Engineering Manager
1 week ago
Bengaluru, Karnataka, India AMEX Full timeAbout the Role: We are seeking an experienced Senior Site Reliability Engineer to lead our team in delivering high-quality, reliable technology solutions. The ideal candidate will have a deep understanding of observability tools and methodologies, as well as strong leadership and people management skills. About Us: American Express is a global leader in...