Wealthy - Site Reliability Engineer - Kubernetes
2 weeks ago
Job Description :
- Design, implement, and maintain reliable containerized applications using Kubernetes on GCP.
- Develop and optimize SLIs, SLOs, and SLAs for critical systems and services.
- Create and maintain automation for deployment, scaling, and management of applications and infrastructure.
- Implement and manage observability solutions, including monitoring, logging, and alerting systems.
- Conduct capacity planning and performance optimization for Kubernetes clusters and GCP resources.
- Collaborate with development teams to improve application reliability, scalability, and performance.
- Implement and maintain disaster recovery and business continuity plans.
- Continuously improve system reliability through chaos engineering and proactive testing.
- Optimize resource utilization and cost management within GCP.
- Implement security best practices for Kubernetes clusters and GCP services.
- Stay up-to-date with the latest developments in site reliability engineering, Kubernetes, and GCP services.
- Document processes, runbooks, and best practices for maintaining system reliability.
- Serve as the primary point of contact for interactions with Google Cloud support and other service providers.
- Collaborate with Google Cloud representatives to optimize our use of GCP services and stay informed about new features and best practices.
- Manage relationships with other cloud and service providers, ensuring optimal integration and utilization of their services.
- Rapidly learn and adapt to new technologies and tools as needed.
- Proactively explore and experiment with new technologies and methodologies to improve system reliability and efficiency.
- Implement and manage GitOps workflows using ArgoCD for Kubernetes deployments.
- Design, develop, and maintain Helm charts for streamlined application deployments.
Requirements :
- Strong experience with Kubernetes, including deployment, scaling, and management of containerized applications.
- Extensive knowledge of Google Cloud Platform (GCP) services and best practices.
- Solid understanding of containerization technologies, particularly Docker.
- Experience with monitoring and observability tools (e.g, Prometheus, Grafana, Alerting).
- Strong knowledge of version control systems, particularly Git.
- Deep understanding of networking concepts, including VPNs, VPCs, and gateways.
- Familiarity with database administration, particularly cloud-native database solutions.
- Strong analytical skills with a focus on system reliability and performance optimization.
- Excellent communication skills, including the ability to effectively interact with external service providers and translate technical concepts for non-technical stakeholders.
- Experience in vendor management or working closely with cloud service providers.
- Ability to work in a collaborative team environment.
- Demonstrated ability to quickly learn and adapt to new technologies and tools.
- Familiarity with CI/CD concepts and ability to work with various CI/CD tools.
- Natural curiosity and a tinkerer's mindset, with a passion for understanding how systems work at a deep level.
- History of personal projects or contributions to open-source projects (preferred).
- Ability to think creatively and approach problems from multiple angles.
- Proficiency with ArgoCD and GitOps principles for managing Kubernetes deployments.
- Strong experience with Helm for packaging and deploying Kubernetes applications.
- Certifications such as Certified Kubernetes Administrator (CKA) or Google Cloud Professional DevOps Engineer are a plus.
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Yogy HR Solutions Full timeSite Reliability Engineer Exp : 6-12 years Location : Bangalore Mandatory, Hybrid model from Bangalore location. Notice Period : Immediate to 30 days preferred. Roles and Responsibilities :- Work with development partners to shape the architecture, design, and implementation of new and existing systems to enhance their reliability, performance, efficiency,...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Tranzeal Incorporated Full timeSite Reliability Engineer Responsibilities : - Maintain and scale production services and servers across multiple data centers for complex cloud-based applications. - Improve scalability, reliability, capacity, and performance of production systems. - Write automation code using tools like Terraform and Ansible to provision and operate infrastructure at...
-
DevOps Engineer
3 weeks ago
Bangalore, Karnataka, India Catalyst IQ Full timeWe are a pioneering social sports betting platform in India focused on a free-to-play model. We're dedicated to enhancing user experience, fostering community engagement, and developing a loyal fan base.Overview :Join our team as a Site Reliability Engineer (SRE) / DevOps Engineer. Ensure the reliability, scalability, and performance of our social sports...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Altruist Technologies Full timeNeed banking domain experience.Role : Site Reliability EngineerJob Description :1. Run and monitor the production systems environment for high availability, reliability, and performance by taking a holistic view of systems health.2. Build software tools/automation scripts for periodic system admin tasks, spot failure patterns monitor service health, server...
-
Site Reliability Engineer
3 months ago
Bangalore, Karnataka, India Squareroot Consulting Pvt Ltd. Full timeSite Reliability EngineerLocation : Bangalore, IndiaDomain : CybersecurityBudget : 30 to 50 Lacks - We are looking for a hands-on devops engineer leading the design, implementation of devops/SRE practice for our infrastructure for data privacy.- The successful candidate will have experience implementing advanced DevOps & SRE techniques such as Auto...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Talpro Full timeAbout Us :Talpro is leading the way in transforming the talent acquisition landscape. We deliver innovative, sustainable, and cost-effective recruitment solutions tailored to today's business needs. Our mission is to offer comprehensive hiring strategies that address immediate recruitment demands while laying the groundwork for long-term success.Job...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Kiash Solutions LLp Full timeConsidering candidates that can onboard us in 0-15 days at our Bengaluru office. Skill : Lead Software Engineer - Devops Cloud. Note : Experience : 6.5+ Years. Mandatory Skill are : Devops + Cloud+ Migration + Ansible+ Terraform & Kubernetes +CI/CD, lead experience is needed. Location : Bangalore. Job Type - FTE. Notice Period - Immediate/ 15 Days. Budget -...
-
Senior Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Innovation Consulting Services Full timeWe are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist. The ideal candidate will be responsible for designing, implementing, and maintaining systems and processes that ensure the reliability, scalability, and disaster resilience of our infrastructure and applications. This role requires a good...
-
DevOps/Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India TETRAHED INC Full timeJob Description :We are looking for a talented and experienced DevOps/Site Reliability Engineer (SRE) with a strong proficiency in Python.Skill & Experience :- Collaborate with development teams to design, develop, and maintain infrastructure for our highly available and scalable applications.- Automate processes using Python scripting to streamline the...
-
Site Reliability Engineer
2 days ago
Bangalore, Karnataka, India Arting Digital Full timePosting title : Site Reliability Engineer Experience : 7+ Years Location : Bangalore Work mode : WFO Primary skills : Cloud Monitoring & Operations (GCP & Azure), Python, ServiceNowQualification : Any Engineering/ Computers degreeRoles & Responsibilities : Daily Operations & Monitoring : - Actively monitor systems, applications, and infrastructure across...
-
Bangalore, Karnataka, India Groww Full timeAbout Groww :- We are a passionate group of people focused on making financial services accessible to every Indian through a multi-product platform.- Each day, we help millions of customers take charge of their financial journey.- Customer obsession is in our DNA.- Every product, every design, every algorithm down to the tiniest detail is executed keeping...
-
Senior DevOps Engineer
3 weeks ago
Bangalore, Karnataka, India NOMISO INDIA PRIVATE LIMITED Full timeAbout Company : Nomiso is a product and services engineering company. We are a team of Software Engineers, Architects, Managers, and Cloud Experts with expertise in Technology and Delivery Management. Our mission is to Empower and Enhance the lives of our customers, through efficient solutions for their complex business problems. At Nomiso we encourage...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Cyitechsearch Full timeWe are hiring for Site Reliability EngineerSkills :- Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Watson Search Partner Full timeAbout the SRE :Site reliability engineers (SREs) combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges that can hamper reliability, performance and availability of critical platform services and applications. SRE builds solutions (Process + tools) to...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India Flairchase Full timeJob Description : Responsibilities :- Design, build, and ship the systems and infrastructure that power our applications.- Design and development of new features.- Minimizing and hardening microservices and public-facing API gateway attack surface.- Continuous delivery using tools such as Tekton, Travis, Jenkins, Ansible, and Kubernetes.- Observability,...
-
Cloud Engineer
3 weeks ago
Bangalore, Karnataka, India Sampoorna Consultants Pvt. Ltd Full timeJob Description : The Cloud Engineer role is a hands-on role primarily responsible for the deployment, monitoring and uptime/SLA of a multi-tenanted, large scale SaaS Service.This role also has a Site Reliability Engineer (SRE) function to it, requiring an ability to use automation to push updates at scale to the SaaS service and perform automated...
-
Lead Platform Engineer
3 weeks ago
Bangalore, Karnataka, India Coders Brain Technology Private Limited Full timeRole : Lead Platform Engineer (Azure Kubernetes Services)Seeking a seasoned Lead Platform Engineer to drive the maintenance and expansion of our Azure Kubernetes Services (AKS) environment. This critical role is central to our IT services and cloud platform strategy, overseeing the architecture, design, development, implementation, and on-call production...
-
Site Reliability Engineer
3 weeks ago
Bangalore, Karnataka, India SMARTWORK IT SERVICES Full timeRole and Responsibilities :- Maintain and scale production services and servers across multiple data centers for complex, data-intensive cloud services.- Improve scalability, service reliability, capacity, and performance of cloud infrastructure.- Write automation code for provisioning and operating infrastructure at a massive scale.- Work with development...
-
Bangalore, Karnataka, India Cricbuzz Full timeAbout the job :We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services.Experience : 3 - 5 yearsResponsibilities :- Design, implement,...
-
Bangalore, Karnataka, India Lytx, Inc Full timeJob Description :Why Lytx : Join our dynamic and passionate team of driven, low-ego engineers who are at the forefront of designing and supporting cutting-edge IoT infrastructure. As we rapidly grow and transition to the cloud, we're diving into the exciting realms of "Operations as Code," "Infrastructure as Code," and innovative infrastructure...