Senior Site Reliability Engineer
4 weeks ago
Strong background in software development and systems administration, as well as excellent problem-solving and communication skills. Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation Performing root cause analysis of production incidents and implementing preventive measures Strong background in software development and systems administration, as well as excellent problem-solving and communication skills. Run the production environment by monitoring availability and taking a holistic view of system health. Developing, improving, and operating the deployment and orchestration of a complex distributed system Improve reliability, quality, and time-to-market of our suite of software solutions Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve Provide primary operational and engineering Support for multiple large, distributed software applications Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation Collaborating with development teams to design, build, and operate scalable and resilient software systems Automating deployment, monitoring, and incident response processes Performing root cause analysis of production incidents and implementing preventive measures Conducting performance analysis and optimization of the system Ensuring compliance with security and regulatory standards Implementing and maintaining disaster recovery processes Providing technical guidance and mentorship to other team members Participating in an on-call rotation for incident response and support. 4 Year College Degree in Computer Science or Equivalent. 2-5 years’ experience with JAVA, J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8 in Maintenance and Development of multi-tier applications. Understanding of RESTful APIs and microservices platform 2-5 Years of experience with any of APM and other monitoring tools such as Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, PagerDuty. Strong experience with product & development teams to establish error budgets by identifying the right SLOs (Service level objective), SLIs (Service level indicators), KPIs (Key performance indicators) and effectively drive the use of the budget to ensure maximum domain availability/uptime. Regularly review key site technical metrics such as transactions errors, logging, response times, caching strategies, conversion/bounce rates, capacity & resource utilization. Proactively identify stability risks & work with engineering leadership to establish appropriate mitigation plans Experience in solving complex architecture/design & business problems, work to simplify, optimize, remove bottlenecks, etc. Architect, design & develop automation to reduce toil, improve recoverability, availability, latency & scalability of supported applications with understanding of MTTD (Mean Time to Detection) & MTTR (Mean Time to Resolution) Maintain knowledge repository that includes Standard operating procedure, Release checklists, Runbooks for incident recovery
-
Senior Site Reliability Engineer
2 weeks ago
india Next-Link Full timeJob Description Senior Site Reliability Engineer Desirable Skills:Experience with additional programming languages and technologies beyond Python and Ruby.Familiarity with cloud platforms such as AWS, Azure, or GCP.Proficiency in additional logging and monitoring tools.Experience with other Infrastructure as Code (IaC) tools and practices.Knowledge of...
-
Senior Site Reliability Engineer
1 week ago
india SWAI TECHNOLOGIES PRIVATE LIMITED Full timeRole : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description : Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite the...
-
Senior DevOps/Site Reliability Engineer
3 weeks ago
india RapidBraiins Full timeJob Description : We are seeking a highly skilled and experienced Senior DevOps Site Reliability Engineer to join our dynamic team. The ideal candidate will have a proven track record of success in DevOps, Site Reliability Engineering (SRE), or development roles within SaaS-based or enterprise applications. As a Senior DevOps SRE Engineer, you will play a...
-
Senior Site Reliability Engineer
5 days ago
india Boomi Full timeAbout Boomi and What Makes Us Special Are you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with...
-
Site Reliability Engineering Manager
5 days ago
india First American (India) Full timeThe Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...
-
Site Reliability Engineer
3 weeks ago
india Cricbuzz.com Full timeSite Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 3 - 5 years Responsibilities: ●...
-
Senior Site Reliability Engineer
3 weeks ago
india Akamai Full timeAre you passionate about improving business processes? Do you enjoy working with a diverse multi-national team of engineering talents? Join our highly skilled Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating and managing...
-
Senior Site Reliability Engineer
2 weeks ago
india Coforge Full timeQualifications : Experience in a DevOps / Site Reliability Engineer ( SRE ) position, dedicated to ensuring the high availability, reliability, and scalability of live systems. Proficient in observability tools like Prometheus, ELK stack, Grafana, and Azure Monitor, capable of fully managing the suite for optimal system oversight. Skilled in operating APM...
-
Site Reliability Engineer
2 weeks ago
india Korn Ferry Full timeRole - Site Reliability Engineer Exp - 5+ years Required Location - Hyderabad ( Work from Office-Hybrid) Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely...
-
Site Reliability Engineer
4 weeks ago
india ViewSonic Full timeJob Requirements: Bachelor’s degree in computer science, Engineering, or a related field. 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. Strong understanding of Platform Engineering concepts and principles. Experience...
-
Site Reliability Engineer
2 weeks ago
india SID Global Solutions Full timeDear Candidates, We are looking for immediate joiners 8 to 9 years for Hyderabad Location for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes , send me your updated cv : Please...
-
Site Reliability Engineer
1 month ago
india Quiktrak, LLC Full timeJob Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...
-
Site Reliability Engineer
4 weeks ago
India System Soft Technologies Full timeTitle: Site Reliability Engineer100% REMOTEThe Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
4 weeks ago
india System Soft Technologies Full timeTitle: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...
-
Site Reliability Engineer
6 days ago
india Thoucentric Full timeJob Description Job Description:We are seeking a skilled and dedicated Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, performance, and scalability of our systems and applications. This role combines software development and systems engineering to build and run large-scale, distributed,...
-
Senior Site Reliability Engineer
4 weeks ago
india NVIDIA Full timeNVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s motivated by outstanding technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...
-
Senior Site Reliability Engineer
4 days ago
india Duck Creek Technologies Full timeWHO WE ARE Duck Creek Technologies is the intelligent solutions provider defining the future of the property and casualty (P&C) and general insurance industry. We are the platform upon which modern insurance systems are built, enabling the industry to capitalize on the power of the cloud to run agile, intelligent, and evergreen operations. Our modern...
-
Sr. Site Reliability Engineer
3 weeks ago
india Encora Inc. Full timeDescription Sr. Software Engineer (Site Reliability Engineer) Important Information Location: Ahmedabad Experience: 5+ years Job Mode: Full-time Work Mode: Remote Job Summary Working with DevOps SRE with good experience in Site Reliability Engineer. Responsibilities and Duties Design, implement, and maintain highly...
-
Site Reliability Engineer
4 days ago
india WaferWire Cloud Technologies Full timeRole: SRE (Site Reliability Engineer) Experience: 4+ Years About WaferWire Cloud Technologies: WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the...
-
Senior Lead Site Reliability Engineer
2 months ago
india Akamai Full timeDo you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges? Join our highly skilled Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining...