Senior Site Reliability Engineer
3 weeks ago
The core premise of SRE is treating operational issues as a software problem. We code our way out of operational problems, addressing availability, scalability, latency, and efficiency challenges across our vast infrastructure.
Responsibilities:
- Design, develop, and implement software that improves the stability, scalability, availability, and latency of the products.
- Take ownership of one or more services and have the freedom to do what is best for our business and customers.
- Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again.
- Build effective monitoring to supervise the health of your system, and jump in to handle outages.
- Build and run capacity tests to manage the growth of your systems.
- Plan for reliability by designing systems to work across our multinational data centers.
- Develop tools that help the product development teams successfully deploy thousands of change sets every day.
- Be an advocate of standard engineering processes.
- Share the on-call rotation and be an escalation contact for incidents.
- Contribute to growth through interviewing, onboarding, or other tasks.
Requirements:
- 8 years of experience with building, operating, and maintaining sophisticated and scalable systems and with operations automation.
- Solid experience in at least one programming language. We use Java, Python, Go, Ruby, and Perl.
- Experience with Infrastructure as Code technologies.
- Knowledge of cloud computing fundamentals.
- Solid foundation in Linux administration and troubleshooting.
- Understanding of service-level agreements and objectives.
- Additional experience in OpenStack, Kubernetes, Networking, Security, or Storage is desirable.
- Experience with monitoring/observability technologies like Prometheus, Graphite, Grafana, Kibana, and Elasticsearch is a plus.
- Good interpersonal skills.
- Proficient command of the English language, both written and spoken.
Here are some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka, etc.
-
Senior Site Reliability Engineer
2 weeks ago
Bangalore Urban, Karnataka, India, IN GigSky Full time
We're Hiring: Site Reliability Engineer (5–10 Years Experience)
Location: Bangalore, India | Gigsky India Private Limited
Are you passionate about building resilient, scalable, and secure infrastructure? Gigsky is looking for a seasoned Site Reliability Engineer to join our Bangalore team and help drive operational excellence across our global platform....
-
Site Reliability Engineer
3 weeks ago
Bangalore Urban, Karnataka, India, IN Trantor Full time
Job Title - Site Reliability Engineer
Role - Contract (9 Months - Extendable)
Exp - 5+ years
Loc - Bangalore (Hybrid)
Notice - Immediate joiner only
Duties: Responsible for maintaining and scaling production services and servers across multiple datacenters for complex and data-intensive cloud services. Improve scalability, service reliability, capacity, and performance...
-
Site Reliability Engineer
2 weeks ago
India, IN Sonata Software Full time
We're Hiring: Senior Site Reliability Engineer
Location: Onsite (Office: Hyderabad – Mandatory from Day 1)
Employment Type: Full-time
Notice Period: Immediate to 15 Days Only
Experience: 8+ Years
About the Role
We’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...
-
Senior Site Reliability Engineer- ELK Expert
3 weeks ago
India, IN iVedha Inc. Full time
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice
Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.
Role Summary:
Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE with 7+...
-
Senior Site Reliability Engineer
3 weeks ago
India, IN Sapaad Full time
WHO WE ARE
Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries, with many more coming onboard each day. Driven by a...
-
Site Reliability Engineer II
3 weeks ago
Bangalore Urban, Karnataka, India, IN RecRoots Full time
Key Job Responsibilities and Duties:
The core premise for the SRE lies in treating operational issues as a software problem. We code our way out of problems where operations are concerned, addressing availability, scalability, latency, and efficiency challenges within the vast infrastructure here. You will impact millions of people all over the globe with your...
-
Senior Performance Engineer
3 weeks ago
Bangalore Urban, Karnataka, India, IN Genesis Global Full time
Job Title: Senior Performance Test Engineer
Job Summary
The Senior Performance Test Engineer is responsible for planning, designing, and executing performance tests to ensure that the platform and the applications meet the required performance criteria. This role involves collaborating with development teams,...
-
Senior Java Development Engineer
2 weeks ago
Bangalore Urban, Karnataka, India, IN Trellix Full time
As a Senior Software Development Engineer, you will contribute to the design and development of Trellix’s advanced email security and threat management product suite. You will take ownership of key product areas, working on both new feature development and the maintenance/redesign of existing components. This role involves collaboration with product and...
-
Senior Data Engineer
3 weeks ago
Bangalore Urban, Karnataka, India, IN USEReady Full time
Job Title: Senior Databricks Engineer
Experience Level: 5-8 Years
Job Summary
As a Senior Databricks Engineer, you will be responsible for designing, developing, and optimizing our data architecture and pipelines on the Databricks Lakehouse Platform. You will leverage your deep expertise in Spark, Delta Lake, and cloud technologies to build scalable and...
-
Software Engineer, Site Reliability Engineering
2 weeks ago
India, IN Ecoh Full time
Minimum qualifications:
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
1 year of experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or Open Source projects.
Strong problem-solving and analytical...