![PubMatic](https://contents.bebee.com/companies/in/pubmatic/avatar-A1BPA.png)
Site Reliability Engineer
1 week ago
PubMatic (Nasdaq:
PUBM) is an independent technology company maximizing customer value by delivering digital advertising's supply chain of the future.
PubMatic's sell-side platform empowers the world's leading digital content creators across the open internet to control access to their inventory and increase monetization by enabling marketers to drive return on investment and reach addressable audiences across ad formats and devices.
Since 2006, our infrastructure-driven approach has allowed for the efficient processing and utilization of data in real time.By delivering scalable and flexible programmatic innovation, we improve outcomes for our customers while championing a vibrant and transparent digital advertising supply chain.
As an SRE Engineer, you will be responsible for the Activate and Production Infrastructure. Your essential duties encompass ensuring the seamless operation and optimal performance of large-scale distributed software applications.Your role revolves around maintaining a robust and high-performing environment, contributing to the reliability of our services, and innovating solutions to guarantee 24/7 availability.
By leveraging your technical expertise and dedication, you contribute to maintaining a seamless experience for our users while upholding the highest standards of operational excellence.
Your specific responsibilities include:
Role and Responsibilities:
Monitoring and Alerting
Review existing and set up new monitoring tools and systems as needed to track system performance, key metrics.
Incident Management
Monitor the alerts and logs to promptly identify incidents or anomalies. Prioritize incidents based on severity and potential impact on stability and reliability. Engage in effective incident resolution, applying necessary fixes and mitigations to restore normal operations.
On-Call Responsibilities
Organize on-call schedules to ensure 24/7 coverage for incident response. Respond to alerts, troubleshoot issues, and coordinate with NOC and Engineering teams for incident resolution. Conduct post-incident reviews to identify root causes, learn from incidents, and implement preventive measures.
Automation and Tooling
Review pre-existing and build new automation scripts and tools as needed to streamline repetitive tasks, enhance efficiency, and reduce manual errors.
Performance Optimization
Analyze application performance using profiling and monitoring tools to identify bottlenecks and areas for improvement. Work on optimizations, infrastructure upgrades, and architectural improvements to enhance system performance and efficiency.
Capacity Planning and Scaling
Monitor resource utilization and trends to predict capacity needs and plan for scaling.
Scale resources, such as servers and databases, are based on usage patterns and anticipated growth to maintain performance and reliability.
Also, automate the entire sizing process.Disaster Recovery and Redundancy
Develop and maintain disaster recovery plans and procedures to ensure business continuity in case of failures or disasters. Implement redundancy and failover strategies to minimize downtime and maintain service availability during failures.
Knowledge Sharing and Documentation
Create and maintain comprehensive documentation for configurations, procedures, incidents, and best practices. Foster a culture of knowledge sharing within the team, conducting regular knowledge-sharing sessions and training programs.
Feedback Loop and Continuous Improvement
Collect feedback from incidents, post-mortems, and NOC/Dev team interactions to identify areas for improvement. Continuously iterate on processes, tools, and systems based on feedback and lessons learned to drive continuous improvement.
Collaboration and Communication
Collaborate closely with Engineering and DC/NOC teams to align goals and priorities. Ensure open and transparent communication within the team and with stakeholders, providing regular updates on incidents, progress, and initiatives.
Required Skills and QualificationsBachelor's degree in computer science or related disciplinesTotal 3+ years' experience in software application/product supportAbility to program using programming languages like Go, Scripting languages like Shell or PythonGood to have prior experience in technical engineeringA proactive approach to identify the problems, performance bottlenecks, and areas of improvementMust know, Networking, Database (MySQL) and Linux System concepts, Debugging and analyzing the core dumpsHands-on experience with monitoring and observability tools like Grafana, Nagios, Influx, ELK, etc.
Familiarity with orchestration tools like Docker and Grafana and incident management systems like ZendutyExcellent communication and collaboration skills, with the ability to work effectively across teams.
Self-motivated and positive mindset to examine any incidents#LI-DNIReturn to Office:
PubMatic employees throughout the global have returned to our offices via a hybrid work schedule (3 days "in office" and 2 days "working remotely") that is intended to maximize collaboration, innovation, and productivity among teams and across functions.
Benefits:
Our benefits package includes the best of what leading organizations provide, such as stock options, paternity/maternity leave, healthcare insurance, broadband reimbursement.
As well, when we're back in the office, we all benefit from a kitchen loaded with healthy snacks and drinks and catered lunches and much moreDiversity and Inclusion:
PubMatic is proud to be an equal opportunity employer; we don't just value diversity, we promote and celebrate it.
We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Ensono Full timeAbout Us (Ensono) Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients' digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today's systems across any hybrid environment with services such as consulting, mainframe and...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Creospan Private Limited Full timeCreospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. "Tomorrow's ideas, built today" In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Creospan Private Limited Full timeCreospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. "Tomorrow's ideas, built today" In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Whizz HR Full timeJob Description :As a Site Reliability Engineer (SRE), you will have a key role in ensuring our systems and services run smoothly and efficiently. You will work with various teams to design, build, and maintain robust infrastructure and applications, contributing to top-notch services and exceptional user experiences.Key Focus Areas : System Architecture :...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India HCLSoftware Full timeThe Role:HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a newproduct that will help keep our customers' end points secure. You will be a part of a teamthat leverages modern technological solutions to drive growth and efficiency. Your dailyresponsibilities will be centered on HCL BigFix's cloud infrastructure, with daily...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Thinkproject Full timeWant to work in a workplace built on mutual trust and respect? How about having the flexibility to balance work with your life? A career with Thinkproject could be the perfect fit for you. What is our Focus? Thinkproject is a top player in digital tools for construction firms in Europe. In the past, construction companies relied on manual paperwork for their...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Arista Networks Full timeSite Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery.They develop software solutions to enhance, harden and support our service delivery processes.This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India HCLSoftware Full timeThe Role:HCL BigFixis looking for aSite Reliability Engineerto work on infrastructure for a new product that will help keep our customers' end points secure. You will be a part of a team that leverages modern technological solutions to drive growth and efficiency. Your daily responsibilities will be centered on HCL BigFix's cloud infrastructure, with daily...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India HCLSoftware Full timeThe Role:HCL Big Fix is looking for a Site Reliability Engineer to work on infrastructure for a new product that will help keep our customers' end points secure.You will be a part of a team that leverages modern technological solutions to drive growth and efficiency.Your daily responsibilities will be centered on HCL Big Fix's cloud infrastructure, with...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Arista Networks Full timeSite Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery.They develop software solutions to enhance, harden and support our service delivery processes.This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India GfK Full timeDescription About You You are a DevOps or Site Reliability Engineer with a passion for cloud infrastructure and automation. You're a self-starter and you love keeping up to date with the latest developments in cloud, configuration management and container technologies. You understand the benefits of an immutable infrastructure and you enjoy enabling...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Global Payments Full timeEvery day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing...
-
Site reliability engineer
1 week ago
Pune, Maharashtra, India Roche Full timeThe Position KEY ROLES & RESPONSIBILITIES (required): Responsibilities: Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems. Design and implement SRE practices that align with the company's overall reliability and performance goals. Develop and...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Arista Networks Full timeSite Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Zensar Technologies Full timeSite Reliability Engineer (SRE) will focus on Scalability, High Availability, Performance, Stability and Reliability of Software Applications. SRE will build automations to simplify operations and processes, collaborate with cross-functional teams to create proactive engineering mechanisms and ensure positive end user experiences. SRE with a good...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India FIS Global Full timePosition Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Associate's Degree Travel Percentage : 0%Site Reliability Engineer (SRE)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in financial services and...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Jobs for Humanity Full timeJob Description Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Associate's Degree Travel Percentage : 0%Site Reliability Engineer (SRE) Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India FIS Full timePosition Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Associate's Degree Travel Percentage : 0% Site Reliability Engineer (SRE) Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in financial...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India TechVerito Full timeAs a Site Reliability Engineer, you will be involved in exciting technical challenges by analyzing, troubleshooting, and designing vital services, platforms, and infrastructure while always thinking about reliability, scalability, resilience, security, and performance.Responsibilities:Owning Infra architecture and non-functional requirements, ensuring they...
-
Site Reliability Engineer
1 week ago
Pune, Maharashtra, India Etraveli Group Full timeEtraveli is one of the leading global flight centric Online Travel Agencies (OTAs) with €4bn+in annual gross sales. We also operate , the #1 meta searcher in Sweden and Tripstack, the independent B2B arm of the group offering a variety of complex technology solutions. Our diverse, dynamically growing team of 1000+ talented professionals is always on the...