Site Reliability Engineer
7 days ago
At Urbint, our mission is to make communities more resilient. We do this by pairing external data with artificial intelligence to identify areas of high risk and prevent catastrophic loss for utilities across the country. We are a team of close-knit engineers, entrepreneurs, and data geeks who obsess over problem-solving, new technologies and making a positive impact in our communities.
Job Summary
We are seeking a Site Reliability Engineer to take charge of our servers, deployments and overall systems.
You will have a passion for the practical side of managing large, complex systems and services and planning for maximum uptime leveraging modern tools. Urbint has a mix of self-hosted services deployed within Google Cloud with most managed through Google Container Engine (Kubernetes) and a need to support on-premise deployments to address specific security postures of some clients.
What You'll Do
- Design High-Availability Systems - ensure that all of the systems that we deploy and depend on are configured to maintain full uptime. Planning out deployment strategies to ensure that uptime is maintained during upgrades and maintenance. Designing and building out an infrastructure-as-code project.
- Guiding Development Team with Best Practices - working with the Development team to ensure that the software being built will be practical to deploy and maintain.
- Maintaining System and Network Security - patch management, ensuring that dependencies are kept up to date. Staying informed about zero-day vulnerabilities and any risks that cannot be immediately patched and coming up with alternative methods to mitigate their risk.
- Logging, Metrics and Alerting - managing and organizing an on-call schedule through Pagerduty, connected to metrics and log events. On-call responsibilities will be shared.
- Build Engineering - managing build/deployment pipelines and ensuring best practices are followed in this.
Who You Are
- 2+ years of experience designing and maintaining application systems
- A friendly person first and a technologist second
A deep understanding of operating systems and computer architecture experience with:
Linux - at least 2 years
- GCP or AWS experience - at least 2 years
- Terraform - at least 2 years
- Kubernetes experience - at least 1 year
- Docker - at least 1 year
- Monitoring systems (Graphite/prometheus/grafana/statsd/DataDog…)
Strong shell scripting ability
Solid programming abilities - to help build any glue components between service
Ideally professional Python dev experience
Excellent communication and organizational skills a must
Benefits
- Mission Driven - Some companies use AI to serve better digital ads and trade stocks, we seek to make our communities safer and more resilient
- Competitive compensation package
- Generous Paid Time off, Paid Company Holidays including Mental Health Days
- Medical Insurance covering self, spouse, 2 children and parents/in-laws
- Hybrid work - Monday, Tuesday and Wednesday at office; Thursday and Friday at home
We're an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
-
Specialist - Site Reliability Engineer
1 day ago
Pune, Maharashtra, India Accelya Group Full time ₹ 20,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Specialist - Site Reliability Engineer
1 day ago
Pune, Maharashtra, India Accelya Group Full time ₹ 15,00,000 - ₹ 25,00,000 per yearFor more than 40 years, Accelya has been the industry's partner for change, simplifying airline financial and commercial processes and empowering the air transport community to take better control of the future. Whether partnering with IATA on industry-wide initiatives or enabling digital transformation to simplify airline processes, Accelya drives the...
-
Site Reliability Engineer
2 weeks ago
Pune, Maharashtra, India ENGEL Full time ₹ 6,00,000 - ₹ 18,00,000 per yearCompany DescriptionENGEL is a global leader in the production of injection moulding machines and their automation. The company produces systems that manufacture plastic parts used in various industries such as automotive, packaging, and consumer goods. With nine production plants worldwide and subsidiaries and representatives in over 85 countries, ENGEL...
-
Site Reliability Engineer
1 day ago
Pune, Maharashtra, India Idox Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSite Reliability Engineer (AWS)Pune, IndiaAbout the roleWe are seeking a driven and detail-oriented Site Reliability Engineer (SRE) with a strong passion for building resilient, scalable cloud infrastructure. This role offers an exciting opportunity for professionals with 2 to 4 years of experience in DevOps, Cloud, or Infrastructure to deepen their...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Reveille Technologies Full timeJob Summary :We are seeking a skilled and proactive Site Reliability Engineer (SRE) with a strong DevOps mindset and hands-on experience in application troubleshooting. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our applications and infrastructure. This role requires a blend of software engineering,...
-
Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India Allianz Full timeSite Reliability Engineer (SRE) - One Identity Access ManagementThe primary objective of the Site Reliability Engineer (SRE) specializing in One Identity Access Management is to ensure the seamless operation, reliability, and scalability of IAM systems within the organization.This role is critical in maintaining system integrity, optimizing performance, and...
-
Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India Uplers Full timeJob DescriptionMust have skills required :Azure DevOps, SRE concepts, TerraData, CDC, CDC tool, NEWRELGood to have skills :Aws cloudwatchReflections Info Systems (One of Uplers Clients) is Looking for:Site Reliability Engineer who is passionate about their work, eager to learn and grow, and who is committed to delivering exceptional results. If you are a...
-
Site Reliability Engineering
2 days ago
Pune, Maharashtra, India Deutsche Bank Full time ₹ 10,00,000 - ₹ 25,00,000 per yearSite Reliability Engineering (SRE) Lead, VPJob ID: R0402474Full/Part-Time: Full-timeRegular/Temporary: RegularListed: Location: PunePosition OverviewJob Title: Site Reliability Engineering (SRE) LeadCorporate Title: Vice PresidentLocation: Pune, IndiaRole DescriptionWe are seeking an experienced and highly capable Site Reliability Engineering (SRE) Lead to...
-
Site Reliability Engineer
3 weeks ago
Pune, Maharashtra, India LanceSoft, Inc Full timeRole and Responsibilities : Reporting to Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions, Payments, and Capital Markets business. In this role, the candidate will have the opportunity to make a lasting impact on the company's transformation journey, drive customer-centric...
-
- Site Reliability Engineer
4 weeks ago
Pune, Maharashtra, India ZOOP Full timeRole : Site Reliability Engineer. Location : Pune (on-site). Experience : 3+ years. Someone who has experience setting up an in-house monitoring platform with 99.99% uptime SLA using Victoria Metrics & Prometheus in Multi Region. Site Reliability Engineer Zoop. The Opportunity : We're seeking a Senior Site Reliability Engineer to elevate and standardize our...