SRE Consultant
5 hours ago
Mandatory skills : AWS, Microsoft Azure, Iac, Sre, Site Reliability Engineering, Cloud Operations, software development, Golang, Ruby, Ruby Rails, automation, Cloud Infrastructure.
SRE Consultant Job Description Overview The Site Reliability Engineer (SRE) Consultant plays a critical role in enhancing the reliability and performance of the organization's software systems and services. This position is pivotal in bridging the gap between development and operations, ensuring seamless integration and deployment of applications. The SRE Consultant will utilize their expertise in software engineering and systems administration to build scalable and reliable systems. Their focus will be on improving the uptime and overall reliability of services, proactively identifying potential points of failure, and implementing effective solutions. By leveraging automation and monitoring tools, the SRE Consultant will work to optimize the existing infrastructure while cultivating a culture of operational excellence within the organization. This role is vital for driving customer satisfaction through efficient systems and reliable service delivery. Key Responsibilities
- Design and implement scalable and reliable infrastructure solutions.
- Develop and maintain tools for deployment, monitoring, and operations.
- Manage on-call operations to ensure quick recovery from system outages.
- Collaborate with development teams to ensure reliable feature deployments.
- Identify and resolve performance bottlenecks in systems and applications.
- Establish and enhance service-level objectives (SLOs) and indicators (SLIs).
- Continuously improve monitoring and alerting strategies to minimize downtime.
- Conduct root cause analysis for incidents and implement preventive measures.
- Automate manual processes to increase reliability and efficiency.
- Implement CI/CD pipelines to facilitate rapid and reliable code deployments.
- Optimize resource utilization and capacity planning to manage loads effectively.
- Perform regular system assessments to identify vulnerabilities and required improvements.
- Provide training and mentorship to junior team members on SRE best practices.
- Document operational procedures and system designs for future reference.
- Stay updated with the latest industry trends and technologies to recommend improvements.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Minimum of 5 years of experience in site reliability engineering or a related role.
- Proficiency in cloud platforms like AWS, Google Cloud, or Azure.
- Experience with container orchestration systems such as Kubernetes or Docker.
- Strong skills in scripting languages like Python, Bash, or Go.
- In-depth knowledge of networking protocols and services.
- Experience with monitoring and logging tools such as Prometheus, Grafana, or ELK stack.
- Familiarity with configuration management tools like Terraform, Ansible, or Chef.
- Strong understanding of CI/CD processes and tools like Jenkins or GitLab CI.
- Proven experience in incident management and response best practices.
- Ability to analyze system performance and perform data-driven optimizations.
- Excellent problem-solving skills and ability to work under pressure.
- Strong communication skills for collaboration across teams.
- Knowledge of Agile methodologies and DevOps principles.
- Relevant certifications (AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer, etc.) are a plus.
-
SRE Consultant
2 weeks ago
Bengaluru, Karnataka, India Cigres Technologies Full time ₹ 10,00,000 - ₹ 25,00,000 per yearBengaluru, Karnataka, IndiaJob TypeFull TimeAbout the RoleDesign and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence.Proactively maintain services once they are live by measuring and...
-
Azure AI_ML SRE
2 weeks ago
Bengaluru, Karnataka, India NTT DATA Global Delivery Services Ltd Full time ₹ 20,00,000 - ₹ 25,00,000 per yearAzure AI_ML SRE Req ID: 338862 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Azure AI_ML SRE to join our team in Bengaluru, Karnātaka (IN-KA), India (IN). Job...
-
Linux SRE Engineer
12 hours ago
Bengaluru, Karnataka, India Central Business Solutions Full time ₹ 12,00,000 - ₹ 24,00,000 per yearThe Enterprise Computing (EC) Core Infrastructure Services organization is looking for a Site Reliability Engineering to manage the operations, reliability and services for Morgan Stanley's suite of Software Distribution product ecosystem products that are part of Artifact Curation and Distribution Control squad. This squad is responsible for providing...
-
SRE – Cloud Security and Observability
2 days ago
Bengaluru, Karnataka, India RapidCircle Advisory Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMaking a difference and driving positive change is what we do every day at Rapid Circle. Our Cloud Pioneers help our clients in their digital transformation. Are you someone who goes for constant, positive change? Then this vacancy is for youAs a Cloud Pioneer at Rapid Circle, you will work with our customers on different projects. For example, making impact...
-
DevOps Consultant
4 hours ago
Bengaluru, Karnataka, India MHP – A Porsche Company Full time ₹ 20,00,000 - ₹ 25,00,000 per yearMHP India is seeking a Senior Consultant, DevOps & SRE to join our team. In this role, you will be responsible for designing and building mission-critical infrastructure on public and private cloud platforms, as well as developing the tools and processes necessary to ensure the highest levels of availability and reliability for our clients. You'll work with...
-
DevOps SRE Consultant
2 weeks ago
Bengaluru, Karnataka, India NTT DATA, Inc. Full time ₹ 15,00,000 - ₹ 25,00,000 per yearDevelop W365 monitoring, alerting and telemetry3+ years of experience in a DevOps/Site Reliability Engineering role or similarSolid experience of version control, continuous integration, deployment, and configuration management toolsExperience developing, configuring and working with Azure Services, AVD/Windows 365
-
Postgres DBA cum SRE
1 week ago
Bengaluru, Karnataka, India NTT DATA Full time ₹ 1,50,000 - ₹ 2,00,000 per yearReq ID: 336226NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Postgres DBA cum SRE to join our team in Bangalore, Karnātaka (IN-KA), India (IN). Position is for a...
-
SRE Specialist
2 days ago
Bengaluru, Karnataka, India Capco Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Senior Data Engineer/DeveloperNumber of Positions: 2Job Description:The Senior Data Engineer will be responsible for designing, developing, and maintaining scalable data pipelines and building out new API integrations to support continuing increases in data volume and complexity. They will collaborate with analytics and business teams to improve...
-
Manager - JAVA
15 hours ago
Bengaluru, Karnataka, India KPMG Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJOB DESCRIPTION About KPMG in India KPMG entities in India are professional services firm(s). These Indian member firms are affiliated with KPMG International Limited. KPMG was established in India in August 1993. Our professionals leverage the global network of firms, and are conversant with local laws, regulations, markets and competition. KPMG has...
-
AWS Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India AKSHAYA BUSINESS IT SOLUTIONS PRIVATE LIMITED Full time ₹ 15,00,000 - ₹ 25,00,000 per yearDescription : Role Overview : As an AWS SRE, youll leverage DevOps and SRE best practices to build, automate, and maintain scalable, reliable cloud infrastructure. Your focus will be on elevating system performance, observability, and incident response while fostering operational excellence.Key Responsibilities : - Define, monitor, and uphold...