
IT Manager
3 days ago
Job Description
Overview:
- Blue Yonder is the proven leader in artificial intelligence and machine learning (AI/ML)-driven supply chain and retail solutions for 4,000 of the world's leading retail, manufacturing, and logistics companies. Blue Yonder's world-class client brands include 75 of the top 100 retailers, 77 of the top 100 consumer goods companies, and 8 of the top 10 global 3PLs. Running Blue Yonder, you can plan to deliver.
- The Observability and Automation Manager will be responsible in building and managing enterprise-grade monitoring, observability, and automation frameworks.
- This role will be responsible for defining strategy, implementing tools, and driving adoption of observability and automation practices across infrastructure, applications, and business services. This role requires strong technical expertise in monitoring platforms, AIOps, automation frameworks, and a proven ability to collaborate across engineering, operations, and business teams.
Scope:
- Collaborate with Product owners, Engineering, and internal IT teams to achieve business objectives.
- Define and drive the observability and automation strategy aligned with business and IT objectives.
- Lead a team of engineers and specialists responsible for monitoring, observability, and automation initiatives.
- Partner with application, infrastructure, and DevOps teams to ensure observability and automation standards are adopted enterprise wide.
- Own the design, implementation, and operations of enterprise observability platforms (APM, log analytics, metrics, tracing, synthetic monitoring).
- Ensure end-to-end visibility of applications, infrastructure, and cloud environments to proactively detect and resolve issues.
- Define and implement IT automation and orchestration strategies across infrastructure and operations.
- Build and maintain automation frameworks (provisioning, remediation, workflows, runbooks, self-healing systems).
- Partner with ITSM and DevOps teams to automate incident, problem, and change management processes.
- Continuously identify opportunities to reduce manual effort, improve efficiency, and enhance service reliability.
- Work closely with business stakeholders to ensure observability and automation meet compliance and security standards.
- Develop governance models and best practices for monitoring, alerting, and automation usage.
- Provide executive-level reporting on system reliability, performance trends, and automation outcomes.
- Manage the deploy and maintenance of Windows, Unix, Linux, VMware systems infrastructure in OnPrem and MS Azure.
- Establish KPIs, SLIs, SLOs, and dashboards to measure and report system reliability and performance.
- Works with senior leadership and Architecture and Engineering team in the planning, development, and execution of short term and long-term goals.
- Developing processes to streamline and drive team to automate routine tasks.
- Assisting in writing technical documentation and Work Instructions.
- Stay current with emerging technologies and trends in the IT industry and recommend innovative solutions to improve operational efficiency and effectiveness.
Our current technical environment:
- Operating System: Windows & Linux
- Hyper converged Environment: VMWare
- Programming languages: Python, PowerShell, and Shell scripting
- Cloud Architecture: MS Azure (Terraform, ARM templates, AKS, Virtual Networks, Azure AD)
- Configuration management tools: Ansible and Terraform
- DevOps Tools: GIT, GitLab/GitHub and Docker
- Storage : NetApp
What you'll do:
- Collaborate with Product owners, Engineering, and internal IT teams to achieve business objectives.
- Define and drive the observability and automation strategy aligned with business and IT objectives.
- Lead a team of engineers and specialists responsible for monitoring, observability, and automation initiatives.
- Partner with application, infrastructure, and DevOps teams to ensure observability and automation standards are adopted enterprise wide.
- Own the design, implementation, and operations of enterprise observability platforms (APM, log analytics, metrics, tracing, synthetic monitoring).
- Ensure end-to-end visibility of applications, infrastructure, and cloud environments to proactively detect and resolve issues.
- Define and implement IT automation and orchestration strategies across infrastructure and operations.
- Build and maintain automation frameworks (provisioning, remediation, workflows, runbooks, self-healing systems).
- Partner with ITSM and DevOps teams to automate incident, problem, and change management processes.
- Continuously identify opportunities to reduce manual effort, improve efficiency, and enhance service reliability.
- Work closely with business stakeholders to ensure observability and automation meet compliance and security standards.
- Develop governance models and best practices for monitoring, alerting, and automation usage.
- Provide executive-level reporting on system reliability, performance trends, and automation outcomes.
- Manage the deploy and maintenance of Windows, Unix, Linux, VMware systems infrastructure in OnPrem and MS Azure.
- Establish KPIs, SLIs, SLOs, and dashboards to measure and report system reliability and performance.
- Works with senior leadership and Architecture and Engineering team in the planning, development, and execution of short term and long-term goals.
- Developing processes to streamline and drive team to automate routine tasks.
What we are looking for:
- Bachelor's degree in computer science, MIS or engineering related field or equivalent work experience.
- 10+ years of combined related work experience and minimum of 5 years of experience observability, monitoring, or automation leadership roles.
- Strong hands-on knowledge of observability tools such as Datadog, Dynatrace, AppDynamics, Splunk, Elastic, Prometheus, Grafana.
- Expertise in automation tools/frameworks: Ansible, Terraform, Puppet, ServiceNow Orchestration, scripting (Python, PowerShell, Shell).
- Experience with cloud platforms preferably Azure and hybrid environments.
- Strong understanding of DevOps, SRE practices, CI/CD, and ITIL processes.
- Excellent leadership, communication, and global stakeholder management skills.
- Knowledge of Unix/Linux or Windows operating systems, VMware, Network, Backup and Storage experience with supporting and troubleshooting stability and performance issues.
- Demonstrated problem-solving and decision-making capabilities to meet the organizations developing needs and growth.
- Demonstrate agility and responsiveness.
- Ability to work in a fast-paced environment and meet tight periodic reporting deadlines.
- Ability to work under strict deadlines to meet or exceed team goals.
- Experience in Microsoft technologies - MS Azure, Azure AD & O365.
- Experience working with virtual and remote team members and stakeholders.
- Knowledge of Information Security regulations and compliance standards.
- Basic knowledge on Network switching, routing, firewalls and MPLS circuits
- Strong focus on people development.
- Strong technical experience with IT Infrastructure, systems administration, Platform Sizing, capacity planning and Infrastructure Cost Reduction
- Good Knowledge of ticketing tools like ServiceNow.
- Relevant certifications such as PMP, Six Sigma, and ITIL are preferred.
- Knowledge in Cloud Technologies - Private, Public, Hybrid, IaaS+, PaaS, SaaS
- Basic Knowledge of Palo Alto SDWAN &Prisma
Our Values
If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success - and the success of our customers. Does your heart beat like ours Find out here:
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
-
[Urgent] Account Manager
3 days ago
Bengaluru, India IT Company Full timeJob Description Company Description We suggest you enter details here. Role Description This is a full-time on-site role for an Account Manager, based in Bengaluru. The Account Manager will handle ongoing relationships with clients, ensuring their needs are met and surpassing their expectations. Daily tasks include managing client accounts, strategizing...
-
Project Manager
1 week ago
Hyderabad, Telangana, India Sage It Full time ₹ 1,04,000 - ₹ 1,30,878 per yearObjectives of this role• Build and develop the project team to ensure maximum performance, by providing purpose, direction, and motivation• Lead projects from requirements definition through deployment, identifying schedules, scopes, budget estimations, and implementation plans, including risk mitigation• Coordinate internal and external resources to...
-
Project Manager
15 hours ago
Hyderabad, India Sage It Full timeObjectives of this role • Build and develop the project team to ensure maximum performance, by providing purpose, direction, and motivation • Lead projects from requirements definition through deployment, identifying schedules, scopes, budget estimations, and implementation plans, including risk mitigation • Coordinate internal and external resources...
-
Facility Manager
2 weeks ago
India Inspiroz IT service pvt Full time ₹ 1,00,000 per yearDescription The Facilities Manager will oversee the operations and maintenance of our facilities. The ideal candidate will play a critical role in ensuring that our facilities are safe, functional, and conducive to our work environment. He will be responsible for managing the day-to-day operations, coordinating maintenance requests, and ensuring compliance...
-
Service Delivery Manager
6 days ago
Hyderabad, Telangana, India Sage It Full time ₹ 15,00,000 - ₹ 28,00,000 per yearRole & responsibilitiesWe are looking for a high-calibre, experienced Service Delivery Manager (SDM) to lead service operations and strategic delivery for the project. This role demands a proactive leader with a deep understanding of Network and Security Operations, strong stakeholder management abilities, and a proven history of end-to-end IT service...
-
Service Delivery Manager
16 hours ago
Hyderabad, India Sage It Full timeRole & responsibilities We are looking for a high-calibre, experienced Service Delivery Manager (SDM) to lead service operations and strategic delivery for the project. This role demands a proactive leader with a deep understanding of Network and Security Operations, strong stakeholder management abilities, and a proven history of end-to-end IT service...
-
IT Manager
5 days ago
Hyderabad, Telangana, India YO IT CONSULTING Full timeJob Title : IT ManagerExperience : 8 to 12 years Location : Hyderabad ( WHO 5 days working ) Job Description: Key Responsibilities :IT Operations & Team Leadership :- Lead and manage the local IT team across helpdesk, infrastructure, and systems functions.- Ensure 24/7 availability and performance of IT systems, including hardware, software, and networking.-...
-
India Flexing It® Full time ₹ 9,00,000 - ₹ 12,00,000 per yearOur client, a leading global FMCG organisation, is looking to engage with a Consultant: Project Manager for Global Health & Well-Being for 12 12-month remote project. The Project Manager will be responsible for providing professional management to complex projects aimed at improving the reliability of on-time delivery. This role involves the best utilization...
-
Senior Automation Engineer
2 weeks ago
India IT Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSenior Automation engineerExperience : 5 to 6 yrsLocation : Anywhere in IndiaJob Description :We are looking for a Senior Automation Engineer (5-6 yrs) with expertise in Python scripting, GitLab CI/CD, and Ansible. You will design and maintain CI/CD pipelines, automate infrastructure, and enhance deployment processes. Key Responsibilities :Develop and...
-
Linux Administrator
2 weeks ago
India IT Full time ₹ 9,00,000 - ₹ 12,00,000 per yearExperience : 2 to 4 years. Key Responsibilities :Administer and maintain RedHat Linux servers, ensuring high availability and performance. Install, configure, and troubleshoot RHEL-based systems and applications. Monitor system performance, security, and logs for efficient operation. Perform updates, patches, and security hardening as per best practices. ...