Senior Databricks Administrator
1 day ago
Sonatype is the software supply chain security company. We provide the world's best end-to-end software supply chain security solution, combining the only proactive protection against malicious open source, the only enterprise grade SBOM management and the leading open source dependency management platform. This empowers enterprises to create and maintain secure, quality, and innovative software at scale.
As founders of Nexus Repository and stewards of Maven Central, the world's largest repository of Java open-source software, we are software pioneers and our open source expertise is unmatched. We empower innovation with an unparalleled commitment to build faster, safer software and harness AI and data intelligence to mitigate risk, maximize efficiencies, and drive powerful software development.
More than 2,000 organizations, including 70% of the Fortune 100 and 15 million software developers, rely on Sonatype to optimize their software supply chains.
Role Summary
- The Databricks Administrator will be responsible for the overall health, security, and performance of the Databricks platform. This includes managing user access, implementing and enforcing data governance policies, optimizing cluster resources, and ensuring data sensitivity policies are effectively applied across the data lakehouse. The administrator will also be crucial in identifying, reporting, and resolving discrepancies within the platform's operation and configuration.
Key Responsibilities
- User Provisioning and Management:
- Onboard and offboard users, groups, and service principals within Databricks, including integration with identity providers (IdPs) like Azure Active Directory or Okta via SCIM.
- Manage user roles and entitlements at both the account and workspace levels (Account Admins, Workspace Admins, Metastore Admins, etc.).
- Implement and maintain role-based access control (RBAC) and attribute-based access control (ABAC) to ensure appropriate data and resource access.
- Data Lake Governance (Unity Catalog focus):
- Configure and manage Unity Catalog metastores, catalogs, schemas, and tables.
- Define and enforce data access policies (e.g., table-level, column-level, row-level security) using Unity Catalog.
- Manage data lineage and auditing capabilities to track data flow and usage.
- Collaborate with data owners and stakeholders to define data quality standards and ensure data integrity.Implement data retention and lifecycle management policies.
- Aligning Data Sensitivity Policy to Enforceable Data Governance:
- Translate organizational data classification and sensitivity policies into technical controls within Databricks.
- Utilize features like data masking and encryption to protect sensitive information.
- Ensure compliance with regulatory requirements (e.g., GDPR, HIPAA, CCPA) by implementing appropriate security measures.
- Conduct regular security audits and vulnerability assessments.
- Managing Cluster and Budget Policies:
- Define and implement compute policies to control cluster creation, configuration, and resource usage, ensuring cost optimization.
- Monitor and manage serverless budget policies to attribute usage to specific teams or projects.
- Optimize cluster configurations for performance and cost-effectiveness, leveraging features like auto-scaling and auto-termination.
- Manage cluster pools to reduce startup times and improve resource allocation.
- Reporting and Addressing Discrepancies:
- Monitor Databricks platform health, performance, and resource utilization.
- Identify and troubleshoot issues related to user access, data availability, cluster performance, and policy violations.
- Generate reports on platform usage, costs, security incidents, and compliance.
- Investigate and resolve discrepancies in data, reports, or system behavior in collaboration with data engineers, data scientists, and other teams.
- Develop and maintain comprehensive documentation of configurations, procedures, and best practices.
- Collaboration and Support:
- Provide technical support and guidance to Databricks users, data engineers, and data scientists.
- Collaborate with cloud infrastructure teams (AWS, Azure, GCP) to manage underlying cloud resources.
- Stay up-to-date with the latest Databricks features, best practices, and industry trends.
Technical Skills
- Databricks Platform Expertise:
- Deep understanding of Databricks architecture, workspaces, and key components (Unity Catalog, Delta Lake, Spark, SQL Analytics).Proficiency in Databricks administration console and APIs.
- Experience with Databricks Workflows, Jobs, and Delta Live Tables (DLT) for orchestration and pipeline management.
- Cloud Platform Knowledge:
- Strong experience with AWS and its relevant services.
- Data Governance & Security:
- Solid understanding of data governance principles, data classification, and data lifecycle management.
- Experience implementing security controls, access policies (RBAC), and encryption.
- Familiarity with compliance standards (GDPR, HIPAA, CCPA) and auditing practices.
- Programming & Scripting:
- Proficiency in SQL for data querying and access control.
- Deep expertise in Terraform is essential, extending beyond basic knowledge to managing complex, multi-project infrastructure. This includes hands-on experience with custom Terraform modules crucial for Data Mesh orchestration.
- Scripting skills (e.g., Python, Terraform) for automation and administrative tasks.
- Familiarity with Spark and PySpark concepts for troubleshooting and optimization.
- Identity and Access Management (IAM):
- Experience with enterprise identity providers (e.g., Azure AD, Okta, Active Directory) and SCIM provisioning.
- Networking Concepts:
- Understanding of network security, VPNs, VPCS, private links, VPC peering, and connectivity within cloud environments.
- Monitoring & Logging Tools:
- Experience with monitoring tools (e.g., Datadog, Observe, cloud-native monitoring) for platform health and performance.
Soft Skills
- Problem-Solving and Troubleshooting: Ability to diagnose and resolve complex technical issues efficiently.
- Communication: Excellent verbal and written communication skills to interact with technical and non-technical stakeholders.
- Attention to Detail: Meticulous in configuring policies, managing access, and ensuring data integrity.
- Proactive and Self-Driven: Ability to anticipate issues, recommend solutions, and continuously improve the platform.
- Collaboration: Work effectively with cross-functional teams (data engineers, data scientists, security teams).
- Analytical Thinking: Ability to analyze data and system logs to identify trends and discrepancies.
At Sonatype, we value diversity and inclusivity. We offer perks such as parental leave, diversity and inclusion working groups, and flexible working practices to allow our employees to show up as their whole selves. We are an equal-opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. If you have a disability or special need that requires accommodation, please do not hesitate to let us know.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
-
Senior Databricks Administrator
1 week ago
Hyderabad, Telangana, India Sonatype Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescriptionKey Responsibilities :User Provisioning And ManagementOnboard and offboard users, groups, and service principals within Databricks, including integration with identity providers (IdPs) like Azure Active Directory or Okta via SCIM.Manage user roles and entitlements at both the account and workspace levels (Account Admins, Workspace Admins,...
-
Senior Databricks Engineer
1 week ago
Hyderabad, Telangana, India SID Information Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per yearRole: Senior Data Engineer (Python, Spark/Databricks, SQL, AWS)Experience:6–12 yearsLocation:HyderabadWork Mode:Hybrid (3 days/week in-office)Join Time:ImmediateMust-Have Technical Skills:Strong programming skills in Python, DatabricksHands-on experience with Apache Spark & Databricks for big data processing on AWS cloudProficiency with AWS services ...
-
Hyderabad, Telangana, India EverestDX Inc Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a Cloud Databricks Administrator to join our Cloud Engineering team.In this role, you will monitor and maintain Databricks jobs, troubleshoot issues, ensure platform reliability, and manage access controls.You will play a key part in operational support, defect management, and proactive issue resolution to keep our data pipelines running...
-
Databricks with Python
1 week ago
Hyderabad, Telangana, India Tata Consultancy Services Full time ₹ 15,00,000 - ₹ 25,00,000 per yearRole - Databricks with PythonExp - 5 to 9 YrsLocation - HyderabadNote - This is a scheduled drive. Only eligible candidates who receive invite from us will be allowed inside Campus.Job DescriptionExtensive expertise in designing and implementing data load processes using Azure Data Factory, Azure Databricks, Delta Lake, Azure Delta Lake Storage and...
-
Senior Azure Databricks Engineer
1 week ago
Hyderabad, Telangana, India Innova AM Tech Full time ₹ 12,00,000 - ₹ 24,00,000 per yearAbout the Role We are seeking a highly skilled and experienced Senior Azure Databricks Engineer to join our dynamic data engineering team. As a Senior Azure Databricks Engineer, you will play a critical role in designing, developing, and implementing data solutions on the Azure Databricks platform. You will be responsible for building and maintaining...
-
aws databricks with devops
5 days ago
Hyderabad, Telangana, India Primus Global Technologies Full time ₹ 15,00,000 - ₹ 25,00,000 per yearType of RoleClient-facing roleMust to Have SkillSenior: 6+ YOE overall in DevOps2+ hands on databricksDevOps LeadDriving automationCI/CD pipelines optimising deploymentsInfra + code deploymentsCollab with data engineers across teamsAutomation process and DevOps solutionsGitHub and Databricks DevOps, AWSAzure DevOps/Integrated with GitHubControl
-
Alteryx Server Administrator
6 days ago
Hyderabad, Telangana, India JSS Pro Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe are hiring on behalf of our client in theITsector.Position: Senior Data Platform Administrator (Alteryx)Experience: 8+ yearsLocation: Hyderabad, IndiaEmail your CVs: Job Summary:We are seeking a highly skilled and certified Senior Data Platform Administratorwith extensive experience in administeringAlteryx Server,Power BI, andDatabricksplatforms. The...
-
Senior Databricks Engineer
1 week ago
Hyderabad, Telangana, India Xenon7 Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout us:Where elite tech talent meets world-class opportunitiesAt Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources...
-
Databricks MLOps Engineer
1 week ago
Hyderabad, Telangana, India Xenon7 Full time ₹ 15,00,000 - ₹ 30,00,000 per yearAbout us:Where elite tech talent meets world-class opportunitiesAt Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources...
-
Senior Administrator
1 day ago
Hyderabad, Telangana, India Primera Medical Technologies Full time ₹ 20,00,000 - ₹ 25,00,000 per yearWe are hiring Senior Administrator - EDM with 7+ years of experience.Role & responsibilitiesA solid understanding of healthcare and patient flow is essential to align with application and end-user workflows.Manage OnBase system administration responsibilities, which encompass installation, configuration, monitoring, maintenance support, and enhancements of...