
Lead Cloud Data Engineer
3 days ago
We are seeking a highly skilled Python Platform Engineer to lead our architecture and performance efforts. As a key member of our team, you will be responsible for designing and building a shared component library/SDK for pipelines, as well as defining patterns/templates for Apache Beam pipelines and Databricks jobs.
- Key Responsibilities:
- Architecture and Reuse:
- Design a shared component library/SDK for ingestion, parsing/OCR, extraction, validation, enrichment, and publishing.
- Define patterns/templates for Apache Beam pipelines and Databricks jobs, standardizing configuration, packaging, versioning, CI/CD, and documentation.
- Create pluggable interfaces for multiple teams to swap extractors (Regex/LLM), OCR providers, and EMR publishers without code rewrites.
- Develop a repository strategy with shared/child repositories for each use case.
- Performance and Reliability:
- Profiling and tuning: cProfile/py-spy/line_profiler, memory (tracemalloc), CPU vs I/O analysis.
- Instrument services with Elastic APM and correlate traces/metrics with Splunk logs; build dashboards and runbooks.
- Implement concurrency best practices: asyncio for I/O-bound, ThreadPool/ProcessPool for CPU-bound, batching, rate limiting, retries, etc.
- Implement robust LLM API rate limiting/governance: enforce provider TPM and concurrency caps, request queueing/token budgeting, and emit APM/Splunk metrics (throttle rate, queue depth, cost per job) with alerts.
- Establish Service Level Objectives (SLOs)/alerts for throughput, latency, error rates; set up Dead-Letter Queues (DLQs) and recovery patterns.
- Team Enablement:
- Mentor developers, lead design reviews, codify best practices, write clear documentation and examples.
- Partner with ML engineers on the future LLM/SLM path (evaluation harness, safety/PII, cost/perf).
- Architecture and Reuse:
Requirements
- 7+ years Python experience with strong depth in performance and concurrency (asyncio, concurrent.futures, multiprocessing), profiling and memory tuning.
- Observability expertise: Elastic APM instrumentation and dashboarding; Splunk for logs and correlation; OpenTelemetry familiarity.
- Mandatory experience with Large Language Model (LLM) based solutions and supporting them in production.
- API engineering for high-throughput integrations (REST, OAuth2), resilience patterns, and secure handling of sensitive data.
- Strong architecture/design skills: clean interfaces, packaging shared libraries, versioning, CI/CD (GitHub Actions/Azure DevOps), testing.
- 3+ years building large-scale data pipelines with Apache Beam and/or Spark, including hands-on Databricks experience (Jobs, Delta Lake, cluster tuning).
- Document processing: OCR (Tesseract, AWS Textract, Azure Form Recognizer), PDF parsing, text normalization.
- LLM/SLM integration experience (e.g., OpenAI/Azure AI, local SLMs), prompt/eval frameworks, PII redaction/guardrails.
- Cloud and tooling: AWS/Azure/GCP, Dataflow/Flink, Terraform, Docker; cost/performance tuning on Databricks.
- Security/compliance mindset (HIPAA), secrets management, least-privilege access.
What We Offer
- Competitive salary and benefits package
- Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
- Annual health check-ups
- Insurance coverage: group term life, personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents
A Message from Our Organization
We are committed to fostering diversity and inclusion in the workplace. We invite applications from all qualified individuals, including those with disabilities, and regardless of gender or gender preference. We welcome diverse candidates from all backgrounds.
- We support hybrid work and flexible hours to fit diverse lifestyles.
- Our office is accessibility-friendly, with ergonomic setups and assistive technologies to support employees with physical disabilities.
- If you are a person with disabilities and have specific requirements, please inform us during the application process or at any time during your employment.
-
Chief Cloud Data Engineer
3 days ago
Surat, Gujarat, India beBeeCloudMigration Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Senior Cloud ArchitectWe are seeking a highly experienced Senior Cloud Architect to spearhead large-scale cloud migration projects.The ideal candidate will possess a minimum of 10 years of experience in data engineering and ETL, with a strong background in migrating on-premise data systems to the cloud.Key Responsibilities:Design, develop, and deploy robust...
-
Cloud Native Data Engineer
23 hours ago
Surat, Gujarat, India beBeeData Full time ₹ 18,00,000 - ₹ 24,00,000Senior Data Solutions ArchitectWe are seeking a highly skilled Senior Data Solutions Architect to join our team.The successful candidate will be responsible for leading the design and implementation of our data warehouse solution, ensuring that it meets the business requirements and is scalable, secure, and efficient.A strong understanding of cloud-native...
-
Senior Data Engineer
3 days ago
Surat, Gujarat, India beBeeData Full time ₹ 18,00,000 - ₹ 20,00,000About the RoleWe are seeking a seasoned and proficient Senior Python Data Engineer with substantial experience in cloud technologies.As a pivotal member of our data engineering team, you will play a crucial role in designing, implementing, and optimizing data pipelines, ensuring seamless integration with cloud platforms.Data Pipeline Development:Design,...
-
Azure Data Engineering Lead
4 days ago
Surat, Gujarat, India beBeeDataEngineering Full time ₹ 15,00,000 - ₹ 25,00,000We are seeking a seasoned professional to assume the role of Azure Data Engineering Lead. As an integral member of our team, you will be responsible for designing and developing end-to-end data pipelines.Key responsibilities include:Developing scalable data solutions on AzureCollaborating with stakeholders to drive business insights and visualization...
-
Cloud Engineer with Scalable Data Solutions
5 days ago
Surat, Gujarat, India beBeeDataArchitect Full time US$ 10,00,000 - US$ 15,00,000Senior Data ArchitectWe are seeking a highly skilled Senior Data Architect to lead the design, development, and optimization of our data engineering architecture.The ideal candidate will have a deep understanding of cloud-native data engineering on Microsoft Azure, and experience with AWS and GCP.They will be responsible for building scalable data pipelines,...
-
Cloud Data Architect
3 days ago
Surat, Gujarat, India beBeeDataEngineer Full time ₹ 9,00,000 - ₹ 12,00,000Job DescriptionWe are seeking a skilled and motivated Data Engineer to design, develop, and maintain scalable data pipelines that process and transform large volumes of structured and unstructured data.The ideal candidate will have a strong background in data engineering, cloud technologies, and data architecture, and will collaborate closely with...
-
BFSI Cloud Data Engineer Position Available
2 days ago
Surat, Gujarat, India beBeeData Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Cloud Data Engineer for BFSI DomainWe are seeking a skilled Cloud Data Engineer to join our team in the BFSI domain. In this role, you will collaborate with data science teams to build scalable data pipelines and develop models that empower business decision-making through advanced analytics.Key Responsibilities:Collaborate with data science teams to...
-
Cloud Engineering Expert
3 days ago
Surat, Gujarat, India beBeeCloudEngineeringExpert Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Job Title: Cloud Engineering ExpertWe are seeking a skilled Cloud Engineer to join our team. As a Cloud Engineer, you will be responsible for designing, building, and maintaining scalable and secure cloud infrastructure.Key Responsibilities:Design and implement cloud-based systems and applications.Maintain and optimize cloud infrastructure for maximum...
-
Cloud Data Architect
4 days ago
Surat, Gujarat, India beBeeData Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Cloud Data ArchitectWe are seeking a seasoned Cloud Data Architect to design and build scalable data solutions on cloud platforms.About the Role:The ideal candidate will have 4+ years of experience in designing and maintaining cloud-based data architectures, including data lakes, lakehouses, and warehouses.Key Responsibilities:Build real-time and batch...
-
Senior Cloud Data Specialist
4 days ago
Surat, Gujarat, India beBeeDataEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000High-Performance Data EngineerWe are seeking a highly skilled data engineer with strong hands-on experience in Databricks and AWS to join our dynamic team.Key Responsibilities:Design, develop, and maintain scalable data pipelines using Databricks and PySpark.Implement and manage data solutions on AWS, including services like S3, Glue, Lambda, EMR, and...