Advanced Distributed Systems Architect

4 days ago


Salem, Tamil Nadu, India beBeeDistributed Full time US$ 1,00,000 - US$ 1,50,000
Lead Developer of Advanced Data Intelligence

Our organization is engaged in the creation of a cutting-edge data intelligence platform based on a microservices architecture. This system requires an expert with hands-on leadership experience to design and implement distributed graph computing solutions for processing massive amounts of entities and relationships.

The platform utilizes:

  • Microservices architecture with Kubernetes orchestration
  • Apache Spark for large-scale distributed processing
  • Elasticsearch for real-time search and fuzzy matching
  • Scala as the primary development language
  • Data mesh principles with API-first design

Key Responsibilities:

Entity Resolution Service

Develop efficient algorithms for distributed entity resolution, capable of handling billions of records.

Implement blocking strategies (e.g., LSH, canopy clustering) optimized for Apache Spark at scale.

Design and develop fuzzy matching algorithms leveraging Elasticsearch's capabilities.

Create ML-enhanced matching with explainable AI for match decisions.

Implement incremental resolution supporting real-time and batch modes.

Design APIs for entity lookup with sub-100ms latency requirements.

Network Generation Service

Architect distributed graph generation pipelines using GraphX/GraphFrames.

Implement graph analytics algorithms (PageRank, community detection, centrality measures).

Design storage strategies for multi-billion edge graphs in Parquet/distributed file systems.

Build temporal graph support for time-evolving networks.

Develop high-performance graph serving APIs with complex query capabilities.

Optimize graph partitioning to minimize shuffle and maximize locality.

AI Model Development

Build Graph Neural Networks (GNNs): Develop GNN models (e.g., GraphSAGE, GATv2) using PyTorch Geometric or DGL to analyze corporate and transaction networks, detecting fraud rings and risk patterns.

Implement Entity Resolution: Design algorithms for fuzzy matching, semantic matching (Sentence-BERT), and clustering to unify entities across heterogeneous data sources (e.g., CSVs, APIs, PDFs).

Develop Risk Scoring Models: Combine rule-based, supervised (XGBoost), and unsupervised (Isolation Forest) methods to generate composite risk scores, optimized for real-time and large data processing in trillions.

Advance Composite AI: Leverage our proprietary approach, integrating symbolic AI, vector embeddings, and graph AI for robust entity resolution and network analytics.

Explainable AI (XAI)

Champion Transparency: Integrate SHAP, LIME, and GNNExplainer to provide clear, interpretable explanations for model predictions, meeting regulatory and ethical standards.

Ensure Fairness: Audit models for bias and fairness, embedding ethical principles into every stage of development.

Cross-Service Responsibilities

Ensure seamless integration between entity resolution and network generation.

Design data lineage tracking across both services.

Implement comprehensive monitoring and observability.

Contribute to API design and service contracts.

Optimize for 10x scale growth.

Required Qualifications

7+ years of experience in large-scale distributed computing and big data systems.

5+ years specifically in entity resolution and graph analytics at scale.

Expert-level Scala programming skills.

Deep experience with Apache Spark, including custom optimizations.

Production experience with Elasticsearch for search and matching.

Domain Knowledge

Strong understanding of blocking algorithms and their trade-offs.

Experience with probabilistic record linkage and similarity measures.

Expertise in graph algorithms and their distributed implementations.

Knowledge of graph storage formats and query optimization.

Understanding of machine learning applications in entity resolution.

Systems Design

Experience designing microservices architectures.

Track record of building fault-tolerant, scalable systems.

API design experience with GraphQL or REST.

Performance optimization and capacity planning expertise.

Preferred Qualifications

PhD in Computer Science or related field with focus on graphs/entity resolution.

Contributions to open-source projects (especially Spark, GraphX, Elasticsearch).

Experience with graph databases (Neo4j, Neptune, JanusGraph) or equivalent.

Publications or conference talks on entity resolution or graph analytics.

Experience with real-time stream processing (Kafka, Spark Streaming).

Technical Environment

Languages: Scala (primary), Python, Java.

Big Data: Apache Spark 3.x, Hadoop ecosystem.

Search: Elasticsearch 8.x.

Orchestration: Kubernetes, Docker.

Storage: HDFS/S3/GCS, Parquet.

Monitoring: Prometheus, Grafana, Jaeger.

CI/CD: Modern DevOps practices.

What We're Looking For

Someone who thinks in distributed systems and can optimize for both latency and throughput.

A technical leader who can make architectural decisions and implement them.

Strong communicator who can explain complex graph concepts to stakeholders.

Self-directed engineer who can own large technical initiatives end-to-end.

Performance-obsessed developer who benchmarks everything.

Impact You'll Make

Define the architecture for entity resolution serving multiple business domains.

Build the graph intelligence layer powering advanced analytics and ML.

Create systems that will process billions of entities with millisecond latencies.

Establish best practices for graph computing in our organization.

Mentor other engineers on distributed graph algorithms.

Compensation & Benefits

Competitive senior/staff-level compensation.

Flexible remote work arrangements.

Latest hardware and cloud resources for development.

LTIP - Long term Incentive plan.

75% of base as Bonus payment at the end of 4th year in service.

Equity potential of upto in excess of USD 150K every year.

Interview Process

Technical screen focusing on distributed systems and graph algorithms.

System design session on entity resolution at scale.

Coding session implementing a graph algorithm in Scala.

Architecture discussion with the team.

Final round with leadership.

To Apply

Please include:

Links to relevant open-source contributions.

Brief description of the largest graph system you've built (nodes/edges scale).

Your approach to a specific entity resolution challenge you've solved.

Any publications or talks on graph computing or entity resolution.

We are developing a leading-edge data intelligence platform and need someone who can advance its capabilities in distributed graph computing.

],

  • Salem, Tamil Nadu, India beBeeDataEngineer Full time ₹ 18,00,000 - ₹ 25,00,000

    Job Role: Data EngineerAbout the JobThis is an exciting opportunity to join our organization as a Data Engineer. We are looking for a highly skilled professional with experience in designing, building, and maintaining large-scale data systems.Key ResponsibilitiesDesign and develop data pipelines using Trino (PrestoSQL)Optimize SQL queries for improved...


  • Salem, Tamil Nadu, India beBeeControl Full time ₹ 15,00,000 - ₹ 20,00,000

    Job Title: Control Systems SpecialistWe are seeking a skilled Control Systems Specialist to join our team. In this role, you will be responsible for executing the entire lifecycle of Distributed Control System (DCS) projects.Key Responsibilities:Requirements analysis, design, engineering, and commissioning of DCS systemsOn-site loop checking, system...


  • Salem, Tamil Nadu, India beBeeTechnical Full time ₹ 1,00,00,000 - ₹ 1,20,00,000

    Transform the Future of Fulfillment with UsWe develop leading-edge technology that gets the right product to the right place at the right time.Our AI-driven solutions specialize in bringing warehouses and distribution centers into the future. We're looking for modern minds to join us and solve today's fulfillment challenges.About Our SolutionsOur technology...


  • Salem, Tamil Nadu, India beBeeBlockchainDeveloper Full time ₹ 15,00,000 - ₹ 20,10,000

    Senior Blockchain DeveloperWe are seeking a highly skilled Senior Blockchain Developer to join our team. As a key member of our development team, you will design and build cutting-edge decentralized systems that scale globally.Key Responsibilities:Architect, develop, and deploy secure smart contracts (Solidity/Rust) for Ethereum, EVM-compatible chains, and...


  • Salem, Tamil Nadu, India beBeePricing Full time ₹ 20,00,000 - ₹ 35,00,000

    Job OverviewBusiness Requirements Specialist - Sales & Distribution (S&D) FunctionalityThis role demands deep expertise in ERP systems and hands-on experience in project execution across multiple modules.Key Responsibilities:Transition Planning & ExecutionDesign and implement processes for divestiture scenarios including customer/supplier master segregation,...

  • Oracle Systems Expert

    17 hours ago


    Salem, Tamil Nadu, India beBeeDatabase Full time ₹ 1,23,34,627 - ₹ 1,57,21,953

    Job DescriptionA Senior Oracle Exadata DBA will lead the implementation, configuration, and maintenance of Oracle RDBMS systems, including Exadata and Oracle Cloud Infrastructure (OCI). Their primary goal is to ensure availability and consistent performance across all environments.Key Responsibilities:Implement, configure, maintain, and perform critical...

  • System Integrator

    15 hours ago


    Salem, Tamil Nadu, India beBeeIntegration Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    Job DescriptionAbout this roleThis position is focused on designing and implementing enterprise integrations across supply chain, manufacturing, and industrial automation environments.Key ResponsibilitiesDesign, develop, and implement integration solutions between ERP, MES, supply chain platforms, and industrial automation systems.Apply standards such as...


  • Salem, Tamil Nadu, India beBeeArtificialintelligence Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    Senior AI SpecialistOur organization seeks an experienced AI/ML Expert to lead the development of advanced machine learning and deep learning models for business-specific problems.Design and develop custom machine learning and deep learning models for business-specific problems (NLP, computer vision, predictive modeling, recommendation systems, etc.).Build...


  • Salem, Tamil Nadu, India beBeeBackend Full time ₹ 18,00,000 - ₹ 25,00,000

    Backend EngineerWe are seeking a skilled Backend Engineer to design, build, and scale our applications using modern Java and Spring Boot.Main Responsibilities:Develop and Scale: Design, implement, and maintain scalable backend services and robust REST APIs using Java and Spring Boot.Data Driven: Optimize and manage our MySQL and PostgreSQL databases,...


  • Salem, Tamil Nadu, India beBeeCompliance Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    AI Solutions Architect — Expert in Building Industrial-Grade AIAs an expert in industrial-grade AI, you will be responsible for architecting systems that prevent accidents and ensure compliance.Key Responsibilities:Architect systems to prevent accidents and ensure compliance.Required Skills:Expertise in building industrial-grade AI.Strong understanding of...