Python Spark Developer
1 day ago
Job Summary
Synechron is seeking a skilled Python Spark Developer to design and optimize large-scale data pipelines and processing systems. The successful candidate will leverage expertise in Python and Apache Spark to build scalable, high-performance data workflows, supporting enterprise analytics, fraud detection, and real-time data applications. This role is instrumental in driving data architecture advancements, operational excellence, and delivering solutions aligned with business and technical standards.
Software Requirements
Required Skills:
3+ years of professional experience in Python development with a focus on data engineering and Big Data processing
Hands-on expertise with Apache Spark (preferably Spark 2.x or 3.x) in batch and streaming environments
Strong SQL skills with experience working with relational and distributed data systems (e.g., Hive, Snowflake, NoSQL databases)
Experience with data pipeline orchestration and management tools (e.g., Airflow, Jenkins, Git)
Solid understanding of software engineering principles, clean code practices, and design patterns
Familiarity with system design for scalable, data-intensive applications
Preferred Skills:
Exposure to cloud data platforms such as Snowflake, Databricks, AWS Glue, or GCP DataProc
Experience working with Kafka, Redis, or similar messaging systems
Knowledge of observability tools like OpenTelemetry, Grafana, Loki, Tempo
Understanding of containerization using Docker, orchestration with Kubernetes, and GitOps workflows
Overall Responsibilities
Design, develop, and optimize scalable data pipelines and workflows utilizing Python and Apache Spark
Build high-performance data processing applications emphasizing pushdown optimization, partitioning, clustering, and streaming
Integrate modern data platforms and tools into existing enterprise architectures for improved data accessibility and security
Engineer feature pipelines to support real-time fraud detection and other critical analytics systems
Define data models and processing strategies aligned with distributed architecture principles to ensure scalability and consistency
Develop solutions that are production-ready, maintainable, and feature observability and operational monitoring capabilities
Adhere to clean code standards, SOLID principles, and architecture best practices to enable extensibility and robustness
Participate in code reviews, testing, deployment, and performance tuning activities
Contribute to architectural governance, innovation initiatives, and continuous improvement efforts
Technical Skills (By Category)
Programming Languages:
Essential: Python (version 3.7+)
Preferred: Scala, Java for integration purposes
Frameworks & Libraries:
Essential: Apache Spark, Spark Streaming, Spark SQL, PySpark
Preferred: Kafka clients, Flink, or other streaming frameworks
Data & Databases:
Essential: SQL (PostgreSQL, MySQL), Spark dataframes, Hive, or similar distributed storage
Preferred: NoSQL databases (MongoDB, Cassandra), Data Lake architectures
Cloud & Infrastructure:
Preferred: Cloud platforms such as Snowflake, Databricks, AWS, or GCP
Experience with containerization: Docker, Kubernetes, Helm
Infrastructure automation: Terraform, CloudFormation (desirable)
DevOps & Monitoring:
Essential: CI/CD (Jenkins, GitHub Actions), observability tools (OpenTelemetry, Prometheus, Grafana)
Preferred: Log aggregation tools like Loki, Tempo; metrics collection
Experience Requirements
3+ years of hands-on experience developing data pipelines in Python with Apache Spark
Proven experience designing scalable, reliable ETL/ELT workflows in enterprise environments
Demonstrated ability to optimize Spark jobs for performance in batch and streaming scenarios
Experience working in distributed system architectures with a focus on data security and compliance
Background in financial, fraud detection, or data-intensive environments is preferred; relevant industry experience is desirable
Proven ability to collaborate across cross-functional teams and influence technical decision-making
Day-to-Day Activities
Develop and maintain large-scale data pipelines supporting enterprise analytics and real-time applications
Optimize Spark jobs and workflows for throughput, latency, and resource utilization
Implement pushdown optimizations, partitioning strategies, and clustering techniques to improve data processing efficiency
Collaborate with data architects, platform teams, and stakeholders to evaluate new tools and platforms for data solutions
Troubleshoot technical issues, resolve data pipeline failures, and improve system observability
Conduct code reviews and participate in agile planning, deployment, and operational activities
Document architecture, processes, and best practices to facilitate knowledge sharing and operational excellence
Stay current with industry trends, emerging tools, and best practices in big data engineering
Qualifications
Bachelor's or Master's degree in Computer Science, Software Engineering, Data Science, or related field
Additional certifications in Big Data, Spark, or cloud data services are a plus
Extensive hands-on experience developing large-scale data pipelines and processing solutions with Python and Apache Spark
Professional Competencies
Strong analytical and problem-solving skills for complex data workflows
Excellent collaboration and communication skills with technical and non-technical stakeholders
Ability to lead initiatives, influence best practices, and mentor junior engineers
Adaptability to evolving technologies and organizational needs
Focus on operational excellence, observability, and sustained performance
Commitment to continuous learning and process improvement
SYNECHRON'S DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative 'Same Difference' is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant's gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
Candidate Application Notice
-
Python Developer
3 days ago
Mumbai, Maharashtra, India ERGO Technology & Services Full time ₹ 5,00,000 - ₹ 12,00,000 per yearAbout ERGO Technologies and Services IndiaET&S India, a part of ERGO Technology & Services Management, is the latest addition as an IT outsourcing provider for ERGO Group Worldwide. Supported by ERGO Group, an 18 billion Euro organization operating in over 25 countries, ET&S India aims to offer technology services to the ERGO group. In the near future, ET&S...
-
Python Software Developer
2 days ago
Mumbai, Maharashtra, India Aquilai Solutions Full time ₹ 12,00,000 - ₹ 24,00,000 per yearAbout the Role:We are seeking a Python Developer with a strong background in security, data warehousing, and big data technologies to join our team in building an advanced open data platform. As part of this role, you will collaborate closely with the Security Architect, to design, develop, and implement security solutions, leveraging your expertise in data...
-
Mumbai, Maharashtra, India JPMorganChase Full time ₹ 60,000 - ₹ 1,20,000 per yearDescriptionBe part of a dynamic team where your distinctive skills will contribute to a winning culture and team. As a Data Engineer III at JPMorgan Chase within the Corporate Technology, you serve as a seasoned member of an agile team to design and deliver trusted data collection, storage, access, and analytics solutions in a secure, stable, and scalable...
-
Mumbai, Maharashtra, India JPMorganChase Full time ₹ 1,00,00,000 - ₹ 3,00,00,000 per yearJOB DESCRIPTIONBe part of a dynamic team where your distinctive skills will contribute to a winning culture and team.As a Data Engineer III at JPMorgan Chase within the Corporate Technology, you serve as a seasoned member of an agile team to design and deliver trusted data collection, storage, access, and analytics solutions in a secure, stable, and scalable...
-
Python Developer
4 days ago
Mumbai, Maharashtra, India Wama Technology Full time ₹ 15,00,000 - ₹ 16,00,000 per yearTitle: Python DeveloperLocation: Onsite – Mumbai, MaharashtraExperience: 4- 5 years in Python developmentJoining: ImmediateAbout the RoleWe are building cutting-edge AI products designed for enterprise-scale applications and arelooking for a Senior Python Developer to join our core engineering team. You will beresponsible for designing and delivering...
-
Spark Developer
2 weeks ago
Navi Mumbai, Maharashtra, India de08f93d-a340-40f8-a383-346ed8e74486 Full time ₹ 10,80,000 - ₹ 12,00,000 per yearLocation: Navi Mumbai (Work From Office)Employment Type: Full-timeExperience: 3–4 Years (PySpark Developer)We are looking for a skilled PySpark Developer with strong hands-on experience in Python, SQL, and cloud technologies to join our growing data engineering team in Navi Mumbai.Mandatory SkillsPySpark / Python scriptingSQLAWS Cloud...
-
Python Developers
2 weeks ago
Mumbai, Maharashtra, India Gray Matrix Full time ₹ 5,00,000 - ₹ 25,00,000 per yearApplication DevelopmentLocation : MumbaiVacancies : 1Roles OpenSoftware Engineer (SE) – Python: 2 to 4 yearsNote: These are typical experience ranges — but we're not rigid about years.If you've built deeply, thought critically, and solved meaningful problems — we're open to letting your work speak louder than your resume.What matters most to us is how...
-
Python Developer
3 days ago
Navi Mumbai, Maharashtra, India AgileEngine Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards. WHY JOIN US If you're looking for a place to grow, make an...
-
Spark Data Engineer
1 week ago
Mumbai, Maharashtra, India Mactores Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMactores is a trusted leader among businesses in providing modern data platform solutions. Since 2008, Mactores have been enabling businesses to accelerate their value through automation by providing End-to-End Data Solutions that are automated, agile, and secure. We collaborate with customers to strategize, navigate, and accelerate an ideal path forward...
-
Spark Data Engineer
2 weeks ago
Mumbai, Maharashtra, India Mactores Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMactores is a trusted leader among businesses in providing modern data platform solutions. Since 2008, Mactores have been enabling businesses to accelerate their value through automation by providing End-to-End Data Solutions that are automated, agile, and secure. We collaborate with customers to strategize, navigate, and accelerate an ideal path forward...