Senior Data Engineer

2 weeks ago


Hyderabad, India | Appen | Full time

About Appen
Appen is a leader in AI enablement for critical tasks such as model improvement, supervision, and evaluation. To do this we leverage our global crowd of over one million skilled contractors, speaking over 180 languages and dialects, representing 130 countries. In addition, we utilize the industry's most advanced AI-assisted data annotation platform to collect and label various types of data like images, text, speech, audio, and video.

Our data is crucial for building and continuously improving the world's most innovative artificial intelligence systems, and Appen is already trusted by the world's largest technology companies. Now, with the explosion of interest in generative AI, Appen is giving leaders in automotive, financial services, retail, healthcare, and government the confidence to deploy world-class AI products.

At Appen, we are purpose driven. Our fundamental role in AI is to ensure all models are helpful, honest, and harmless, so we firmly believe in unlocking the power of AI to build a better world. We have a learn-it-all culture that values perspective, growth, and innovation. We are customer-obsessed, action-oriented, and celebrate winning together.

At Appen, we are committed to creating an inclusive and diverse workplace. We are an equal opportunity employer that does not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Position Summary
We're hiring a Senior Data Engineer with strong experience in AWS and Databricks to build scalable data solutions that power next-gen AI and machine learning. Join our fast-growing team to work on impactful projects, collaborate with top talent, and drive innovation at scale.

Key Responsibilities

  • Design, build, and manage large-scale data infrastructures using a variety of AWS technologies such as Amazon Redshift, AWS Glue, Amazon Athena, AWS Data Pipeline, Amazon Kinesis, Amazon EMR, and Amazon RDS.
  • Design, develop, and maintain scalable data pipelines and architectures on Databricks using tools such as Delta Lake, Unity Catalog, and Apache Spark (Python or Scala), or similar technologies.
  • Integrate Databricks with cloud platforms like AWS to ensure smooth and secure data flow across systems.
  • Build and automate CI/CD pipelines for deploying, testing, and monitoring Databricks workflows and data jobs.
  • Continuously optimize data workflows for performance, reliability, and security, applying Databricks best practices around data governance and quality.
  • Ensure the performance, availability, and security of datasets across the organization, utilizing AWS's robust suite of tools for data management.
  • Collaborate with data scientists, software engineers, product managers, and other key stakeholders to develop data-driven solutions and models.
  • Translate complex functional and technical requirements into detailed design proposals and implement them.
  • Mentor junior and mid-level data engineers, fostering a culture of continuous learning and improvement within the team.
  • Identify, troubleshoot, and resolve complex data-related issues.
  • Champion best practices in data management, ensuring the cleanliness, integrity, and accessibility of our data.
  • Optimize and fine-tune data queries and processes for performance. Evaluate and advise on technological components, such as software, hardware, and networking capabilities, for database management systems and infrastructure.
  • Stay informed on the latest industry trends and technologies to ensure our data infrastructure is modern and robust.

Qualifications

  • 5-7 years of hands-on experience with AWS data engineering technologies, such as Amazon Redshift, AWS Glue, AWS Data Pipeline, Amazon Kinesis, Amazon RDS, and Apache Airflow.
  • Hands-on experience working with Databricks, including Delta Lake, Apache Spark (Python or Scala), and Unity Catalog.
  • Demonstrated proficiency in SQL and NoSQL databases, ETL tools, and data pipeline workflows.
  • Experience with Python and/or Java.
  • Deep understanding of data structures, data modeling, and software architecture.
  • Experience with AI and machine learning technologies is highly desirable.
  • Strong problem-solving skills and attention to detail.
  • Self-motivated and able to work independently, with excellent organizational and multitasking skills.
  • Exceptional communication skills, with the ability to explain complex data concepts to non-technical stakeholders.
  • Bachelor's Degree in Computer Science, Information Systems, or a related field. A Master's Degree is preferred.

Appen is the global leader in data for the AI Lifecycle with more than 25 years' experience in data sourcing, annotation, and model evaluation. Through our expertise, platform, and global crowd, we enable organizations to launch the world's most innovative artificial intelligence products with speed and at scale. Appen maintains the industry's most advanced AI-assisted data annotation platform and boasts a global crowd of more than 1 million contributors worldwide, speaking more than 235 languages. Our products and services make Appen a trusted partner to leaders in technology, automotive, finance, retail, healthcare, and government. Appen has customers and offices globally.


