Data Engineer II
3 days ago
Job Summary:
Building on the foundation of the SDE-I role, the DE-II position takes on a greater level of responsibility and leadership. You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, which handles terabyte-scale data and billions of data points.

Key Responsibilities:
Lead the design, development, and optimization of large-scale data pipelines and infrastructure using technologies such as Apache Airflow, Spark, and Kafka.
Architect and implement distributed data processing solutions that handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DO).
Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies such as Spark Streaming and Kafka.
Optimize data storage strategies using technologies such as Amazon S3, HDFS, and the Parquet/Avro file formats for efficient querying and cost management.
Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance.
Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices.
Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment.
Participate in technical discussions, contribute to architectural decisions, and proactively identify improvements in scalability, performance, and cost-efficiency.
Ensure application performance monitoring (APM) is in place, using tools such as Datadog or New Relic to proactively monitor and optimize system performance, detect bottlenecks, and ensure system health.
Implement effective data partitioning and indexing strategies for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase.
Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilities.

Qualifications & Experience:
4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines.
Expertise in AWS technologies, Athena, AWS Glue, DynamoDB, Apache Spark, PySpark, SQL, and NoSQL databases.
Experience designing and managing distributed data processing systems that scale to terabyte and billion-scale datasets on cloud platforms such as AWS, GCP, or DigitalOcean.
Proficiency in web crawling tools and technologies, including Node.js, HTTP protocols, Puppeteer, Playwright, and Chromium, for large-scale data extraction.
Experience with monitoring and observability tools such as Grafana, Prometheus, and Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems.
Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka.
Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC.
Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale.
Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments.
Deep experience designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments.
Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems.
Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.
-
Data Engineer
3 weeks ago
Mount Abu, India Forage AI Full time
Experience Level: Data Engineer, 3-7 years of relevant experience in data engineering. About Forage AI: Forage AI is a pioneering AI-powered data extraction and automation company that transforms complex, unstructured web and document data into clean, structured intelligence. Our platform combines web crawling, NLP, LLMs, and agentic AI to deliver highly...
-
Data Engineer with Databricks
3 weeks ago
Mount Abu, India KPG99 INC Full time
Job Title: Databricks Engineer. Location: Remote. Duration: 12 months, contract with extensions. Job Description: REQUIRED SKILLS AND EXPERIENCE - 3–5 years of experience in data engineering roles - Strong hands-on experience with Databricks for data processing and pipeline development. - Proficiency in SQL for data querying, transformation, and troubleshooting....
-
Freelance Data Engineer
3 weeks ago
Mount Abu, India upGrad Full time
We are seeking a highly skilled and motivated Data Engineer to join our team. The ideal candidate will be responsible for designing, developing, and optimizing large-scale data pipelines and data warehouse solutions, utilizing a modern, cloud-native data stack. You'll play a crucial role in transforming raw data into actionable insights, ensuring data...
-
Cloud Data Engineer
2 weeks ago
Mount Abu, India Lemongrass Full time
About Lemongrass: Lemongrass is a software-enabled services provider, synonymous with SAP on Cloud, focused on delivering superior, highly automated Managed Services to Enterprise customers. Our customers span multiple verticals and geographies across the Americas, EMEA and APAC. We partner with AWS, SAP, Microsoft and other global technology leaders. We are...
-
Senior Data Engineer
3 weeks ago
Mount Abu, India Globex Digital Full time
Data Engineer Role. Location: India, Remote. Are you a passionate Data Engineer ready to design and build cutting-edge, scalable, and reliable ML/AI solutions? Join our dynamic cross-functional feature team in India and drive innovation from concept to industrialization. We're looking for an expert to integrate and optimize algorithms across the entire product...
-
Full stack data
4 weeks ago
Mount Abu, India ContexQ Full time
Senior Full Stack Data Engineer – Enterprise Analytics Platform. Location: Remote (preference for India-based candidates). Compensation: Competitive salary + significant equity package. Company: ContexQ (Singapore HQ). About ContexQ: ContexQ, a Singapore-based B2B SaaS AI startup, is dedicated to transforming financial crime, fraud, and risk management through...
-
Social Media Data Engineer
4 weeks ago
Mount Abu, India Deeter Investments LLP Full time
NOTE: IF YOU HAVE NO EXPERIENCE ACQUIRING BULK SOCIAL MEDIA DATA, DO NOT APPLY. Please explain your relevant experience clearly at the TOP of your CV. If you do not, you will not be considered. About Deeter Investments: Deeter Investments is a founder-led proprietary trading firm built around real-time, data-driven decision-making. We prize curiosity,...
-
Senior Data Engineer with AWS_ Exp: 6+ Years
3 days ago
Mount Abu, India Atyeti Inc Full time
Required Skills & Qualifications: Bachelor’s or Master’s degree in Computer Science or equivalent experience. 5+ years of hands-on experience as an Application Developer or in similar software engineering roles. Advanced skills in Python; strong SQL and cloud-native development experience (AWS). Proven expertise in architecting scalable data pipelines (ETL/ELT)...
-
Lead Data Architect
3 weeks ago
Mount Abu, India ACL Digital Full time
Technology Skills and Project Experience: 12 years of experience in modeling and business system design. 10 years of hands-on experience as a Data Engineer/Architect with proven experience in Big Data and Data Lakes on AWS. 7 years of experience in the technologies below. Scripts/Programming languages: SQL, Python, PROC, Windows Functions. GitHub. AWS: S3, Redshift, Glue. Big...
-
Data Scientist
3 weeks ago
Mount Abu, India Lingaro Full time
Role: Data Scientist - Gen AI. Location: India, Remote. About Lingaro: Lingaro Group is the end-to-end data services partner to global brands and enterprises. We lead our clients through their data journey, from strategy through development to operations and adoption, helping them to realize the full value of their data. Since 2008, Lingaro has been recognized by...