Data Engineer Ii
3 days ago
Job Summary: Building on the foundation of the SDE-I role, the DE- II position takes on a greater level of responsibility and leadership. You'll play a crucial role in driving the evolution and efficiency of our data collection and analytics platform, capable of handling terabyte-scale data and billions of data points. Key Responsibilities Lead the design, development, and optimization of large-scale data pipelines and infrastructures using technologies like Apache Airflow, Spark, Kafka, and more. Architect and implement distributed data processing solutions to handle terabyte-scale datasets and billions of records efficiently across multi-region cloud infrastructure (AWS, GCP, DO). Develop and maintain real-time data processing solutions for high-volume data collection operations using technologies like Spark Streaming and Kafka. Optimize data storage strategies using technologies such as Amazon S3, HDFS, and Parquet/Avro file formats for efficient querying and cost management. Build and maintain high-quality ETL pipelines, ensuring robust data collection and transformation processes with a focus on scalability and fault tolerance. Collaborate with data analysts, researchers, and cross-functional teams to define and maintain data quality metrics, implement robust data validation, and enforce security best practices. Mentor junior engineers (SDE-I) and foster a collaborative, growth-oriented environment. Participate in technical discussions, contributing to architectural decisions, and proactively identifying improvements for scalability, performance, and cost-efficiency. Ensure application performance monitoring (APM) is in place, utilizing tools like Datadog, New Relic, or similar to proactively monitor and optimize system performance, detect bottlenecks, and ensure system health. Implement effective data partitioning strategies and indexing for performance optimization in distributed databases such as DynamoDB, Cassandra, or HBase. Stay current with advancements in data engineering, orchestration tools, and emerging cloud technologies, continually enhancing the platform’s capabilities Qualifications & Experience: 4-5+ years of hands-on experience with Apache Airflow and other orchestration tools for managing large-scale workflows and data pipelines. Expertise in AWS technologies, Athena, AWS Glue, DynamoDB, Apache Spark, PySpark, SQL, and NoSQL databases. Experience in designing and managing distributed data processing systems that scale to terabyte and billion-scale datasets using cloud platforms like AWS, GCP, or Digital Ocean. Proficiency in web crawling frameworks, including Node.Js, HTTP protocols, Puppeteer, Playwright, and Chromium for large-scale data extraction. Experience with monitoring and observability tools such as Grafana, Prometheus, Elasticsearch, and familiarity with monitoring and optimizing resource utilization in distributed systems. Strong understanding of infrastructure as code using Terraform, automated CI/CD pipelines with Jenkins, and event-driven architecture with Kafka. Experience with data lake architectures and optimizing storage using formats such as Parquet, Avro, or ORC. Strong background in optimizing query performance and data processing frameworks (Spark, Flink, or Hadoop) for efficient data processing at scale. Knowledge of containerization (Docker, Kubernetes) and orchestration for distributed system deployments. Deep experience in designing resilient data systems with a focus on fault tolerance, data replication, and disaster recovery strategies in distributed environments. Strong data engineering skills, including ETL pipeline development, stream processing, and distributed systems. Excellent problem-solving abilities, with a collaborative mindset and strong communication skills.
-
Caesar II Instructor
2 weeks ago
Bharatpur, India Augmintech Education Pvt. Ltd. Full timeWe’re Hiring: Caesar II Instructor (Remote | Part-Time)Are you an expert in Pipe Stress Analysis with a passion for teaching and mentoring aspiring engineers?Join Augmintech Education Pvt. Ltd., a leader in Building Services and Plant Design education, and an Autodesk Authorized Learning Partner, to empower the next generation of mechanical design...
-
Caesar II Instructor
2 weeks ago
Bharatpur, India Augmintech Education Pvt. Ltd. Full timeWe’re Hiring: Caesar II Instructor (Remote | Part-Time)Are you an expert in Pipe Stress Analysis with a passion for teaching and mentoring aspiring engineers?Join Augmintech Education Pvt. Ltd., a leader in Building Services and Plant Design education, and an Autodesk Authorized Learning Partner, to empower the next generation of mechanical design...
-
Data Engineer
1 day ago
Bharatpur, India Whatjobs IN C2 Full timeJob Title: Azure Data Engineer Location: Bangalore (Hybrid) Experience: 5-10 Years (STRICTLY) Employment Type: Full-Time / Permanent Notice Period: Immediate to 15 Days Preferred Interview Mode : F2F About the Company Our client is a leading financial technology transformation partner helping global banks and financial institutions streamline their lending...
-
Software Development Engineer II
4 days ago
Bharatpur, India ClearDemand Full timeJob Summary: Data is the foundation of our business, and your work will ensure that we continue to deliver high-quality competitive intelligence at scale. Web platforms are constantly evolving, deploying sophisticated anti-bot measures—your job is to stay ahead of them. If you thrive on solving complex technical challenges and enjoy working with real-world...
-
Data Engineer
4 weeks ago
Bharatpur, India KPG99 INC Full timePosition : Data Engineer Location : India Duration : 6+ Months TOP 3 MUST HAVE TECHNICAL SKILLS (Required/Preferred): Must have excellent English written and verbal communication skills.8+ years of experience in data engineeringExperience with Snowflake SQL, ADF (Azure Data Factory), Microsoft MDS (Master Data Service)
-
Senior data engineer
4 weeks ago
Bharatpur, India PASO Co Full timeJob descriptionWe’re Hiring: Senior Data Engineer | Remote (Pan India) | Full-TimeAbout the RoleOne of our client l is looking for a Senior Data Engineer to design and optimize data pipelines, ensure smooth data flow, and support multiple teams with reliable, scalable data solutions. If you love building data systems from the ground up and have hands-on...
-
Gcp data engineer
3 weeks ago
Bharatpur, India People Prime Worldwide Full timeAbout Company: Our Client is a leading Indian multinational IT services and consulting firm. It provides digital transformation, cloud computing, data analytics, enterprise application integration, infrastructure management, and application development services. The company caters to over 700 clients across industries such as banking and financial services,...
-
Data Quality Engineer
1 day ago
Bharatpur, India Whatjobs IN C2 Full timeLooking for Data Quality Engineer| Bangalore to join a team of rockstar developers. The candidate should have a minimum of 8+yrs of experience. About CodeVyasa: CodeVyasa is a mid-sized product engineering company that works with top-tier product/solutions companies such as McKinsey, Walmart, RazorPay, Swiggy, and others. We are about 550+ people strong and...
-
Data/ML Engineer
3 weeks ago
Bharatpur, India Lilo Full timeLilo is revolutionizing procurement for Commercial Real Estate (CRE) businesses by providing a single platform to automate and optimize procurement processes using AI. From hotels to gyms, schools, and senior living homes, we streamline workflows like invoicing, vendor management, and price comparisons, saving our clients both time and money. We are already...
-
Ai/Ml & Data Engineer
2 weeks ago
Bharatpur, India Whatjobs IN C2 Full timeAbout the Job We are looking for an experienced AI/ML & Data Engineer to design, develop, and deploy scalable machine learning models and data infrastructure on AWS. You will work closely with cross-functional teams to deliver AI-driven solutions, integrate large language models (LLMs), and optimize data workflows while ensuring security, scalability, and...