Current jobs related to Big Data with Pyspark - Remote - Achutha Associates

  • Big Data Trainer

    1 day ago


    Remote, India REGex Software Services Full time

    Required Big Data Trainer who is expert in following topics: Python Introduction to LINUX Operating System and Basic LINUX commands Hadoop(HDFS) Hadoop 2.0 & YARN Sqoop Hive Programming PySpark ETL **Job Types**: Part-time, Freelance, Contractual / Temporary Contract length: 6-8 weeks Part-time hours: 10-12 per week **Salary**: ₹500.00 -...


  • Remote, India SureEvents Full time

    **Job Code**:SE / RPD / 00598**Job Location**:Remote**Experience**:2-4 years**Job Location**: Remote **Job Type**: Contract(3 months) **Experience**: 2-4 years **Skills Required**: - Bachelor’s/Master’s degree in Engineering, Computer Science (or equivalent experience). - At least 3+ years of relevant experience as a Data Scientist/Data Analyst. - 2+...


  • Remote, India Databricks Full time

    CSQ225R23 As a** Resident Solutions Architect (Big Data)**, you will work with clients on short to medium term customer engagements on their big data challenges using the Databricks platform. You will provide data engineering, data science, and cloud technology projects which require integrating with client systems, training, and other technical tasks to...

  • Big Data Architect

    3 days ago


    Remote, India Databricks Full time

    CSQ424R34 As a** Big Data Architect **in our Professional Services team you will work with clients on short to medium term customer engagements on their **Big Data challenges using the Databricks platform**. You will provide data engineering, data science, and cloud technology projects which require integrating with client systems, training, and other...

  • Data Scientist

    2 weeks ago


    Remote, India Technology-Next Full time

    **Job Title**: Data Scientist **Location**: Pune / Mumbai (On-site) **Experience**: 3 to 4 Years **Notice Period**: Immediate Joiners Only **Working Days**: 5 Days a Week (On-site) **Role Summary** We are seeking a proactive and experienced Data Scientist who can contribute from day one. You’ll work on high-impact projects using machine learning and...


  • Remote/Ahmedabad, India PRAMA INNOVATIONS INDIA PRIVATE LIMITED Full time

    About the Role : We are seeking a highly skilled Python/PySpark Developer to join our dynamic data engineering team. In this role, you will be responsible for developing and maintaining high-performance data pipelines using PySpark on cloud platforms (preferably AWS). You will work closely with data engineers to extract, transform, and load large datasets...

  • Data Scientist

    2 weeks ago


    Remote, India Unique Inspiration Full time

    Working knowledge in basic statistical concepts such as properties of distributions, statistical tests and their proper usage - Mine and analyze data from various sources to drive optimization and improvement of product development. - Develops and validates analytical methods; coordinated methods and technical transfers - Use advanced analytics methods to...


  • Remote, India Expertel SA - Proceedit Full time

    Are you a tech-savvy individual with a passion for data, IoT, and analytics? If you're eager to dive into the world of big data and IoT while gaining hands-on experience with diverse data science methodologies and technologies, we invite you to join our Data Unit at Proceedit. We're seeking ambitious interns who are ready to explore, analyze, and unlock the...

  • Python+pyspark

    5 days ago


    Remote, India Covalense Global Full time

    **Benefits**: **Our compensation and benefits packages differ from country to country in accordance with local preferences and legislation**: Equal Employment Opportunity : We comply with all applicable laws prohibiting discrimination or harassment against any applicant or employee. Competitive salary packages & benefits : We are continually monitoring and...


  • Remote, India Data PlatformExperts Full time

    **Job Description: Azure Data Architect** **Key Responsibilities**: - **Design and Implementation**: - Design and implement end-to-end data solutions (data models, pipelines, data lakes, data warehouses) in Azure. - Optimize and maintain existing Azure data structures and integration processes. - Ensure architectural solutions are scalable, maintainable,...

Big Data with Pyspark

3 weeks ago


Remote, India Achutha Associates Full time

**Job Title**: Big Data Engineer (PySpark)
**Location**: Bengaluru, India
**Experience**: 5+ years
**Employment Type**: Full-time

**Job Summary**:
**Key Responsibilities**:

- Design, develop, and maintain scalable **big data pipelines** using **PySpark** and other big data technologies.
- Work with **Hadoop, Spark, Kafka, Hive, and other distributed data processing frameworks**.
- Optimize **ETL workflows** and ensure efficient data processing.
- Implement **data quality checks, monitoring, and validation** to ensure high data integrity.
- Collaborate with **data scientists, analysts, and business teams** to understand requirements and deliver solutions.
- Optimize **Spark performance** by tuning jobs and implementing best practices for distributed computing.
- Manage and process **structured and unstructured data** from multiple sources.
- Work with **cloud platforms** like AWS, Azure, or GCP for big data storage and processing.
- Troubleshoot and debug **performance issues** related to big data systems.

**Required Skills**:

- Strong experience with **PySpark and Spark (RDD, DataFrame, Spark SQL)**.
- Proficiency in **Hadoop ecosystem** (HDFS, Hive, HBase, Oozie, etc.).
- Experience with **Kafka, Airflow, or other data orchestration tools**.
- Strong **SQL** skills for querying and optimizing data processing.
- Experience with **cloud platforms** (AWS Glue, EMR, Azure Databricks, GCP BigQuery, etc.).
- Proficiency in **Python and Scala** for big data processing.
- Knowledge of **data lake and data warehouse concepts**.
- Experience in **CI/CD pipelines for data engineering** is a plus.
- Strong problem-solving skills and the ability to work in an **agile environment**.

Pay: ₹50,000.00 - ₹100,000.00 per month

Schedule:

- Day shift

**Experience**:

- Big data with PySpark: 6 years (required)

Work Location: Remote