Dataproc Lead, Spark, OSS Technologies, Cloud
2 days ago
Minimum qualifications:
- Bachelor's degree or equivalent practical experience.
- 5 years of experience with software development in one or more programming languages, and with data structures/algorithms.
- Experience in software development and engineering, incorporating design methodologies, leveraging open source technologies, and working with distributed computing systems, including Apache Spark, Apache Hadoop, and Apache Hive.
- Experience in Open Source technologies, Big Data, Data Analytics, Artificial Intelligence, Machine Learning, and Database Internals.
Preferred qualifications:
- Experience with database optimizations such as query and executor optimizations.
- Experience with data lakes like Apache Iceberg, Apache Hudi, Delta Lake, etc.
- Experience with Open Telemetry, JMX and other monitoring solutions.
- Experience with OSS projects like Spark, Hive, Trino, Ray, Flink, etc.
- Experience working with data science tools such as Jupyter notebooks.
- Experience developing Cloud or SaaS products.
About The Job
Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
Cloud Dataproc enables open source data analytics users (Apache Hadoop, Spark, Trino, Flink, etc.) to lift and modernize their workloads into the cloud. Dataproc is a fast, easy-to-use, fully managed cloud service for running Apache Spark, Apache Hadoop and dozens of other OSS software in a simpler, performant and cost-efficient way. Dataproc also easily integrates with other Google Cloud Platform (GCP) services like BigQuery, Dataplex (governance, lineage), Catalog Stores to give a powerful and complete platform for data processing, analytics, and machine learning.
Google Cloud accelerates every organization's ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google's cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
Responsibilities
- Build high-impact customer-facing features which make Cloud Dataproc the best place to run Spark, Ray, Trino, Flink and newer technologies in the cloud.
- Define the roadmap for Open Source technologies like Spark, Ray, Trino, Flink, etc.
- Define and implement the next generation Data Lakes and Lake Houses focusing on technologies like Iceberg, Hudi and Delta.
- Optimize the open source technologies for performance and efficiency.
- Design and build software stack to take advantage of Google technologies for faster cluster setup, efficient cluster operations, comprehensive monitoring and observability.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form .
-
Senior Software Engineer, Dataproc
2 weeks ago
Bengaluru, Karnataka, India Google Full time ₹ 20,00,000 - ₹ 25,00,000 per yearMinimum qualifications:Bachelor's degree or equivalent practical experience.5 years of experience with software development in one or more programming languages.3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.3 years of experience with developing large-scale...
-
Senior Software Engineer, Dataproc
2 weeks ago
Bengaluru, Karnataka, India Google Full time ₹ 20,00,000 - ₹ 25,00,000 per yearMinimum qualifications:Bachelor's degree or equivalent practical experience.5 years of experience with software development in one or more programming languages.3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.3 years of experience with developing large-scale...
-
Bengaluru, Karnataka, India Google Full time ₹ 20,00,000 - ₹ 25,00,000 per yearMinimum qualifications:Bachelor's degree or equivalent practical experience.8 years of experience with one or more general purpose programming languages, including Java, C/C++ or Python.3 years of experience with software design and architecture.3 years of experience with open source or developer technologies.Experience in software development and...
-
Senior Engineer, OSS
2 days ago
Bengaluru, Karnataka, India Rakuten Symphony Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWhy should you choose us?Rakuten Symphony is reimagining telecom, changing supply chain norms and disrupting outmoded thinking that threatens the industry's pursuit of rapid innovation and growth. Based on proven modern infrastructure practices, its open interface platforms make it possible to launch and operate advanced mobile services in a fraction of the...
-
Java Spark Lead
2 weeks ago
Bengaluru, Karnataka, India Infosys Full time ₹ 5,00,000 - ₹ 15,00,000 per yearJava Spark LeadStrong understanding of distributed computing and big data concepts. Experience with Hadoop ecosystem, Kafka, Hive, and other data tools is a plus. Proficiency in writing optimized Spark jobs and handling large-scale data. Familiarity with cloud platforms (AWS, Azure, GCP) and containerization (Docker, Kubernetes). Excellent problem-solving,...
-
senior data engineer
2 weeks ago
Bengaluru, Karnataka, India Happiest Minds Technologies Full time ₹ 15,00,000 - ₹ 25,00,000 per yearAbout Happiest Minds:**Happiest Minds is a leading digital transformation and technology services company, empowered by the mission of enhancing the happiness of our customers, employees, and society at large. We provide cutting-edge solutions leveraging advanced technologies to drive meaningful outcomes for our clients.Location: Manyata Tech Park. JD:Need...
-
GCP cloud engineer
2 days ago
Bengaluru, Karnataka, India Impetus Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Descriptions for Big data or Cloud EngineerPosition Summary:We are looking for candidates with hands on experience in Big Data with GCP cloud.Qualifications4-7 years of IT experience range is preferred.Able to effectively use GCP managed services e.g. Dataproc, Dataflow, pub/sub, Cloud functions, Big Query, GCS - At least 4 of these Services.Good to have...
-
Senior Software Engineer, ML Infrastructure
2 weeks ago
Bengaluru, Karnataka, India Google Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMinimum qualifications:Bachelor's degree or equivalent practical experience.5 years of coding experience in one or more of the following languages: C, C++, Java, or Python.3 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).Experience with Cloud Security and Compliance.Experience...
-
Java Spark Lead
2 weeks ago
Bengaluru, Karnataka, India Infosys Full time ₹ 15,00,000 - ₹ 25,00,000 per yearKey Responsibilities:Lead the design and development of scalable data processing solutions using Java and Apache SparkCollaborate with data architects analysts and other stakeholders to understand data requirementsOptimize Spark jobs for performance and scalability in distributed environmentsEnsure code quality through code reviews unit testing and best...
-
GCP cloud engineer
2 weeks ago
Bengaluru, Karnataka, India, Karnataka Impetus Full timeJob Descriptions for Big data or Cloud EngineerPosition Summary:We are looking for candidates with hands on experience in Big Data with GCP cloud.Qualifications4-7 years of IT experience range is preferred.Able to effectively use GCP managed services e.g. Dataproc, Dataflow, pub/sub, Cloud functions, Big Query, GCS - At least 4 of these Services.Good to have...