Vector DB Engineer – Data Scientist
4 days ago
Career Area:
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Your Impact Shapes the World at Caterpillar Inc
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Job Summary
Caterpillar is seeking a Vector DB Engineer – Data Scientist join Applications Development & Intelligence Automation -CAT IT Division.
The incumbent would be responsible for designing, implementing, and optimizing vector databases that enable high-performance, large-scale data processing and retrieval. You will work closely with our data science, machine learning, and software engineering teams to build robust solutions that support our clients' data-intensive applications.
The preference for this role is to be based out of Bangalore – Caterpillar PSN
What you will do
- Design, implement, and manage vector databases to support large-scale data storage and retrieval, ensuring low latency and high availability.
- Develop efficient data models that facilitate fast vector operations such as similarity search, nearest neighbor search, and other vector-based queries.
- Optimize database performance through indexing, partitioning, sharding, and other techniques to handle large-scale datasets.
- Integrate vector databases with existing systems and applications, ensuring seamless data flow and accessibility.
- Design and implement solutions that scale with growing data volumes, ensuring the database infrastructure can handle increased load and complexity.
- Implement security best practices to protect data at rest and in transit, including encryption, access controls, and audit logging.
- Monitor database performance and troubleshoot issues as they arise, ensuring system reliability and availability.
- Work closely with data scientists, machine learning engineers, and software developers to understand their needs and provide database solutions that meet their requirements.
- Maintain comprehensive documentation for database schemas, configurations, and procedures to support operational excellence and knowledge sharing.
What you will have
Must Have Skills:
- Deep understanding and hands-on experience with vector databases, including their architecture, query languages, and optimization techniques.
- Strong programming skills in languages such as Python, C++, or Java, with experience in developing and optimizing database operations.
- Solid understanding of data structures, algorithms, and computational geometry, particularly related to vector search and similarity measures
- Experience with cloud platforms (e.g., AWS, GCP, Azure) and managed database services.
- Understanding of machine learning concepts, particularly those related to embedding vectors and similarity searches.
- Strong problem-solving skills with a focus on performance optimization and scalability.
Excellent communication skills, with the ability to articulate complex technical concepts to non-technical stakeholders.
This position requires candidate to work a 5-day -a -week schedule in the office
- Shift Timing :01:00PM -10:00PM IST
Skills desired:
Business Statistics: Knowledge of the statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.
Level Working Knowledge:
- Explains the basic decision process associated with specific statistics.
- Works with basic statistical functions on a spreadsheet or a calculator.
- Explains reasons for common statistical errors, misinterpretations, and misrepresentations.
- Describes characteristics of sample size, normal distributions, and standard deviation.
- Generates and interprets basic statistical data.
Accuracy and Attention to Detail: Understanding the necessity and value of accuracy; ability to complete tasks with high levels of precision.
Level Extensive Experience:
- Evaluates and makes contributions to best practices.
- Processes large quantities of detailed information with high levels of accuracy.
- Productively balances speed and accuracy.
- Employs techniques for motivating personnel to meet or exceed accuracy goals.
- Implements a variety of cross-checking approaches and mechanisms.
- Demonstrates expertise in quality assurance tools, techniques, and standards.
Analytical Thinking: Knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.
Level Working Knowledge:
- Approaches a situation or problem by defining the problem or issue and determining its significance.
- Makes a systematic comparison of two or more alternative solutions.
- Uses flow charts, Pareto charts, fish diagrams, etc. to disclose meaningful data patterns.
- Identifies the major forces, events and people impacting and impacted by the situation at hand.
- Uses logic and intuition to make inferences about the meaning of the data and arrive at conclusions.
Machine Learning: Knowledge of principles, technologies and algorithms of machine learning; ability to develop, implement and deliver related systems, products and services.
Level Working Knowledge:
- Completes specific tasks and initiatives utilizing machine learning technologies, such as search engine optimization.
- Utilizes specific tools and techniques to process descriptive and inferential statistics.
- Applies specific computing languages and tools in machine learning, such as R and Python.
- Explores to use machine learning in one own areas to make business improvements.
- Conducts data mining and cleaning initiatives.
Programming Languages: Knowledge of basic concepts and capabilities of programming; ability to use tools, techniques and platforms in order to write and modify programming languages.
Level Working Knowledge:
- Participates in the implementation and support of specialized programming languages.
- Conducts basic reviews on writing a specific programming language within a specific platform.
- Assists with the design and development of specialized programming languages.
- Follows an organization's standards, policies and guidelines for structured programming specifications.
- Diagnoses and reports minor or routine programming language problems.
Query and Database Access Tools: Knowledge of data management systems; ability to use, support and access facilities for searching, extracting and formatting data for further use.
Level Working Knowledge:
- Defines, creates and tests simple queries by using associated command language in a specific environment.
- Applies appropriate query tools used to connect to the data warehouse.
- Obtains and analyzes query access path information and query results.
- Employs tested query statements to retrieve, insert, update and delete information.
- Works with advanced features and functions including sorting, filtering and making simple calculations.
Requirements Analysis: Knowledge of tools, methods, and techniques of requirement analysis; ability to elicit, analyze and record required business functionality and non-functionality requirements to ensure the success of a system or software development project.
Level Working Knowledge:
- Follows policies, practices and standards for determining functional and informational requirements.
- Confirms deliverables associated with requirements analysis.
- Communicates with customers and users to elicit and gather client requirements.
- Participates in the preparation of detailed documentation and requirements.
- Utilizes specific organizational methods, tools and techniques for requirements analysis.
What you will get:
- Work Life Harmony
- Earned and medical leave.
- Relocation assistance
Holistic Development
- Personal and professional development through Caterpillar 's employee resource groups across the globe
- Career developments opportunities with global prospects
Health and Wellness
- Medical coverage -Medical, life and personal accident coverage
- Employee mental wellness assistance program
Financial Wellness
- Employee investment plan
- Pay for performance -Annual incentive Bonus plan.
Additional Information:
Caterpillar is not currently hiring individuals for this position who now or in the future require sponsorship for employment visa status; however, as a global company, Caterpillar offers many job opportunities outside of the U.S. which can be found through our employment website at
Posting Dates:
October 21, November 3, 2025
Caterpillar is an Equal Opportunity Employer. Qualified applicants of any age are encouraged to apply
Not ready to apply? Join our Talent Community.
-
Data Scientist
2 weeks ago
Bengaluru, Karnataka, India IITMATRIX Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescription: Roles & Responsibilities : - Lead the roadmap for Generative AI (GenAI) solutions across product and platform teams. - Architect and optimise RAG pipelines (retrieval, embeddings, hybrid search, re-ranking, caching, latency-cost tradeoffs). - Establish robust evaluation frameworks (automatic human-in-the-loop) to measure LLM outputs...
-
Principal Data Scientist
6 days ago
Bengaluru, Karnataka, India Ecolab Global Services Full time ₹ 1,50,00,000 - ₹ 3,00,00,000 per yearThe Principal Data Scientist/AI Engineer is a senior technical expert who leads complex, high-impact AI initiatives across the organization. This role is responsible for driving architectural decisions, establishing reusable frameworks, and influencing the AI strategy at an enterprise level. The Principal AI Engineer operates as a cross-functional advisor,...
-
Data Scientist
4 days ago
Bengaluru, Karnataka, India Straive Full timeAbout the RoleStraive is looking for a talented and drivenConsultant / Data Scientist / GenAI Engineerto join ourAnalytics & GenAI delivery team. In this role, you will work under the guidance of the Senior Project Manager / Engagement Manager to design, develop, and deploy advanced AI/ML and Generative AI solutions for global enterprise clients. You will be...
-
Principal Data Scientist
2 weeks ago
Bengaluru, Karnataka, India IITMatrix Full time ₹ 12,00,000 - ₹ 36,00,000 per yearPosition DetailsProfile Name: Principal Data Scientist – GenAIQualification & EligibilityBachelor's or higher degree in a quantitative discipline (Computer Science, Statistics, Engineering, Applied Mathematics).Work ExperienceMinimum 8+ years of experience .Startup experience preferred; Edtech work experience is a plus.Roles & ResponsibilitiesLead the...
-
Data Scientists
6 days ago
Bengaluru, Karnataka, India Skill On Tap Full time ₹ 8,00,000 - ₹ 24,00,000 per yearJob Title: Data Scientists (GenAI/Agentic)Experience: 3 to 5 yearsLocation: BangaloreWork Hours: 12:30 PM till 9:30 PM; (~2-3 hours overlap to US Central Time)Work Arrangement: Hybrid (Monday, Tuesday and Wednesday work from office, Thursday and Friday work remotely)Client is in direction to implement full 5 days working from office. Resource need to be...
-
Data Scientist with Gen ai
2 weeks ago
Bengaluru, Karnataka, India People Prime Worldwide Full time ₹ 9,00,000 - ₹ 12,00,000 per yearOur is Client is a largest Top 5 Software giant in India, with over 11.3 USD billion dollars revenue, Global work force 2,40,000 employees, It delivers end-to-end technology, consulting, and business process services to clients across the globe, Presence: 60+ countries and Publicly traded company NSE & BSE (India), NYSE (USA).· Job Title: Data Scientist...
-
Data Scientist
6 days ago
Bengaluru, Karnataka, India KPMG Assurance and Consulting Services LLP Full time ₹ 15,00,000 - ₹ 25,00,000 per yearExciting Opportunity: Data Scientist (AI DevOps / MLOps / LLMs / Cloud)Location: BangaloreExperience: 4 to 6 yearsDesignation: ConsultantHi folksI have got an exciting opportunity for a Data Scientist role with a strong focus on AI DevOps and MLOps. If you feel you're a good fit, feel free to apply or drop your resume directly at You can also reach out to...
-
Data Engineer
2 weeks ago
Bengaluru, Karnataka, India VDart Software Services Full time ₹ 6,00,000 - ₹ 18,00,000 per yearData Engineer / MLOpsExperience: 2 - 5 yrsLocation: Bangalore (In office)Team: AI PracticeNotice: ImmediateRole summary Build ingestion pipelines, orchestrate feature/embedding pipelines, enable reproducible model deployments and monitoring.Key responsibilitiesBuild ETL/ingestion jobs for documents, logs, and structured data.Use Spark framework or...
-
Bengaluru, Karnataka, India VECTOR Informatik Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Profile for:Software Engineer – Test Engineering ServicesExperience: yearsLocation: Bengaluru, India.Job descriptionDeliver comprehensive turn-key HIL test systems and framework test automation to meet customer requirements.Proactively initiate customer contact on a regular basis to understand their goals, challenges, and requirements for test system...
-
Data Scientist
2 weeks ago
Bengaluru, Karnataka, India CoffeeBeans Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout CoffeeBeans ConsultingCoffeeBeans is a tech-driven software consulting company that helps businesses solve complex problems using modern data, AI, and engineering solutions. We blend deep technical expertise with a product mindset to build scalable, intelligent, and high-impact solutions across industries. Our data science team works on end-to-end...