Data Engineer – Data Integration
Location: Pune, Maharashtra, India
Domain: Consulting
Experience Level: Early Professional
Introduction
You’ll work in one of IBM’s Consulting Client Innovation Centers (Delivery Centers).
Delivery centers provide deep technical and industry expertise for clients across public and private sectors.
These centers support innovation and technology adoption by offering locally based skills and technical expertise.
Your Role and Responsibilities
As a Data Engineer at IBM, you will:
Harness data to discover stories and identify patterns.
Participate in data gathering, storage, and real-time & batch processing.
Collaborate with diverse teams to select suitable data systems.
Identify critical data for meaningful analysis.
Overcome challenges in database integration.
Manage and process complex, unstructured datasets.
Responsibilities include:
7. Implementing and validating predictive models.
8. Creating and maintaining statistical models using big data, ML, and statistical techniques.
9. Designing and implementing enterprise search tools like Elasticsearch and Splunk.
10. Working in an Agile and collaborative environment with cross-functional teams.
11. Writing programs to cleanse and integrate data efficiently.
12. Developing predictive/prescriptive models and evaluating results.
Education
Required: Bachelor's Degree
Preferred: Master’s Degree
Required Technical and Professional Expertise
Strong in designing scalable data warehouses in Snowflake (schema design, tuning, optimization).
Experienced in building data pipelines using Talend for structured/unstructured data.
Skilled in integrating data from cloud sources into Snowflake using Talend and native tools.
Proficient in dimensional and relational data modeling for analytics/reporting.
Preferred Technical and Professional Experience
Understanding of optimizing Snowflake workloads (clustering keys, caching, query profiling).
Capable of implementing data validation, cleansing, and governance in ETL.
Strong in SQL and/or Shell scripting for custom transformations and automation.