Associate – Site Reliability Engineer (Data Engineering)

1–4 years
4–6 LPA
11 June 24, 2025
Job Description
Job Type: Full Time
Education: Bachelor’s / Master’s Degree in Computer Science / Engineering / Technology / Related Fields
Skills: Python, .NET, React Native, Django, JavaScript, HTML, CSS, TypeScript, Communication Skills, Power BI, NumPy, Pandas, SQL, Machine Learning, Data Analysis, Coimbatore, Data Science, Java, Adobe XD, Figma, PHP, WordPress, Artificial Intelligence, Excel

Job Title: Associate – Site Reliability Engineer (Data Engineering)
Location: Bengaluru, Karnataka, India
Employment Type: Full Time (Permanent)
Experience Required: 1–4 Years

Role Overview

  1. As an Associate Site Reliability Engineer (SRE) within the Data Engineering team at Goldman Sachs, you will work at the intersection of software engineering and platform reliability to ensure robust, scalable, and cost-effective data systems.

  2. You will support some of Goldman Sachs’s largest data platforms, driving observability, operational excellence, and strategic platform enhancements.

  3. You will partner with engineers and analysts across the firm to enable smooth, reliable, and performant access to critical data systems.

Key Responsibilities

  1. Drive cloud adoption strategies for data processing and warehousing solutions.

  2. Lead SRE strategy and implementation for large-scale data platforms including Data Lake and Lakehouse systems.

  3. Engage with data consumers and producers to align platform reliability with business cost and performance expectations.

  4. Design and implement observability, performance, and cost monitoring strategies using modern SRE tools.

  5. Develop and maintain automation for DevOps and CI/CD pipelines in support of platform efficiency and risk management.

  6. Take full lifecycle responsibility for platforms — from design through deprecation — including on-call rotation and postmortem analysis.

  7. Use data-driven approaches to inform platform decisions and continuous improvement efforts.

  8. Contribute to documentation, knowledge sharing, and best practice development across engineering teams.

Basic Qualifications

  1. Bachelor’s or Master’s degree in Computer Science, Applied Mathematics, Engineering, or a related quantitative field.

  2. 1–4 years of professional experience in software engineering or SRE roles.

  3. At least 1–2 years of hands-on coding experience (Python or Java preferred).

  4. Understanding of SRE and DevOps principles, automation, and operational risk mitigation.

  5. Experience working with cloud infrastructure (AWS, Azure, or GCP).

  6. Proven track record in data strategy, platform reliability, and cost optimization.

  7. Proficiency in SQL and data warehousing concepts including schema design, indexing, and partitioning.

  8. Familiarity with relational and columnar databases and performance tuning.

  9. Strong understanding of data governance including quality, traceability, latency, and security.

  10. Excellent analytical and problem-solving skills with a commercial focus.

  11. Effective communication skills with the ability to translate complex data concepts into actionable strategies.

  12. Self-motivated, collaborative, and capable of working in fast-paced global environments.

Preferred Qualifications

  1. Experience with Snowflake or other cloud-based databases such as BigQuery.

  2. Familiarity with Data Lake / Lakehouse architectures, including technologies like Apache Iceberg.

  3. Working knowledge of AWS Lambda and of open-source tools such as Prometheus and OpenTelemetry.

  4. Experience with observability tooling such as Grafana and the PromQL query language.

  5. Exposure to GitLab CI/CD, infrastructure as code, and microservices-based environments.

  6. Understanding of data modeling concepts and entitlement implementations.

Technical Stack

  1. Programming: Python, Java

  2. Platforms: AWS, Snowflake, BigQuery

  3. Tools: Prometheus, Grafana, OpenTelemetry, GitLab CI/CD

  4. Databases: SQL (relational and columnar), NoSQL

  5. SRE Concepts: Monitoring, Reliability, Capacity Planning, Automation
