Rahul Singh

Data Engineer

https://talent.gravityer.com/rahul-singh-8753917

Data Engineer with less than a year in Big Data & Cloud

Noida, Uttar Pradesh, India

Key Strengths

Strong foundational knowledge in data engineering concepts, including ETL/ELT, data ingestion, and batch processing.
Demonstrated experience with cloud platforms (Azure, AWS, GCP) and big data technologies (Spark, PySpark, Hadoop).
Practical experience in designing Lakehouse architectures (Bronze, Silver, Gold layers) and data modeling (fact/dimension tables).
Proficiency in PySpark for data transformations, quality, and consistency.
Experience with incremental loading strategies using watermark-based CDC.
Familiarity with Azure Data Factory for pipeline orchestration.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's academic projects demonstrate a strong alignment with the Data Engineer role, focusing on end-to-end data pipelines and incremental loading. The breadth of skills across various cloud platforms (Azure, AWS, GCP) and big data tools (Hadoop, Spark, PySpark) indicates a willingness to learn and adapt to different technologies. The certifications further support a continuous learning mindset, which is positive for cultural fit in a dynamic environment. However, the lack of professional experience means the diversity of real-world problem-solving scenarios is limited to academic projects.

Soft Skills & Operational Fit

The candidate's project descriptions indicate an ability to collaborate and ensure data consistency and reliability. The academic background and certifications suggest a proactive learning attitude. However, without direct work experience, it's difficult to assess operational fit, stress handling, or team collaboration in a professional setting.

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

Projects

End-to-End Data Engineering Pipeline (E-Commerce Domain)/PROJECT 01

January 1, 2024 – June 1, 2025

Designed Bronze → Silver → Gold Lakehouse architecture using Azure. Ingested structured and unstructured data from multiple sources including: GitHub (CSV files), REST APIs (HTTP source), MySQL database (https://filess.io), Azure SQL Database. Designed and managed data ingestion pipelines using ADF pipelines with Lookup + ForEach activities, supporting scalable pipeline design. Implemented data transformations and processing in PySpark ensuring data quality, consistency, and reliability. Created curated Silver layer and fact/dimension tables in Gold layer using proper data modeling. Integrated Azure Synapse Analytics with ADLS Gen2 to support reporting and analytics. Version-controlled entire project using GitHub. Collaborated to design and maintain reliable data pipelines, ensuring data consistency and supporting reporting and analytics.

Incremental Loading Data Pipeline Using Watermark Strategy/PROJECT 02

January 1, 2024 – June 1, 2025

Simulated an OLTP database and designed incremental data pipelines events using a watermark-based CDC approach. Extracted only new and updated records to optimize data processing and maintain consistency. Loaded raw incremental data into Bronze layer Performed data cleansing, deduplication, and transformations in Silver layer using PySpark. analytics-ready datasets and fact tables in Gold layer for reporting. Ensured idempotent execution and pipeline reliability across runs. Managed and supported end-to-end pipeline execution, assisting in monitoring and ensuring reliable data processing.

View Project

Certifications

AWS Cloud computing certificate

ShapemySkills

June 1, 2026 – Present

Python for Data Science Certificate

IBM

June 1, 2026 – Present

Master Big Data with Hadoop, Spark, Kafka & Cloud | Build Real-World Projects & Scalable Data Pipelines from Scratch

Udemy

June 1, 2026 – Present

The modern JavaScript course for everyone! Master JavaScript with projects, challenges and theory. Many courses in one!

Udemy

June 1, 2026 – Present

Master modern React from beginner to advanced! Next.js, Context API, React Query, Redux, Tailwind, advanced patterns

Udemy

June 1, 2026 – Present

Key Strengths

Strong foundational knowledge in data engineering concepts, including ETL/ELT, data ingestion, and batch processing.
Demonstrated experience with cloud platforms (Azure, AWS, GCP) and big data technologies (Spark, PySpark, Hadoop).
Practical experience in designing Lakehouse architectures (Bronze, Silver, Gold layers) and data modeling (fact/dimension tables).
Proficiency in PySpark for data transformations, quality, and consistency.
Experience with incremental loading strategies using watermark-based CDC.
Familiarity with Azure Data Factory for pipeline orchestration.

Cultural & Operational Fit

Cultural Fit Analysis

Soft Skills & Operational Fit

Rahul Singh

Key Strengths

Cultural & Operational Fit

About

Top Skills

Skills

Education

Projects

Certifications

Key Strengths

Cultural & Operational Fit