Data Engineer with less than a year in Big Data & Cloud
AI is analyzing your overall score…
Identifying your key strengths…
Evaluating your skill match against the job requirements…
Assessing your cultural and operational fit
Aspiring Big Data Engineer with strong computer science fundamentals and hands-on experience building data pipelines using modern big data technologies. Skilled in designing ETL/ELT workflows, data ingestion, incremental loading, and batch processing on cloud computing platforms using Apache Spark, PySpark, Hadoop, SQL, and Python to support analytics teams.
Amity University Online
Master of Compute Application
August 1, 2023 – June 30, 2025
ITS Mohan Nagar GZB
Bachelor of Compute Application
August 1, 2020 – June 30, 2023
End-to-End Data Engineering Pipeline (E-Commerce Domain)/PROJECT 01
January 1, 2024 – June 1, 2025
Designed Bronze → Silver → Gold Lakehouse architecture using Azure. Ingested structured and unstructured data from multiple sources including: GitHub (CSV files), REST APIs (HTTP source), MySQL database (https://filess.io), Azure SQL Database. Designed and managed data ingestion pipelines using ADF pipelines with Lookup + ForEach activities, supporting scalable pipeline design. Implemented data transformations and processing in PySpark ensuring data quality, consistency, and reliability. Created curated Silver layer and fact/dimension tables in Gold layer using proper data modeling. Integrated Azure Synapse Analytics with ADLS Gen2 to support reporting and analytics. Version-controlled entire project using GitHub. Collaborated to design and maintain reliable data pipelines, ensuring data consistency and supporting reporting and analytics.
Incremental Loading Data Pipeline Using Watermark Strategy/PROJECT 02
January 1, 2024 – June 1, 2025
Simulated an OLTP database and designed incremental data pipelines events using a watermark-based CDC approach. Extracted only new and updated records to optimize data processing and maintain consistency. Loaded raw incremental data into Bronze layer Performed data cleansing, deduplication, and transformations in Silver layer using PySpark. analytics-ready datasets and fact tables in Gold layer for reporting. Ensured idempotent execution and pipeline reliability across runs. Managed and supported end-to-end pipeline execution, assisting in monitoring and ensuring reliable data processing.
View ProjectAWS Cloud computing certificate
ShapemySkills
June 1, 2026 – Present
Python for Data Science Certificate
IBM
June 1, 2026 – Present
Master Big Data with Hadoop, Spark, Kafka & Cloud | Build Real-World Projects & Scalable Data Pipelines from Scratch
Udemy
June 1, 2026 – Present
The modern JavaScript course for everyone! Master JavaScript with projects, challenges and theory. Many courses in one!
Udemy
June 1, 2026 – Present
Master modern React from beginner to advanced! Next.js, Context API, React Query, Redux, Tailwind, advanced patterns
Udemy
June 1, 2026 – Present
Cultural Fit Analysis
The candidate's academic projects demonstrate a strong alignment with the Data Engineer role, focusing on end-to-end data pipelines and incremental loading. The breadth of skills across various cloud platforms (Azure, AWS, GCP) and big data tools (Hadoop, Spark, PySpark) indicates a willingness to learn and adapt to different technologies. The certifications further support a continuous learning mindset, which is positive for cultural fit in a dynamic environment. However, the lack of professional experience means the diversity of real-world problem-solving scenarios is limited to academic projects.
Soft Skills & Operational Fit
The candidate's project descriptions indicate an ability to collaborate and ensure data consistency and reliability. The academic background and certifications suggest a proactive learning attitude. However, without direct work experience, it's difficult to assess operational fit, stress handling, or team collaboration in a professional setting.