ALI RAZA

AI Engineer

https://talent.gravityer.com/ali-raza-5785193

AI Engineer with less than a year in Machine Learning & Data Science

Key Strengths

Strong foundational knowledge in Machine Learning and Data Science, demonstrated through numerous projects.
Proficiency in Python and key ML/DS libraries (Scikit-learn, TensorFlow, XGBoost, Pandas, NumPy, Matplotlib, Seaborn, Plotly).
Experience with various ML paradigms: collaborative filtering, NLP (TF-IDF, Sentence Transformers), clustering (K-Means), classification (Logistic Regression, Random Forest, XGBoost), and regression (Linear Regression, Random Forest Regressor).
Ability to build end-to-end ML pipelines, including data cleaning, feature engineering, model training, evaluation, and deployment (Flask).
Exposure to Deep Learning and Generative AI concepts, though practical application details are limited.
Experience with recommender systems, which is highly relevant for AI Engineer roles.

Cultural & Operational Fit

Cultural Fit Analysis

The candidate's extensive portfolio of personal projects, including several Kaggle-style challenges, indicates a strong passion for AI/ML and a self-starter mentality. This aligns well with a culture that values initiative and continuous learning. The diversity of projects (recommendation systems, NLP, customer analytics, prediction models) shows a broad interest in applying AI to different domains. However, the lack of team-based projects or professional experience makes it difficult to assess collaboration and adaptability in a corporate environment.

Soft Skills & Operational Fit

The candidate demonstrates a proactive and self-driven attitude through a large number of personal and academic projects. The detailed descriptions indicate an ability to articulate technical processes and outcomes. However, without direct work experience or psychometric test results, it's difficult to assess stress handling, team collaboration, or broader operational fit.

AI is analyzing your overall score…

Identifying your key strengths…

Evaluating your skill match against the job requirements…

Assessing your cultural and operational fit

About

Software Engineering graduate with hands-on experience in machine learning, data science, and full-stack web development. Built 14+ ML projects on Kaggle covering recommender systems, NLP, clustering, classification, and regression on real-world datasets with up to 10,000+ records. Skilled in building end-to-end pipelines from data cleaning and feature engineering through model training, evaluation, and deployment. Seeking a Master's degree in AI / Data Science to deepen expertise in intelligent systems and advanced machine learning.

Top Skills

PythonDeep LearningLangchainNlp

Projects

Book Recommendation System (Final Year Project)

June 1, 2024 – June 1, 2025

Built a full-stack web app that recommends books based on similar reader preferences using collaborative filtering. Designed a homepage displaying the top 50 popular books with title, author, rating, and cover images. Added a real-time search feature where users type a book name and instantly get similar suggestions. Cleaned and preprocessed a dataset of books, users, and ratings by removing missing values and duplicates. Started with a popularity-based model (minimum 250 ratings filter) before moving to collaborative filtering for better personalization. Trained the recommendation model on the processed dataset, achieving approximately 97% accuracy. Deployed the complete application online as a live web service using Flask.

AI Tool Recommender System

August 1, 2023 – May 1, 2024

Built a dual recommender system on a dataset of 10,000+ real AI tools that takes natural language queries and returns the most relevant tools. Implemented two approaches side by side: TF-IDF with Cosine Similarity for fast keyword matching and Sentence Transformers (all-MiniLM-L6-v2) for deeper semantic understanding. Engineered features like Context_Window_K, Has_API, and combined_text fields to improve NLP matching quality. Added user filters for Pricing Model, Primary Domain, and API Availability to refine recommendations. Achieved 99.70% accuracy with Logistic Regression and 99.90% with Random Forest on domain classification. Cached Sentence Transformer embeddings after first run to make subsequent searches instant.

Customer360: Segmentation, Churn & CLV Prediction

January 1, 2023 – July 1, 2023

Built an end-to-end customer analytics pipeline covering EDA, segmentation, churn prediction, and lifetime value estimation. Created RFM (Recency, Frequency, Monetary) features to describe customer behavior for better model inputs. Applied K-Means clustering with Elbow Method and 3D interactive Plotly scatterplots for meaningful customer grouping. Trained Logistic Regression for churn prediction (classification) and Random Forest Regressor for CLV prediction (regression). Included train vs test evaluation metrics to detect overfitting, making results production-ready.

LinguisticTimePredictor

November 1, 2022 – January 1, 2023

Deep learning regression model for predicting time-related linguistic features from text.

Player Market Value Prediction

September 1, 2022 – November 1, 2022

Regression model predicting football player market values from performance stats.

Football Match Outcome Prediction

September 1, 2022 – January 1, 2023

Built a model that predicts whether the home team will win, lose, or draw based on football match statistics. Performed data cleaning (removed nulls, handled missing referees) and engineered features like goal difference, total goals, and season extraction. Compared Logistic Regression, Random Forest, and XGBoost with automatic model selection for the best-performing algorithm. Achieved approximately 99% accuracy with evaluation via classification reports and confusion matrices.

GDP Growth Modeling

July 1, 2022 – September 1, 2022

Compared Logistic Regression, Random Forest, and XGBoost for economic growth prediction.

Car Value Depreciation Model

May 1, 2022 – September 1, 2022

Built a model predicting how much a car's price decreases over time based on year, fuel type, kilometers driven, and condition. Compared three models: Linear Regression (R² ≈ 95%), Decision Tree (R² ≈ 99.5%), and Random Forest (R² ≈ 99.8%). Selected Random Forest Regressor as the final model for best accuracy and lowest error. Performed cross-validation to confirm model stability, consistently showing >99% R² score across folds.

DataCamp Learning Path Recommender

May 1, 2022 – July 1, 2022

Content-based recommender system with 80% accuracy for personalized learning paths.

LuxuryDrive AI: BMW Sales Predictor

March 1, 2022 – May 1, 2022

Sales prediction model achieving 90% accuracy on BMW vehicle data.

Student Learning Profile Clustering

January 1, 2022 – May 1, 2022

Analyzed a student academic performance dataset to identify hidden patterns using unsupervised learning. Applied K-Means clustering to group students into meaningful categories based on study behavior and past performance. Performed data preprocessing, feature selection, and cluster validation to ensure robust groupings.

Stress Risk Profiling

January 1, 2022 – March 1, 2022

Clustering model to categorize individuals into stress risk profiles based on health indicators.

Award-Winning Disney Titles Prediction

November 1, 2021 – March 1, 2022

Built a model predicting whether a Disney title is award-winning using metadata like genre, runtime, IMDb ratings, language, and release year. Engineered new predictive features from runtime, rating, and genre data to improve model performance. Trained ensemble models (Random Forest and XGBoost), achieving 82% overall accuracy and 94% recall. Optimized for recall to minimize missed predictions of award-winning content.

Key Strengths

Strong foundational knowledge in Machine Learning and Data Science, demonstrated through numerous projects.
Proficiency in Python and key ML/DS libraries (Scikit-learn, TensorFlow, XGBoost, Pandas, NumPy, Matplotlib, Seaborn, Plotly).
Experience with various ML paradigms: collaborative filtering, NLP (TF-IDF, Sentence Transformers), clustering (K-Means), classification (Logistic Regression, Random Forest, XGBoost), and regression (Linear Regression, Random Forest Regressor).
Ability to build end-to-end ML pipelines, including data cleaning, feature engineering, model training, evaluation, and deployment (Flask).
Exposure to Deep Learning and Generative AI concepts, though practical application details are limited.
Experience with recommender systems, which is highly relevant for AI Engineer roles.

Cultural & Operational Fit

Cultural Fit Analysis

Soft Skills & Operational Fit

ALI RAZA

Key Strengths

Cultural & Operational Fit

About

Top Skills

Skills

Education

Projects

Key Strengths

Cultural & Operational Fit