Data Scientist with 1+ years in Data Analytics & Machine Learning
Results-driven and ethical data professional with expertise in data analytics, data science, and software engineering. Skilled in building end-to-end data solutions using machine learning, Python, Power BI, and SQL, from data pre-processing and feature engineering to model development and deployment. Experienced in CI/CD workflows and scalable backend systems. AWS Cloud Practitioner certified with hands-on cloud experience. Strong focus on delivering reliable, production-ready data solutions. Continuously advancing skills through real-world projects and certifications.
University of Eastern Africa, Baraton
Bachelor of Science · Software Engineering
September 1, 2020 – August 31, 2024
Bunyore Girls' High School
Kenya Certificate of Secondary Education
January 1, 2016 – December 31, 2019
MFS Technologies Limited
Junior Data Scientist
September 1, 2025 – Present
India
Nairobi County Governor's Delivery Unit, Presidential Digital Talent Programme (PDTP)
Data Analytics and AI Intern
January 1, 2025 – August 31, 2025
India
Integrated Software Systems Limited (iSOFT Systems)
Software Development Intern
May 1, 2023 – August 31, 2023
India
WorldQuant University Applied Data Science Projects - Housing Price Analysis in Buenos Aires and Mexico
June 28, 2026 – Present
Cleaned and transformed housing datasets (Buenos Aires, Mexico) for analysis and modelling. Explored variable relationships through correlation analysis and visualizations. Built ML pipelines with feature encoding and imputation, improving model readiness and predictive performance.
Credit Risk Modelling (PD, LGD, EAD) – End-to-End Risk Analytics Project
June 28, 2026 – Present
Built an end-to-end credit risk modelling pipeline in Python to estimate PD, LGD, and EAD, aligned with industry-standard risk frameworks. Transformed and engineered loan data using encoding, scaling, and feature construction techniques to enhance risk separation between borrower segments. Developed and validated logistic regression (PD) and regression (LGD/EAD) models in Scikit-learn, optimizing performance using AUC and error-based metrics. Quantified portfolio exposure by calculating Expected Loss (EL), converting model outputs into financial risk insights. Translated model results into practical credit strategies across risk-based pricing, approval decisions, and loss mitigation.
Cybershujaa Data Science and AI Program Projects
June 28, 2026 – Present
Built regression and classification models, improving predictive performance through feature engineering and tuning. Developed transformer-based NLP solutions enabling automated text classification and analysis. Implemented RAG-based Generative AI applications for context-aware responses using structured retrieval pipelines. Deployed ML models via Streamlit, enabling real-time predictions and improving solution accessibility. Transformed raw datasets (Titanic, Netflix, scraped data) into actionable insights through EDA and interactive dashboards (Power BI, Tableau).
Power BI Dashboard Solutions – Sales Performance and Inspection Reporting Projects
June 28, 2026 – Present
Delivered end-to-end BI dashboards, improving KPI visibility and operational decision-making. Automated data transformations, reducing manual processing time by 70%. Optimized data models and visuals for faster report loading and a better user experience. Published secure, real-time reports with role-based access, scheduled refresh, and governance best practices.
Customers and Orders API
June 28, 2026 – Present
Built and deployed a production-ready REST API to manage customer and order workflows with secure authentication. Integrated SMS notifications via Africa's Talking API to enhance user interaction. Implemented CI/CD pipelines to automate testing and improve deployment reliability.
Train Travel Management System | Final Year Project
June 28, 2026 – Present
Developed a full-stack booking system with role-based access and secure transactions. Integrated M-Pesa (Daraja API) and email notifications for automated payments and communication. Implemented RESTful APIs and admin dashboards to track bookings and user activity trends. Added sentiment analysis to capture and evaluate user feedback.
Applied Data Science
WorldQuant University
June 1, 2026 – Present
Data & AI
Cybershujaa
January 1, 2025 – Present
AWS Cloud Practitioner
Ajira
January 1, 2024 – Present
Cultural Fit Analysis
The candidate's academic and personal projects demonstrate strong initiative and a continuous learning mindset, which aligns well with a growth-oriented culture. The diversity of projects, from credit risk modeling to NLP and full-stack development, shows broad interests and adaptability. The certifications (WorldQuant, Cybershujaa, AWS) further emphasize a commitment to skill development. The target role 'Data Scientist' aligns well with the candidate's demonstrated skills and project focus.
Soft Skills & Operational Fit
The candidate's project descriptions and experience highlight collaboration with cross-functional teams, attention to data quality, and a focus on delivering reliable solutions. These indicate a good operational fit and an understanding of teamwork. The descriptions also suggest an ability to communicate insights effectively through dashboards and reports.