
PhD student in Computer Science at the University of Cambridge working on NLP and Machine Learning.
University of Cambridge
Data Scientist
June 29, 2026 – Present
pico-dataset
December 2, 2024 – March 19, 2025
Scripts used to create the pretokenized-dolma and pretokenized-paloma datasets.
tokenisation-bias
August 28, 2024 – August 25, 2025
tokenisation-bias — GitHub repository
memorisation-profiles
September 26, 2023 – March 25, 2025
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
awesome-hallucination-detection
September 15, 2023 – Present
List of papers on hallucination detection in LLMs.
efficient-dialogue-state-tracking-by-sequential-information-processing
July 5, 2023 – February 21, 2024
efficient-dialogue-state-tracking-by-sequential-information-processing — GitHub repository
anchoral
January 23, 2023 – April 15, 2025
This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets".
lightning-quick-start
April 4, 2022 – October 4, 2024
lightning-quick-start — GitHub repository
energizer
December 16, 2021 – May 4, 2024
An active learning library for PyTorch based on Lightning-Fabric.
quica
November 5, 2020 – November 9, 2020
quica is a tool for running inter-coder agreement pipelines easily and effectively. Multiple measures are computed and the results are collected in a single table that can be easily exported to LaTeX.
Cultural Fit Analysis
The candidate's projects are predominantly personal and research-oriented, focusing on academic contributions (e.g., ACL and NAACL papers). While this demonstrates strong individual drive and technical depth, there is limited evidence of experience in diverse team environments or on projects with direct business impact, which might affect cultural fit in a fast-paced, product-driven organization. The only professional experience listed is the current position, which is also academic.
Soft Skills & Operational Fit
Insufficient data to assess soft skills and operational fit. The candidate's project descriptions are concise and technically focused, but there is no information regarding collaboration, communication style, or problem-solving approaches in a team setting.